Data Analytics Studio Installation
Also available as:
PDF

Prerequisites for Data Analytics Studio Engine

Perform these tasks before installing the Data Analytics Studio Engine on a cluster.

  • Ensure that the clusters are running the latest version of HDP
  • Ensure that the following HDP components are installed and configured:
    • Hive
    • Knox
  • Ensure that you have installed DataPlane before installing DAS or DAS-Lite. For installing DataPlane, see Installing DataPlane.
  • Make sure that you have installed and configured Knox SSO. For more information, see Setting up Knox SSO.

    After you set up Knox SSO, validate that it is configured correctly by signing out of Ambari. Upon signing out from Ambari, you should see the Knox SSO login page, and you should be able to log in using your credentials.

  • Go to Ambari > Services > Hive > CONFIGS > ADVANCED. Make sure that the following Hive configurations are as follows:
    • hive.exec.failure.hooks: org.apache.hadoop.hive.ql.hooks.HiveProtoLoggingHook
    • hive.exec.post.hooks: org.apache.hadoop.hive.ql.hooks.HiveProtoLoggingHook
    • hive.exec.pre.hooks: org.apache.hadoop.hive.ql.hooks.HiveProtoLoggingHook
    • hive.metastore.transactional.event.listeners: org.apache.hive.hcatalog.listener.DbNotificationListener
  • Go to Ambari > Services > Tez > CONFIGS > Custom tez-site. Make sure that the following Tez configuration is as follows:
    • tez.history.logging.service.class: org.apache.tez.dag.history.logging.proto.ProtoHistoryLoggingService
  • To download logs, make sure that the DAS service user has write permission to the /tmp directory. Also make sure that the /tmp directory has sufficient storage space to hold logs from a query for the download logs feature to work.