DLM Installation and Upgrade

Prerequisites

Before you begin the installation process, verify the following:

  • Ensure that Hive is set up with the Ranger authoriser and that hive.server2.enable.doAs is set to false.

  • Ensure that the HDP clusters involved in replication have symmetric configurations, which means the following:

    • If security is enabled, it should be enabled on both the source and the target clusters.

    • If HA is enabled, it should be enabled on both the source and the target clusters.
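The symmetry requirement above can be expressed as a simple comparison. The following is a minimal sketch, not a DLM tool: the `ClusterSettings` type and the `symmetry_problems` function are hypothetical names introduced for illustration, and the two boolean values are assumed to have been collected by hand from each cluster.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class ClusterSettings:
    """Hypothetical summary of the two settings that must match."""
    name: str
    security_enabled: bool
    ha_enabled: bool


def symmetry_problems(source: ClusterSettings, target: ClusterSettings) -> List[str]:
    """Return one message per setting that differs between the two clusters."""
    problems = []
    if source.security_enabled != target.security_enabled:
        problems.append("security is enabled on only one cluster")
    if source.ha_enabled != target.ha_enabled:
        problems.append("HA is enabled on only one cluster")
    return problems


# Example values are made up: HA differs, so one problem is reported.
source = ClusterSettings("source", security_enabled=True, ha_enabled=True)
target = ClusterSettings("target", security_enabled=True, ha_enabled=False)
print(symmetry_problems(source, target))
```

An empty list means the pair satisfies both symmetry conditions listed above.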

  • You must have root access to the nodes on which the DLM App and DLM Engine will be installed.
  • Ensure that the required services (Knox, Ranger, HDFS, YARN, and Hive) are installed.

  • Ensure that one of the following external databases is installed: MySQL or Postgres.

    See the Hortonworks Support Matrix for the compatible versions of DataPlane Platform, HDP and DLM.

  • Knox SSO
    DPS Platform and DLM use Knox SSO to give users and services simplified, consistent access to clusters, data, and other services. You must configure Knox SSO on the HDP clusters that you plan to use with DLM.
    Note

    Knox SSO on your cluster must be configured to use the same LDAP/AD as your DP instance so that user identities match and propagate between the systems.

    Refer to the following documentation on how to configure your cluster for Knox SSO:
    Resource                          | Documentation
    Install Knox and enable in Ambari | HDP Security Guide, Install Knox
    Configure SSO topology            | HDP Security Guide, Configuring an Identity Provider (IdP)
    Configure Knox SSO for Ambari     | HDP Security Guide, Setting up Knox SSO for Ambari
    Configure LDAP with Ambari        | Ambari Security Guide, Configuring Ambari Authentication with LDAP or Active Directory Authentication
  • Perform the DataPlane Platform pre-installation tasks. For more information, see Prepare your clusters.
  • Install or upgrade to a supported version of Ambari. See the Support Matrix for the supported Ambari versions, and the Apache Ambari installation documentation for installation details.
  • Install or upgrade to a supported version of HDP on your cluster using Ambari. See the DLM Support Matrix for the supported HDP versions, and the HDP installation documentation for installation details.
  • Set up LDAP Authentication for Ambari
  • Set up Knox SSO with LDAP Authentication for Ambari
  • Ranger
    Ranger enables you to create services for specific Hadoop resources (HDFS, HBase, Hive and so on) and add access policies to those services. If you use Ranger for authorisation in your cluster for LDAP users:
  • Knox Gateway

    Configuring Knox Gateway is required if your cluster is configured with Kerberos or with wire encryption. This simplifies certificate management for DPS and cross-cluster communication, as the only security certificate that needs to be managed is for Knox.

    Refer to the following documentation on how to configure your cluster for Knox Gateway:

    Resource                                          | Documentation
    Configure a reverse proxy with Knox               | HDP Security Guide, Configuring the Knox Gateway
    Configure LDAP with Knox for proxy authentication | HDP Security Guide, Setting Up LDAP Authentication
  • Hive

    You must configure Hive with the Ranger authoriser and set hive.server2.enable.doAs=false. For more information, see Authorization using Apache Ranger Policies.
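As an illustration of the Hive requirement, the sketch below parses a Hadoop-style configuration file and reports which of the two conditions is not met. It rests on assumptions that should be confirmed against your HDP release: that the authoriser is selected through the `hive.security.authorization.manager` property, and that the Ranger factory class has the name shown. The helper names `read_properties` and `hive_prerequisite_problems` and the sample XML are hypothetical.

```python
import xml.etree.ElementTree as ET

# Assumed Ranger authoriser factory class name; confirm for your HDP release.
RANGER_AUTHORIZER = (
    "org.apache.ranger.authorization.hive.authorizer.RangerHiveAuthorizerFactory"
)


def read_properties(site_xml: str) -> dict:
    """Parse <property><name/><value/></property> entries into a dict."""
    root = ET.fromstring(site_xml)
    props = {}
    for prop in root.findall("property"):
        name = prop.findtext("name")
        if name:
            props[name.strip()] = (prop.findtext("value") or "").strip()
    return props


def hive_prerequisite_problems(props: dict) -> list:
    """Return a message for each Hive prerequisite that is not satisfied."""
    problems = []
    if props.get("hive.server2.enable.doAs", "").lower() != "false":
        problems.append("hive.server2.enable.doAs must be false")
    # Assumption: this property selects the authoriser implementation.
    if props.get("hive.security.authorization.manager") != RANGER_AUTHORIZER:
        problems.append("Ranger authoriser is not configured")
    return problems


# Made-up sample: doAs is still true and no authoriser is set.
SAMPLE = """<configuration>
  <property>
    <name>hive.server2.enable.doAs</name>
    <value>true</value>
  </property>
</configuration>"""

print(hive_prerequisite_problems(read_properties(SAMPLE)))
```

In an Ambari-managed cluster these values are normally set through the Ambari UI, so a check like this is only a quick way to read back an exported configuration.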

  • YARN

    DLM runs the replication jobs using YARN. For on-premise to on-premise replication, the replication job runs on the target cluster. For on-premise to cloud replication, the replication job runs on the source cluster. Make sure YARN is installed on the cluster where the replication job runs.
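The placement rule described above can be summarised in a few lines. This is a sketch for planning purposes only: the function name `replication_job_cluster` and the string labels are hypothetical and do not correspond to a DLM API.

```python
def replication_job_cluster(destination: str) -> str:
    """Return which cluster runs the replication job and therefore needs YARN.

    destination is a hypothetical label: "on-premise" or "cloud".
    The source is assumed to be an on-premise cluster in both cases.
    """
    if destination == "on-premise":
        # On-premise to on-premise: the job runs on the target cluster.
        return "target"
    if destination == "cloud":
        # On-premise to cloud: the job runs on the source cluster.
        return "source"
    raise ValueError(f"unknown destination: {destination!r}")


for dest in ("on-premise", "cloud"):
    print(dest, "->", replication_job_cluster(dest))
```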
