Prerequisites
Before you begin the installation process, verify the following:
- Ensure that Hive is set up with the Ranger authorizer and hive.server2.enable.doAs=false.
- Ensure that the HDP clusters involved in replication have symmetric configurations. This means the following:
  - If security is enabled, it must be enabled on both the source and the target clusters.
  - If HA is enabled, it must be enabled on both the source and the target clusters.
- You must have root access to the nodes on which the DLM App and DLM Engine will be installed.
- Ensure that the required services Knox, Ranger, HDFS, YARN, and Hive are installed.
- Ensure that one of the following external databases is installed: MySQL or Postgres.
See the Hortonworks Support Matrix for the compatible versions of DataPlane Platform, HDP and DLM.
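The symmetric-configuration requirement above can be sketched as a small check. This is only an illustration under stated assumptions: `ClusterSettings` and `symmetry_problems` are hypothetical names, not part of DLM or Ambari, and the flag values are assumed to have been collected by hand or by your own tooling.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class ClusterSettings:
    """Hypothetical summary of one cluster's settings (not a DLM or Ambari type)."""
    name: str
    security_enabled: bool
    ha_enabled: bool


def symmetry_problems(source: ClusterSettings, target: ClusterSettings) -> List[str]:
    """Return one message per setting that is enabled on only one cluster."""
    problems = []
    if source.security_enabled != target.security_enabled:
        problems.append("security is enabled on only one of the two clusters")
    if source.ha_enabled != target.ha_enabled:
        problems.append("HA is enabled on only one of the two clusters")
    return problems


# Example values are made up for illustration.
source = ClusterSettings("source", security_enabled=True, ha_enabled=True)
target = ClusterSettings("target", security_enabled=True, ha_enabled=False)
for problem in symmetry_problems(source, target):
    print(problem)
```

An empty result means the two clusters agree on both settings; any message points at a setting to align before pairing the clusters.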
- Knox SSO
DPS Platform and DLM leverage Knox SSO to provide users and services with simplified and consistent access to clusters, data, and other services. You must configure Knox SSO on the HDP clusters that you plan to use with DLM.
Note
The Knox SSO of your cluster must be configured to use the same LDAP/AD as your DP instance for user identity to match and propagate between the systems.
Refer to the following documentation on how to configure your cluster for Knox SSO:

| Resource | Documentation |
| --- | --- |
| Install Knox and enable in Ambari | HDP Security Guide, Install Knox |
| Configure SSO topology | HDP Security Guide, Configuring an Identity Provider (IdP) |
| Configure Knox SSO for Ambari | HDP Security Guide, Setting up Knox SSO for Ambari |
| Configure LDAP with Ambari | Ambari Security Guide, Configuring Ambari Authentication with LDAP or Active Directory Authentication |

- Perform the DataPlane Platform pre-installation tasks. For more information, see Prepare your clusters.
- Install or upgrade to the supported version of Ambari. See Support Matrix for details of the supported Ambari versions. See Apache Ambari installation for more details.
- Install or upgrade to the supported versions of HDP on your cluster using Ambari. See DLM Support Matrix for details of the supported HDP versions. See the HDP installation documentation for more details.
- Set up LDAP Authentication for Ambari
  - For Ambari 2.6: See Configuring Ambari for LDAP Authentication.
- Set up Knox SSO with LDAP Authentication for Ambari
  - For Ambari 2.6: See Setting up Knox SSO for Ambari.
- Ranger
Ranger enables you to create services for specific Hadoop resources (HDFS, HBase, Hive, and so on) and add access policies to those services. If you use Ranger for authorization in your cluster for LDAP users:
  - Configure LDAP for Ranger usersync. For more information, see Advanced Usersync Settings.
  - Configure LDAP Hadoop group mapping. For more information, see Setting Up Hadoop Group Mapping.
- Knox Gateway
Configuring Knox Gateway is required if your cluster is configured with Kerberos or with wire encryption. This simplifies certificate management for DPS and cross-cluster communication, as the only security certificate that needs to be managed is for Knox.
Refer to the following documentation on how to configure your cluster for Knox Gateway:

| Resource | Documentation |
| --- | --- |
| Configure a reverse proxy with Knox | HDP Security Guide, Configuring the Knox Gateway |
| Configure LDAP with Knox for proxy authentication | HDP Security Guide, Setting Up LDAP Authentication |

- Hive
You must configure Hive with the Ranger authorizer and set hive.server2.enable.doAs=false. For more information, see Authorization using Apache Ranger Policies and hive.server2.enable.doAs=false.
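As a rough way to confirm the `hive.server2.enable.doAs` setting named above, the following sketch parses configuration text in the standard Hadoop `<configuration>/<property>` layout. The sample XML and the helper names `read_property` and `doas_is_disabled` are assumptions made for illustration; on a real cluster you would read the Hive configuration that Ambari manages instead of the inline sample.

```python
import xml.etree.ElementTree as ET

# Illustrative sample in the Hadoop *-site.xml layout; not taken from a real cluster.
SAMPLE_HIVE_SITE = """
<configuration>
  <property>
    <name>hive.server2.enable.doAs</name>
    <value>false</value>
  </property>
</configuration>
"""


def read_property(site_xml, name):
    """Return the value of the named property, or None if it is absent."""
    root = ET.fromstring(site_xml.strip())
    for prop in root.findall("property"):
        if prop.findtext("name") == name:
            return prop.findtext("value")
    return None


def doas_is_disabled(site_xml):
    """True only when hive.server2.enable.doAs is explicitly set to false."""
    value = read_property(site_xml, "hive.server2.enable.doAs")
    # A missing property is treated as a failed check in this sketch.
    return value is not None and value.strip().lower() == "false"


print(doas_is_disabled(SAMPLE_HIVE_SITE))
```

The check deliberately requires an explicit `false` value, so a configuration that omits the property is reported as not meeting the prerequisite.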
- YARN
DLM runs the replication jobs using YARN. For on-premise to on-premise replication, the replication job runs on the target cluster. For on-premise to cloud replication, the replication job runs on the source cluster. Make sure YARN is installed on the cluster where the replication job runs.
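The YARN placement rule described in the YARN item above (the job runs on the target cluster for on-premise to on-premise replication, and on the source cluster for on-premise to cloud replication) can be written as a small lookup. The function name and the destination labels below are hypothetical and chosen only for this sketch; they are not DLM identifiers.

```python
def replication_job_cluster(destination):
    """Return which cluster runs the replication job for a given destination type."""
    placement = {
        "on-premise": "target",  # on-premise to on-premise replication
        "cloud": "source",       # on-premise to cloud replication
    }
    if destination not in placement:
        raise ValueError("unsupported destination: %r" % (destination,))
    return placement[destination]


for destination in ("on-premise", "cloud"):
    print(destination, "->", replication_job_cluster(destination))
```

Whichever cluster the lookup returns is the one on which YARN must be installed for that replication.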