Mandatory CDP policy-level properties

You need to configure mandatory Hive policies on CDP before you load workload replicated data. You learn what value to set for each property, whether or not you can specify property configuration options in the REPL LOAD command, and if the property is modifiable with an alter query in the REPL LOAD command.

The following table lists the properties you must set on the CDP Private Cloud Base cluster before loading your replicated data onto CDP. The Modifiable column indicates whether or not you can modify the property with an alter query in the REPL LOAD command.

Table 1. Mandatory Policy-level Properties
Property Description Required Value Modifiable
hive.repl.rootdir Staging location path. Enter the same path for REPL DUMP and REPL LOAD.

[***HDFS path***]

Note: Use the same property value in REPL DUMP and REPL LOAD.

No

hive.repl.include.external

.tables

Includes external tables for replication.

true No

hive.repl.dump.metadata

.only.for.external.table

Includes only external table metadata for replication.

false yes

hive.repl.replica.external.

table.base.dir

Fully qualified base directory on the target warehouse to store external tables. The directory path is prefixed to the source external table path on the target cluster. Enter the same path for REPL DUMP and REPL LOAD commands.

[***HDFS path***]

Note: Use the same property value in REPL DUMP and REPL LOAD commands.

yes

hive.repl.ha.datapath.re

place.remote.nameservi

ce

Set to true when the following are true:

- HDFS is HA-enabled

- Both the source and target clusters are configured with the same nameservice name.

See Description before you configure this option.

Yes

hive.repl.ha.datapath.re

place.remote.nameservi

ce.name

Provides a reference to the nameservice on HDP when HDFS is HA-enabled and both HDP and CDP clusters are configured with the same nameservice name.

[**remote nameservice

name**]

Note: Use the same parameter value in REPL DUMP and REPL LOAD commands.

Yes

The property values of the nameservice.name and remote.nameservice must be different. For example, if the clusters use the nameservice name ns, then use a different property value. For example, ‘hive.repl.ha.datapath.replac e.remote.nameservice.name’ = ‘nsRemote’.