Hortonworks Docs
»
Data Lifecycle Manager 1.5.0
»
DLM Administration
DLM Administration
Also available as:
Introduction
Purpose and scope
Audience and assumptions
Replication concepts
Data Lifecycle Manager terminology
Communicating within services in HDP
How Policies Work in Data Lifecycle Manager
Replication policy
UI overview
Cluster Health panel
Policies panel
Jobs panel
Recent Issues panel
Clusters map
Issues & Updates table
Preparing to setup replication policy
Roles required
Infrastructure Admin role
DLM Admin
DLM User
Working with clusters
Add clusters
Cluster pairing
Pairing considerations
Preserving replication timestamps
Cloud credentials
Register cloud credentials
Registering Amazon S3 cloud account
Considerations for Amazon S3
Registering Microsoft WASB cloud account
Considerations for Microsoft WASB
Registering Google cloud account
Considerations for Google Cloud Storage
Data replication use cases
HDP release-specific features
Replication of HDFS data
HDFS on-premise replication
Replication of data on-premise to on-premise in HDFS
HDFS cloud replication
On-premise to Amazon S3 replication in HDFS
Replication of data on-premise to Amazon S3 in HDFS
Amazon S3 to on-premise replication in HDFS
Replication of data from Amazon S3 to on-premise in HDFS
On-premise to Microsoft WASB replication in HDFS
Replication of data on-premise to Microsoft WASB in HDFS
Microsoft WASB to on-premise replication in HDFS
Replication of data from Microsoft WASB to on-premise in HDFS
On-premise to Google Cloud replication in HDFS
Replication of data from on-premise to Google Cloud Storage in HDFS
Google Cloud to on-premise replication in HDFS
Replication of data from Google Cloud Storage to on-premise in HDFS
Replication of HIVE data
Hive replication concepts
Hive tables - Managed and External
ACID tables replication
Bootstrap and incremental replication
Storage-based authorization
Statistics replication
Replication differences between HDP 2.6.5 to 3.x
Hive on-premise replication
Replication of data on-premise to on-premise in Hive
Hive cloud replication
Setting target cluster for cloud storage in Hive
On-premise to Amazon S3 replication in HIVE
Replication of data on-premise to Amazon S3 in Hive
On-premise to Microsoft WASB replication in HIVE
Replication of data on-premise to Microsoft WASB in Hive
On-premise to Google Cloud replication in HIVE
Replication of data on-premise to Google Cloud in HIVE
DLM operations using Command-Line Interface
Commands overview
CLI authentication
dp login
dp logout
dp
dp dlm
dp dlm policy
dp dlm policy create
dlm dp policy validate
dp dlm policy get
dp dlm policy list
dp dlm policy rerun
dp dlm policy abort
dp dlm policy resume
dp dlm policy delete
dp dlm policy suspend
dp dlm policy instance
dp dlm policy instance list
dp dlm policy instance get
dp dlm events
Metadata replication
Ranger metadata
Atlas metadata
Snapshot replication between HDP clusters
Replication policy operations
Monitoring replication
Policies page
Overview page
Notifications page
Viewing replication logs
Tuning replication policy (advanced options)
Suspend data replication
Activate data replication
Update replication policy
Browsing data directory
Tracking replication progress
Cloud credentials operations
Update cloud credentials
Delete credentials
Unregistered credentials
Miscellaneous
Update Cluster Endpoint
Failing Over Manually
Make the destination cluster the new source
Remove the Ranger deny policy
Activate a new destination cluster
DLM version Information
Tuning DLM Engine
Tuning DLM Engine
Troubleshooting DLM
Ranger UI does not display deny policy items
Replication fails with TDE and non-TDE data
ReplChangeManager error
Hive data cannot be replicated
Hive policy suspension
Instance of a policy stuck in a running state
Hive replication failure
About requested events missing in Notification Log table
Hive replication concepts
DLM supports Hive replication.
Hive tables - Managed and External
Managed tables are Hive owned tables where the entire lifecycle of the tables’ data are managed and controlled by Hive. External tables are tables where Hive has loose coupling with the data.
ACID tables replication
Hive managed tables supporting insert, update, delete operations with ACID semantics are called
ACID
tables. ACID tables support only ORC file format.
Bootstrap and incremental replication
DLM allows you to replicate Hive databases from a source cluster to a target location on a destination cluster.
Storage-based authorization
Hive supports
doAs=true
plus storage-based authorization that enables security at Hive Metastore API level.
Statistics replication
Basic statistics such as the number of rows of a table or partition and the column statistics such as histograms (min, max, count) of a particular interesting column are important in many ways.
Replication differences between HDP 2.6.5 to 3.x
In HDP 2.6.5 version, managed tables are managed by Hive.
Hive on-premise replication
Before you can begin replicating data using clusters on Hive, you must make sure that there are at least a couple of clusters that are registered in your DLM App instance. The replication load happens on the target cluster.
Parent topic:
Replication of HIVE data
© 2012–2019, Hortonworks, Inc.
Document licensed under the
Creative Commons Attribution ShareAlike 4.0 License
.
Hortonworks.com
|
Documentation
|
Support
|
Community