Hortonworks Docs
»
Data Lifecycle Manager 1.5.1
»
DLM Administration
DLM Administration
Also available as:
Introduction
Purpose and scope
Audience and assumptions
Replication concepts
Data Lifecycle Manager terminology
Communicating within services in HDP
How Policies Work in Data Lifecycle Manager
Replication policy
UI overview
Cluster Health panel
Policies panel
Jobs panel
Recent Issues panel
Clusters map
Issues & Updates table
Preparing to setup replication policy
Roles required
Infrastructure Admin role
DLM Admin
DLM User
Working with clusters
Add clusters
Cluster pairing
Pairing considerations
Preserving replication timestamps
Cloud credentials
Register cloud credentials
Registering Amazon S3 cloud account
Considerations for Amazon S3
Registering Microsoft WASB cloud account
Considerations for Microsoft WASB
Registering Google cloud account
Considerations for Google Cloud Storage
Data replication use cases
HDP release-specific features
Replication of HDFS data
HDFS on-premise replication
Replication of data on-premise to on-premise in HDFS
HDFS cloud replication
On-premise to Amazon S3 replication in HDFS
Replication of data on-premise to Amazon S3 in HDFS
Amazon S3 to on-premise replication in HDFS
Replication of data from Amazon S3 to on-premise in HDFS
On-premise to Microsoft WASB replication in HDFS
Replication of data on-premise to Microsoft WASB in HDFS
Microsoft WASB to on-premise replication in HDFS
Replication of data from Microsoft WASB to on-premise in HDFS
On-premise to Google Cloud replication in HDFS
Replication of data from on-premise to Google Cloud Storage in HDFS
Google Cloud to on-premise replication in HDFS
Replication of data from Google Cloud Storage to on-premise in HDFS
Replication of HIVE data
Hive replication concepts
Hive tables - Managed and External
ACID tables replication
Bootstrap and incremental replication
Storage-based authorization
Statistics replication
Replication differences between HDP 2.6.5 to 3.x
Hive on-premise replication
Replication of data on-premise to on-premise in Hive
Hive cloud replication
Setting target cluster for cloud storage in Hive
On-premise to Amazon S3 replication in HIVE
Replication of data on-premise to Amazon S3 in Hive
On-premise to Microsoft WASB replication in HIVE
Replication of data on-premise to Microsoft WASB in Hive
On-premise to Google Cloud replication in HIVE
Replication of data on-premise to Google Cloud in HIVE
DLM operations using Command-Line Interface
Commands overview
CLI authentication
dp login
dp logout
dp
dp dlm
dp dlm policy
dp dlm policy create
dlm dp policy validate
dp dlm policy get
dp dlm policy list
dp dlm policy rerun
dp dlm policy abort
dp dlm policy resume
dp dlm policy delete
dp dlm policy suspend
dp dlm policy instance
dp dlm policy instance list
dp dlm policy instance get
dp dlm events
Metadata replication
Ranger metadata
Atlas metadata
Snapshot replication between HDP clusters
Replication policy operations
Monitoring replication
Policies page
Overview page
Notifications page
Viewing replication logs
Tuning replication policy (advanced options)
Suspend data replication
Activate data replication
Update replication policy
Browsing data directory
Tracking replication progress
Cloud credentials operations
Update cloud credentials
Delete credentials
Unregistered credentials
Miscellaneous
Update Cluster Endpoint
Failing Over Manually
Make the destination cluster the new source
Remove the Ranger deny policy
Activate a new destination cluster
DLM version Information
Tuning DLM Engine
Troubleshooting DLM
Ranger UI does not display deny policy items
Replication fails with TDE and non-TDE data
ReplChangeManager error
Hive data cannot be replicated
Hive policy suspension
Instance of a policy stuck in a running state
Hive replication failure
About requested events missing in Notification Log table
Replication policy operations
This page provides information about various tasks while running the data replication policy.
Monitoring replication
Ensure that the frequency is set so that a job finishes before the next job starts. Jobs based on the same policy cannot overlap.
Policies page
You can check job status from several places in the DLM UI.
Tuning replication policy (advanced options)
Specify bandwidth per map, in MBps. Each map is restricted to consume only the specified bandwidth. This is not always exact. The map throttles back its bandwidth consumption during a copy in such a way that the net bandwidth used tends towards the specified value.
Suspend data replication
When you create and run the replication policy, during the course of the replication, you can suspend data replication.
Activate data replication
You can activate an already suspended replication policy.
Update replication policy
You can edit some settings in your policies to better align with changing requirements. For example, you might want to change the frequency of a policy depending on the data size and importance of the data being replicated.
Browsing data directory
Any user with access to the DLM UI has the ability to browse, within the DLM UI, the folder structure of any clusters enabled for DLM.
Tracking replication progress
DLM now provides the status of replication progress for each replication policy.
© 2012–2019, Hortonworks, Inc.
Document licensed under the
Creative Commons Attribution ShareAlike 4.0 License
.
Hortonworks.com
|
Documentation
|
Support
|
Community