Managing Hive ACID table replication policies

After you create a replication policy, you can run the replication job, disable or delete the job, edit the policy configuration, or view the replication job history in Cloudera Manager.

  1. Go to the Cloudera Manager > Replication > Replication Policies page.
    The following replication policy details appear on the page:
    Columns       Description
    ID            Automatically generated replication policy ID.
    Name          Name of the replication policy.
    Type          Shows Hive ACID for Hive ACID replication policies.
    Source        Source cluster used in the replication policy.
    Destination   Target cluster used in the replication policy.
    Progress      Shows a spinner when the replication policy job is running.
    Completed     Timestamp when the replication job is submitted to the Hive service.
    Next Run*     Shows Managed by Hive message. Hover over the message to see more information about the next scheduled run.
    Message       Shows the status of the replication job.

    The following job states of the replication job run appear depending on the replication job status:

    • Waiting for Update appears after the replication policy creation is complete and remains until the job status is confirmed by the Hive service.
    • Running appears when the replication job is in progress.
    • Failed appears after the replication job fails.
    • Skipped appears when the replication job is skipped.
    • Success appears after the replication job completes successfully.
    *When you schedule and submit a Hive ACID replication policy, the Next Run field shows the None scheduled message on the Replication Policies page. The date and time of the next run do not appear, even when a run is scheduled. You can ignore the None scheduled message because the replication job runs on Hive as scheduled or as specified in the schedule clause. The schedules are managed by Hive; Cloudera Manager does not initiate the scheduled runs.
  2. Select the required replication policy.
  3. Click Actions to view the following action items:
    1. Show History opens the Replication History page where you can view the replication policy job history.
      On this page, you can view the replication policy name, the replication policy type, the chosen source and destination clusters for the policy, and the next scheduled run.

      The page also shows the following statistics for each replication policy job:

      • Start Time of a replication policy job.
      • Duration or time taken to complete the job.
      • Outcome, or current status, of the job.
      • Origin of collected Hive metrics. Click SOURCE or TARGET in the field to view the metrics for the replication job.
      • Tables, shown as the total number of tables to be replicated and the number of tables replicated successfully.
      • Functions, a count that is incremented whenever a function is processed during dump and load operations.
      • Events, a count that is incremented for every event dumped during a dump operation and every event loaded during a load operation. The counts for the dump and load operations might not match because they are distinct operations.
    2. Edit Configuration allows you to edit the schedule of the replication policy.
    3. Run Now runs the replication job.
    4. Disable the selected replication job.
    5. Start Failover initiates the replication policy failover process. For more information, see Switching to disaster recovery cluster using Replication Manager.
    6. Delete the selected replication job.
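
If you track replication runs outside Cloudera Manager, for example in a report built from values you copy off the Replication History page, the job states and statistics described above can be modeled as a small data structure. The following Python sketch is illustrative only: it does not call any Cloudera Manager or Hive API, and the names HistoryRow and needs_attention, the field names, and the sample values are all assumptions made for this example.

```python
from dataclasses import dataclass
from enum import Enum


class JobState(Enum):
    """Job states listed for the Replication Policies page."""
    WAITING_FOR_UPDATE = "Waiting for Update"
    RUNNING = "Running"
    FAILED = "Failed"
    SKIPPED = "Skipped"
    SUCCESS = "Success"


@dataclass
class HistoryRow:
    """Hypothetical record of one row copied from the Replication History page."""
    start_time: str          # Start Time of the job
    duration_seconds: int    # Duration of the job
    outcome: JobState        # Outcome of the job
    tables_total: int        # tables to be replicated
    tables_replicated: int   # tables replicated successfully


def needs_attention(row: HistoryRow) -> bool:
    """Flag failed runs and successful runs that replicated fewer tables than expected."""
    if row.outcome is JobState.FAILED:
        return True
    return row.outcome is JobState.SUCCESS and row.tables_replicated < row.tables_total


# Made-up sample values, not output from a real cluster.
rows = [
    HistoryRow("2024-01-01T00:00:00", 120, JobState.SUCCESS, 10, 10),
    HistoryRow("2024-01-01T01:00:00", 95, JobState.FAILED, 10, 4),
    HistoryRow("2024-01-01T02:00:00", 0, JobState.SKIPPED, 10, 0),
]
flagged = [row.start_time for row in rows if needs_attention(row)]
print(flagged)
```

Skipped runs are deliberately not flagged in this sketch; whether a skipped run deserves follow-up depends on your own monitoring policy.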