Configuring clusters and replications
You can expand an existing deployment of Streams Replication Manager by adding new clusters and replications to the configuration. To do this, you need to specify cluster aliases and cluster connection information, as well as add and enable replications.
Specifying your clusters and enabling replications does not by itself start data replication. When you add clusters and replications to the configuration using the following method, SRM connects to them and sets up communication, but does not automatically replicate any data. To start replicating data, use the srm-control command line tool to specify which topics to replicate.
Use the following steps as reference when you want to add new clusters or replications to your deployment.
- If you are planning on replicating data to or from a Kafka service running in either a CDH 5.x or 6.x cluster and you are using Sentry for authorization, make sure that the streamsrepmgr user is added to the Kafka Super users property. You can find the Super users property by going to . Do this on all CDH 5.x or 6.x clusters where data replication will happen.
- If you are planning on replicating data to or from a Kafka service running in Runtime 7.x and you are using Ranger for authorization, make sure that the streamsrepmgr user has all required permissions assigned to it in Ranger. Do this on all Runtime 7.x clusters where data replication will happen.
- In Cloudera Manager, select Streams Replication Manager.
- Go to Configuration.
- Specify cluster aliases:
- Specify cluster connection information:
- Add and enable replications:
- Enter a Reason for change, and then click Save Changes to commit the changes.
- Restart Streams Replication Manager.
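As a rough illustration of what the three configuration steps above might contain, the following sketch assumes two clusters with the hypothetical aliases primary and secondary, hypothetical broker hostnames, and an SRM version that accepts MirrorMaker 2 style key=value entries for connection information and replications. The exact property names and input fields in Cloudera Manager may differ between versions, so treat this as an example of the shape of the values rather than a definitive configuration.

```
# Cluster aliases (hypothetical names), comma-separated
primary,secondary

# Cluster connection information (hypothetical hosts and ports)
primary.bootstrap.servers=primary-host-1:9092,primary-host-2:9092
secondary.bootstrap.servers=secondary-host-1:9092,secondary-host-2:9092

# Replications to add and enable, written as source->target
primary->secondary.enabled=true
```

Each replication is directional. For data to flow in both directions between the two clusters in this sketch, a separate secondary->primary entry would also need to be enabled.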
Use the srm-control tool to kick off replication by adding topics or groups to the allowlist.
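A minimal sketch of adding topics to the allowlist is shown below. The cluster aliases and topic names are hypothetical placeholders, and the script only prints the assembled command on hosts where srm-control is not installed. Check the srm-control help output on your cluster for the options your version supports.

```shell
#!/bin/sh
# Hypothetical cluster aliases and topic names; replace with your own.
SOURCE_ALIAS="primary"
TARGET_ALIAS="secondary"
TOPICS="orders,payments"

# Assemble the command that adds the topics to the allowlist
# of the primary->secondary replication.
SRM_CMD="srm-control topics --source ${SOURCE_ALIAS} --target ${TARGET_ALIAS} --add ${TOPICS}"

# Run the command only where srm-control is available; otherwise print it.
if command -v srm-control >/dev/null 2>&1; then
  ${SRM_CMD}
else
  echo "${SRM_CMD}"
fi
```

Consumer groups can be added in a similar way with the groups subcommand in place of topics.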