Learn how to check the current state of data replication.
The MirrorSourceConnector keeps track
of its progress in the source cluster using the Kafka Connect framework. Kafka Connect
allows checking and manipulating the source offsets of the connectors. You can check the
current state of data replication by extracting source offsets and comparing them with
the end offsets of replicated partitions.
These steps use the connect_shell.sh and
kafka_shell.shCloudera Streams Messaging - Kubernetes Operator tools. Ensure that these tools are available
to you. Running kafka_shell.sh is only necessary if your source
Kafka cluster is deployed with Cloudera Streams Messaging - Kubernetes Operator. See Using kafka_shell.sh and Using connect_shell.sh.
Use connect_shell.sh to exec into a Kafka Connect admin pod of
the replicator Kafka Connect cluster.
The kafka-get-offsets.sh tool accepts a regex string as
the topic filter, but does not accept a list of regexes. To specify
multiple regex expressions in a single command (as a single regex
string), chain expressions together with pipes (|).
--topic "test.*|abc.*|zxc.*"
If the source Kafka cluster is a Cloudera Streams Messaging - Kubernetes Operator
Kafka cluster, use kafka_shell.sh to run the
kafka-get-offsets.sh tool
Compare extracted end offsets with the source offsets extracted in Step 2.