Restoring Kudu data into the target CDP cluster
Once you have backed up your data in Kudu, you can copy the data to the target CDP cluster and then restore it using the Kudu backup tool.
- If you applied any custom Kudu configurations in your old clusters, then you manually have to apply those configurations in your target cluster.
- Copy the backed up Kudu data to the target CDP cluster.
- While scanning or reading data from Kudu tables using Impala (for example,
through
impala-shell
or Hue) to verify the records in the destination table, remember that the Impala table might point to a Kudu table with a different name, which is defined by thekudu.table_name
property. Meanwhile, backup and restore tools work does not depend on Impala table names, but rather depends on actual Kudu table names.To get the information on thekudu.table_name
property for a table, you can use theSHOW CREATE TABLE
statement inimpala-shell
or Hue:> SHOW CREATE TABLE my_kudu_table; CREATE TABLE my_kudu_table ( id BIGINT, name STRING, PRIMARY KEY(id) ) PARTITION BY HASH PARTITIONS 16 STORED AS KUDU TBLPROPERTIES ( 'kudu.table_name' = 'my_kudu_table_renamed' );