Migrating a Hive table to Iceberg
This topic shows how to use a simple ALTER TABLE statement from Hive or Impala to migrate an external Hive table to an Iceberg table, and how to configure table input and output by setting table properties.
In Impala, you can configure the NUM_THREADS_FOR_TABLE_MIGRATION query option to tune the performance of the table migration. The option sets the maximum number of threads used for the migration process; the actual number can also be limited by the number of CPUs. If the option is set to zero, the number of available CPUs on the coordinator node is used as the maximum number of threads. Parallelism is applied to the data files within a partition: partitions are processed one at a time, and multiple threads process the files inside the current partition. If each partition contains only one file, the migration runs sequentially.
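
The threading rules above can be modeled with a short Python sketch. This is only an illustration of the described behavior, not Impala's implementation: the function names `effective_threads`, `migrate_file`, and `migrate_table` are hypothetical, and the sketch assumes that a nonzero setting is capped at the number of available CPUs.

```python
import os
from concurrent.futures import ThreadPoolExecutor


def effective_threads(num_threads_for_table_migration, available_cpus=None):
    """Return the thread cap this sketch assumes the migration would use."""
    cpus = available_cpus if available_cpus is not None else (os.cpu_count() or 1)
    if num_threads_for_table_migration == 0:
        # Zero means: use the number of available CPUs as the maximum.
        return cpus
    # Assumption: a nonzero setting is limited by the CPU count.
    return min(num_threads_for_table_migration, cpus)


def migrate_file(path):
    # Hypothetical stand-in for the per-file migration work.
    return path


def migrate_table(partitions, num_threads, available_cpus=None):
    """Process partitions one at a time, files within a partition in parallel."""
    workers = effective_threads(num_threads, available_cpus)
    migrated = []
    for _partition, files in partitions.items():
        # A new pool per partition keeps partitions strictly sequential.
        with ThreadPoolExecutor(max_workers=workers) as pool:
            migrated.extend(pool.map(migrate_file, files))
    return migrated
```

In this model, a table whose partitions each hold a single file submits only one task per partition, so extra threads stay idle and the work is effectively sequential, which matches the behavior described above.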