Impala configuration differences in CDH and CDP

There are some configuration differences related to Impala in CDH and CDP. These differences are due to the changes made in CDP for the optimal interoperability between Hive and Impala for improved user experience. Review the changes before you migrate your Impala workload from CDH to CDP.

Default Value Changes in Configuration Options

Configuration Option Scope Default in CDH 6.x Default in CDP
DEFAULT_FILE_FORMAT Query TEXT PARQUET
hms_event_polling_interval_s Catalogd 0 2
ENABLE_ORC_SCANNER Query TRUE FALSE
use_local_catalog Coordinator / Catalogd false true
catalog_topic_mode Coordinator full minimal

New Configuration Options

Configuration Option Scope Default Value
default_transactional_type Coordinator insert_only
DEFAULT_TRANSACTIONAL_TYPE Query insert_only
disable_hdfs_num_rows_estimate Impalad false
disconnected_session_timeout Coordinator 900
PARQUET_OBJECT_STORE_SPLIT_SIZE Query 256 MB
SPOOL_QUERY_RESULTS Query FALSE
MAX_RESULT_SPOOLING_MEM Query 100 MB
MAX_SPILLED_RESULT_SPOOLING_MEM Query 1 GB
FETCH_ROWS_TIMEOUT_MS Query N/A
DISABLE_HBASE_NUM_ROWS_ESTIMATE Query FALSE
enable_insert_events TRUE