Recommended configurations
The following configuration settings are recommended for the best performance with Impala.
- Set the --use_local_tz_for_unix_timestamp_conversions startup flag and the --convert_legacy_hive_parquet_utc_timestamps startup flag both to true. Setting these startup flags to true ensures that timestamps match between Hive and Impala. See TIMESTAMP Data Type for more details.
- Always set the --idle_session_timeout and --idle_query_timeout timeouts for the Impala daemon (impalad). Ensure that the value of idle_session_timeout is less than the timeout configured for your load balancer. See Setting the Idle Query and Idle Session Timeouts for impalad for details.
- Set the --fe_service_threads startup option for the Impala daemon (impalad) to 256. This option specifies the maximum number of concurrent client connections allowed. See Startup Options for impalad Daemon for details.
- Increase the --num_metadata_loading_threads startup option to 64 to improve metadata loading performance. See Configuring Impala Startup Options through Cloudera Manager for more information.
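
The recommendations above can be collected into a small helper that builds the flag strings and checks the one ordering constraint the list states (idle session timeout below the load balancer timeout). This is a minimal sketch, not an official tool: the function name, the timeout values, and the assumption that timeouts are expressed in seconds are illustrative. The list does not say which daemon takes --num_metadata_loading_threads, so that flag is returned separately.

```python
def recommended_flags(idle_session_timeout_s, idle_query_timeout_s, lb_timeout_s):
    """Build the recommended startup flags (hypothetical helper).

    All three arguments are caller-supplied timeouts, assumed to be in seconds.
    lb_timeout_s is the idle timeout configured on the load balancer.
    """
    # The session timeout must be strictly lower than the load balancer timeout.
    if idle_session_timeout_s >= lb_timeout_s:
        raise ValueError(
            "idle_session_timeout must be less than the load balancer timeout"
        )

    impalad_flags = [
        "--use_local_tz_for_unix_timestamp_conversions=true",
        "--convert_legacy_hive_parquet_utc_timestamps=true",
        f"--idle_session_timeout={idle_session_timeout_s}",
        f"--idle_query_timeout={idle_query_timeout_s}",
        "--fe_service_threads=256",
    ]
    # Kept apart because the target daemon is not named in the list above.
    metadata_flags = ["--num_metadata_loading_threads=64"]
    return impalad_flags, metadata_flags


if __name__ == "__main__":
    # Example values only; choose timeouts that suit your workload.
    impalad, metadata = recommended_flags(
        idle_session_timeout_s=1800,
        idle_query_timeout_s=600,
        lb_timeout_s=3600,
    )
    for flag in impalad + metadata:
        print(flag)
```

Failing fast on the timeout comparison catches a misconfiguration before the flags are applied, instead of leaving it to show up later as a connection problem at the load balancer.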
