Impala workload management table format

Learn about the available columns in the query history and live query system tables.

Table Format

The following columns are available as part of the query history and live query system tables:

Column Name Description Data Type Sample Value
cluster_id String specified through the Impala startup flag to uniquely identify an instance. string cluster-123
query_id Impala assigned query identifier. string 214d08bef0831e7a:3c65392400000000
session_id Impala assigned session identifier. string ea4f661af43993d8:587839553a41adb8
session_type Client session type. string HIVESERVER2
hiveserver2_protocol_version Version of the HiveServer (HS2) protocol that was used by the client when connecting. string HIVE_CLI_SERVICE_PROTOCOL_V6
db_user Effective user on the cluster. string csso_name
db_user_connection Username from an authenticated client. string csso_name
db_name Name of the database being queried. string default
impala_coordinator Name of the coordinator for the query. string coord-22899:27000
query_status Status of the query when it completes. string OK
query_state Final state of the query. string FINISHED
impala_query_end_state Final Impala state of the query. string FINISHED
query_type Type of the query. string QUERY
network_address Client IP address and port. string 127.0.0.1:40120
start_time_utc Time when the query started. Time zone is in UTC. timestamp 2024-07-17 17:13:46.414316000
total_time_ms Difference between the query end time and start time, in milliseconds (digits after the decimal point represent milliseconds). decimal(18,3) 136.121
query_opts_config List of query options stored as a single string containing comma-separated values of key-value pairs. string TIMEZONE=America/Los_Angeles,CLIENT_IDENTIFIER=Impala Shell v4.4.0a1 (04bdb4d) built on Mon Nov 20 10:49:35 PST 2023
resource_pool Name of the resource pool for the query. string default-pool
per_host_mem_estimate Size, in bytes of the per-host memory estimate. bigint 5
dedicated_coord_mem_estimate Size, in bytes of the dedicated coordinator memory estimate. bigint 4
per_host_fragment_instances Comma-separated string listing each host and its fragment instances. string myhost-1:27000=1,myhost-2:27001=2
backends_count Count of the number of backends used by this query. integer 2
admission_result Result of the admission (not applicable to DDLs). string Admitted immediately
cluster_memory_admitted Cluster memory, in bytes that was admitted. integer 4
executor_group Name of the executor group. string executor_group
executor_groups List of all executor groups including the groups that were considered and rejected as part of Workload Aware Auto Scaling. string executor_group1, executor_group2…
exec_summary Full text of the executor summary. string
num_rows_fetched Number of rows fetched by the query. bigint 6001215
row_materialization_rows_per_sec Count of the number of rows materialized per second. bigint 3780
row_materialization_time_ms Time spent materializing rows converted to milliseconds. decimal(18,3) 1.58
compressed_bytes_spilled Count of bytes that were written (or spilled) to scratch disk space. bigint 241515
event_planning_finished Event from the timeline.The value represents the number of milliseconds since the query was received. decimal(18,3) 27.253
event_submit_for_admission Event from the timeline.The value represents the number of milliseconds since the query was received. decimal(18,3) 30.204
event_completed_admission Event from the timeline.The value represents the number of milliseconds since the query was received. decimal(18,3) 30.986
event_all_backends_started Event from the timeline.The value represents the number of milliseconds since the query was received. decimal(18,3) 31.969
event_rows_available Event from the timeline.The value represents the number of milliseconds since the query was received. decimal(18,3) 31.969
event_first_row_fetched Event from the timeline.The value represents the number of milliseconds since the query was received. decimal(18,3) 135.175
event_last_row_fetched Event from the timeline.The value represents the number of milliseconds since the query was received. decimal(18,3) 135.181
event_unregister_query Event from the timeline.The value represents the number of milliseconds since the query was received. decimal(18,3) 141.435
read_io_wait_total_ms Total read I/O wait time converted to milliseconds. bigint 15.091
read_io_wait_mean_ms Average read I/O wait time across executors converted to milliseconds bigint 35.515
bytes_read_cache_total Total bytes read from the data cache bigint 45823
bytes_read_total Total bytes read bigint 745227
pernode_peak_mem_min Minimum value of all the per-node peak memory usages bigint 5552846
pernode_peak_mem_max Maximum value of all the per-node peak memory usages bigint 5552846
pernode_peak_mem_mean Mean value of all the per-node peak memory usages bigint 5552846
sql SQL statement as provided by the user string SELECT db_user, total_time_ms from impala_query_log where query_state = 'EXCEPTION';
plan Full text of the query plan string
tables_queried Comma-separated string containing all the tables queried in the SQL statement. Aliased tables are resolved to their actual table names. string db.tbl,db.tbl
select_columns Comma-separated string containing all columns from the select list of the sql. Aliased columns are resolved to their actual column names. Each column is in the format database.table.column_name. string db.tbl.col1,db.tbl.col2
where_columns Comma-separated string containing all columns from the where list of the sql. Aliased columns are resolved to their actual column names. Each column is in the format database.table.column_name. string db.tbl.col1,db.tbl.col2
join_columns Comma-separated string containing all columns from the sql used in a join. Aliased columns are resolved to their actual column names. Each column is in the format database.table.column_name. string db.tbl.col1,db.tbl.col2
aggregate_columns Comma-separated string containing all columns from the group by and having lists of the sql. Aliased columns are resolved to their actual column names. Each column is in the format database.table.column_name. string db.tbl.col1,db.tbl.col2
orderby_columns Comma-separated string containing all columns from the order by list of the sql. Aliased columns are resolved to their actual column names. Each column is in the format database.table.column_name. string db.tbl.col1,db.tbl.col2
coordinator_slots Number of query slots used by the query on the coordinator. bigint 1
executor_slots Numberof query slots used by the query on the executors. The value in this column represents the slots used by a single executor, not the total number of slots across all executors. bigint 1