Impala actions that produce Atlas entities

Impala operations that create, update, or delete Hive metadata will affect Atlas entities; operations that only affect data do not show up in Atlas.

The following table lists the Impala actions that produce or update metadata in Atlas.

This Action in Impala... ...Produces metadata for these Atlas entities ...Triggers HMS to produce metadata for these Atlas entities ...Produces metadata for these Atlas relationships
CREATETABLE_AS_SELECT

impala_process,
impala_process_execution,
impala_column_lineage,
hive_db
hive_table_ddl

hive_table,
hive_column(s),
hive_storagedesc,
hive_db
hive_table_ddl

hive_table_db,
hive_table_columns,
hive_table_partitionkeys,
hive_table_storagedesc,
hive_process_process_execution,
hive_process_columnlineage,
hive_table_ddl_queries,
hive_db_ddl_queries

CREATEVIEW

impala_process,
impala_process_execution,
impala_column_lineage,
hive_table_ddl

hive_table,
hive_column(s),
hive_db

hive_table_db,
hive_table_columns,
hive_table_partitionkeys,
hive_process_process_execution,
hive_process_columnlineage,
hive_table_ddl_queries

ALTERVIEW_AS_SELECT

impala_process,
impala_process_execution,
impala_column_lineage,
hive_table_ddl

Updates to:
hive_table,
hive_column(s)

hive_process_process_execution,
hive_process_columnlineage,
hive_table_ddl_queries

INSERT INTO,
INSERT,
OVERWRITE

impala_process,
impala_process_execution

If not already in Atlas, HMS sends metadata for data assets indicated in the query:

hive_table,
hive_column(s),
hive_storagedesc,
hive_db

hive_process_process_execution

Notable actions in Impala that do NOT produce process or process execution entities in Atlas, meaning that no lineage is produced for these operations:

  • LOAD DATA INPATH
  • CREATE TABLE (table metadata produced by HMS)
  • ALTER VIEW (table metadata produced by HMS)
  • SELECT or other queries that don’t produce output