Wrong join order

Adjusting query join order or join type can improve Impala performance.

By: Manish Maheshwari, Data Architect and Data Scientist at Cloudera, Inc.

Description and cause:

A query can also run slowly because it uses an incorrect join order or an incorrect join type.

Solution:

  • To identify join type and join order errors, compare the actual number of rows (#Rows) to the estimated number of rows (Est. #Rows) for each operator in the query profile. If the actual and estimated row counts differ significantly, the selected join order and join type might be incorrect.
  • Ensure that statistics are collected on all tables and columns in the query (for example, with the COMPUTE STATS statement) so that Impala can automatically select the correct join order and join type.
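
The row-count comparison described above can be sketched in a few lines of Python. This is a minimal illustration, not part of Impala: the names OperatorRows, misestimate_factor, and flag_misestimates are hypothetical, the 10x threshold is an arbitrary assumption, and the sample values are invented stand-ins for the #Rows and Est. #Rows columns you would copy from a real query profile.

```python
from dataclasses import dataclass


@dataclass
class OperatorRows:
    """One operator line copied by hand from a query profile (hypothetical type)."""
    operator: str     # operator label, e.g. "02:HASH JOIN"
    actual_rows: int  # value from the #Rows column
    est_rows: int     # value from the Est. #Rows column


def misestimate_factor(actual_rows: int, est_rows: int) -> float:
    """Return how many times larger the bigger row count is than the smaller one."""
    # Clamp both counts to at least 1 so a zero count cannot cause division by zero.
    larger = max(actual_rows, est_rows, 1)
    smaller = max(min(actual_rows, est_rows), 1)
    return larger / smaller


def flag_misestimates(rows, threshold=10.0):
    """List the operators whose actual and estimated rows differ by >= threshold times."""
    # The 10x default is an assumed cut-off for "significantly", not an Impala rule.
    return [
        r.operator
        for r in rows
        if misestimate_factor(r.actual_rows, r.est_rows) >= threshold
    ]


# Invented sample values for illustration only.
summary = [
    OperatorRows("02:HASH JOIN", 5_000_000, 1_000),
    OperatorRows("01:SCAN HDFS", 1_200, 1_000),
    OperatorRows("00:SCAN HDFS", 5_000_000, 5_100_000),
]

print(flag_misestimates(summary))  # prints ['02:HASH JOIN']
```

An operator flagged this way is a reasonable place to start checking whether the tables feeding it have up-to-date statistics.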