Comparing Hive and Impala queries in Hue

You can compare two queries to see how each performs in terms of speed and cost-effectiveness. Hue compares various aspects of the two queries so that you can identify what changed between their executions and debug performance-related issues across different runs of the same query.

The query comparison report provides a detailed side-by-side comparison of your queries.

For Hive queries, the report includes optimization recommendations for each query, metadata about the queries, the visual explain for each query, the query timeline, the query configuration, Directed Acyclic Graph (DAG) information, DAG flows, DAG swimlanes, DAG counters, and DAG configurations.

For Impala queries, the report includes query details, execution plan details, and aggregated metrics for both queries, along with the variance between the two.
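To make the idea of a variance between two sets of aggregated metrics concrete, the following is a minimal sketch rather than Hue's actual computation. It assumes that variance means the absolute difference and the percent change relative to the first run. The function name `metric_variance` and the metric names (`duration_ms`, `rows_read`, `spilled_bytes`) are hypothetical and are not taken from Hue or Impala.

```python
from typing import Dict, Optional, Tuple


def metric_variance(
    run_a: Dict[str, float], run_b: Dict[str, float]
) -> Dict[str, Tuple[float, Optional[float]]]:
    """Return {metric: (absolute difference, percent change)} for shared metrics."""
    result = {}
    # Only metrics present in both runs can be compared.
    for name in sorted(run_a.keys() & run_b.keys()):
        diff = run_b[name] - run_a[name]
        # Percent change is undefined when the baseline value is zero.
        pct = (diff / run_a[name]) * 100 if run_a[name] != 0 else None
        result[name] = (diff, pct)
    return result


# Hypothetical aggregated metrics for two runs of the same query.
first_run = {"duration_ms": 4000.0, "rows_read": 1000.0, "spilled_bytes": 0.0}
second_run = {"duration_ms": 5000.0, "rows_read": 1000.0, "spilled_bytes": 2048.0}

variance = metric_variance(first_run, second_run)
for name, (diff, pct) in variance.items():
    pct_text = "n/a" if pct is None else f"{pct:+.1f}%"
    print(f"{name}: {diff:+.1f} ({pct_text})")
```

In this sketch, a metric that grows between runs (here the hypothetical duration and spilled bytes) shows a positive difference, which is the kind of signal to look for when reading the variance column in the report.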

  1. Go to the Cloudera Data Warehouse (CDW) web interface and open Hue from your Virtual Warehouse.
  2. Click Jobs from the left assist panel.
    The Job Browser page is displayed.
  3. Go to the Queries tab.
    A list of queries that were run is displayed.
  4. Select the two queries you want to compare and click Compare.
    Query comparison report for Hive queries:
    [Image: Report showing comparison of two Hive queries in Hue.]

    Query comparison report for Impala queries:
    [Image: Report showing comparison of two Impala queries in Hue.]