Displaying the Metastore Analytics

Learn how to analyze, identify, and troubleshoot table changes and inefficiencies, including which tables are hot and which tables are cold.

Use the following steps to troubleshoot your tables and their data with the Cloudera Observability On-Premises Metastore Analytics feature.
  1. Verify that you are logged in to the Cloudera Observability On-Premises web UI.
    1. In the URL field of a supported web browser, enter the Cloudera Observability On-Premises URL that you were given by your system administrator and press Enter.
    2. When the Cloudera Observability On-Premises Log in page opens, enter your Cloudera Observability On-Premises user name and password.
    3. Click Log in.
      The Cloudera Observability On-Premises web user interface landing page opens, which by default displays the Analytics Environments page that lists your Workload cluster environments.
    4. From the Environment Name column in the Environments table, select the environment required for analysis.

      The Environments navigation panel opens, which hierarchically lists the environment's cluster, engines, and, if applicable, the Hive Metastore category.

  2. Depending on the environment selected, verify that the Cluster Summary page is displayed for the environment's cluster required for analysis.
  3. To display the top 25 hot tables and the bottom 25 cold tables, do the following:
    1. Locate the Data Temperature chart.
    2. In the Hot Tables chart widget, hover over each table to view information about how often the table was accessed, its volume, and the number of partitions and files it contains.
    3. View more details about a table of interest, such as the hottest table, by clicking on the table's component element.
      The Overview Details side drawer panel opens, which enables you to view more information about the table, such as historical trends, column names, data types, and key-value pair properties. This information can be useful before you process or make changes to a query.
    4. Review the table's metadata from the Details, Columns, and Property tabs.
    5. Close the Overview Details side drawer panel and do the same steps in the Cold Tables chart widget.
  4. To display the top 25 hot tables that were most frequently queried by either the Hive or Impala engine, do the following:
    1. From the cluster's ENGINES, select the Hive or Impala engine of interest.
    2. In the workload engine's Summary page, locate the Data Temperature chart.
    3. In the Hot Tables chart widget, hover over each table to view information about how often the table was accessed, its volume, and the number of partitions and files it contains.
    4. View more details about a table of interest, such as the hottest table, by clicking on the table's component element.
      The Overview Details side drawer panel opens.
    5. Review the table's metadata from the Details, Columns, and Property tabs.
  5. As your tables and data increase, it becomes more difficult to keep track of them. The HMS Tables view lists every table available in your system and provides details about each one, regardless of whether it has been queried.
    To list the details about each table available in your system, do the following:
    1. Expand the HIVE METASTORE category for the cluster of interest.
      One or more metastores are displayed.
    2. Select the metastore of interest.
      The metastore's HMS Summary page opens, displaying information about the current state and activity of all your tables in the selected environment.
    3. To open the HMS Tables view, click the Tables tab.
    4. Optional: Locate specific tables of interest with the filter and sort functions. For example:
      • Sort the tables by their name or by a table's column value, such as the highest number of Partitions.
      • Narrow the list to tables that match a specific value, such as by filtering on Table Type.
      • Locate the tables with a specific number of rows by selecting the Rows filter, entering the minimum and maximum row values that you require, and clicking Apply.
    5. Analyze the details of the tables of interest and look for inconsistencies or issues that may interfere with optimal query performance.
    6. View more details about a specific table by doing the following:
      1. Click the table's name.

        The Overview Details side drawer panel opens.

      2. Review the table's metadata from the Details, Columns, and Property tabs.
      3. Close the Overview Details side drawer panel and analyze another table.
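
The ranking and filtering described in the steps above can be pictured with a small, self-contained Python sketch. It is an illustration only and does not use any Cloudera Observability API: the `TableStats` record, its field names, the helper functions, and the sample values are all hypothetical, and ranking purely by access count is an assumption about how "hot" and "cold" are determined.

```python
from dataclasses import dataclass


@dataclass
class TableStats:
    """Hypothetical per-table record; field names are illustrative only."""
    name: str
    access_count: int  # how often the table was queried (assumed metric)
    rows: int
    partitions: int


def hottest(tables, n=25):
    """Return the n most frequently accessed tables, most accessed first."""
    return sorted(tables, key=lambda t: t.access_count, reverse=True)[:n]


def coldest(tables, n=25):
    """Return the n least frequently accessed tables, least accessed first."""
    return sorted(tables, key=lambda t: t.access_count)[:n]


def filter_by_rows(tables, min_rows, max_rows):
    """Keep tables whose row count lies in the inclusive range."""
    return [t for t in tables if min_rows <= t.rows <= max_rows]


def sort_by_partitions(tables):
    """Order tables by partition count, highest first."""
    return sorted(tables, key=lambda t: t.partitions, reverse=True)


# Made-up sample data, not taken from any real metastore.
SAMPLE_TABLES = [
    TableStats("orders", access_count=950, rows=1_200_000, partitions=365),
    TableStats("clicks", access_count=720, rows=50_000_000, partitions=730),
    TableStats("customers", access_count=40, rows=80_000, partitions=1),
    TableStats("archive_2019", access_count=0, rows=9_000_000, partitions=12),
]

if __name__ == "__main__":
    print("Hot:", [t.name for t in hottest(SAMPLE_TABLES, n=2)])
    print("Cold:", [t.name for t in coldest(SAMPLE_TABLES, n=2)])
    in_range = filter_by_rows(SAMPLE_TABLES, 50_000, 10_000_000)
    print("Rows in range:", [t.name for t in in_range])
    print("Most partitions:", sort_by_partitions(SAMPLE_TABLES)[0].name)
```

In this sketch, a never-queried table such as `archive_2019` still appears in the full list and in the cold ranking, which mirrors the point that the HMS Tables view lists tables regardless of whether they have been queried.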