Displaying the Metastore Analytics

Learn how to analyze, identify, and troubleshoot table changes and inefficiencies, including which tables are hot and which tables are cold.

Steps for troubleshooting your tables and their data with the Cloudera Observability Metastore Analytics feature.
  1. Verify that you are logged in to the Cloudera Observability web UI and that you selected an environment from the Analytics Environments page.
    1. In a supported browser, log into the Cloudera Data Platform (CDP).
      The CDP web interface landing page opens.
    2. From the Your Enterprise Data Cloud landing page, select the Observability tile.
      The Cloudera Observability landing page opens to the main navigation panel.
    3. From the Cloudera Observability Environments page, select the environment required for analysis.

      The Environment navigation panel opens.

  2. Depending on the environment selected, verify that the Cluster Summary page is displayed for the environment's cluster required for analysis.
    To display the Cluster Summary page for a Data Hub environment type, do one of the following:
    • From the Environment's panel, expand the service's category and locate and select the Data Hub's cluster that is required for analysis.
    • In the Data Services table, drill-down through the service links to locate and select the Data Hub's cluster that is required for analysis.
  3. To display the top 25 hot tables and the bottom 25 cold tables, do the following:
    1. Locate the Data Temperature chart.
    2. In the Hot Tables chart widget, hover over each table to view information about how often the table was accessed, its volume, and the number of partitions and files it contains.
    3. View more details about a table of interest, such as the hottest table, by clicking on the table's component element.
      The Overview Details side drawer panel opens, which enables you to view more information about the table, such as historical trends, column names, data types, and key-value pair properties. This information can be useful before you process or make changes to a query.
    4. Review the table's metadata from the Details, Columns, and Property tabs.
    5. Close the Overview Details side drawer panel and do the same steps in the Cold Tables chart widget.
  4. To display the top 25 hot tables that were most frequently queried by either the Hive or Impala engine, do the following:
    1. From the cluster's ENGINES, select the Hive or Impala engine of interest.
    2. In the workload engine's Summary page, locate the Data Temperature chart.
    3. In the Hot Tables chart widget, hover over each table to view information about how often the table was accessed, its volume, and the number of partitions and files it contains.
    4. View more details about a table of interest, such as the hottest table, by clicking on the table's component element.
      The Overview Details side drawer panel opens.
    5. Review the table's metadata from the Details, Columns, and Property tabs.
  5. As your tables and data increases it becomes more difficult for you keep track of your tables and their data, the HMS Tables view lists your tables and provides details about each table available in your system, regardless of whether they have been queried or not.
    To list the details about each table available in your system, do the following:
    1. Expand the HIVE METASTORE category for the cluster of interest.
      One or multiple metastores are displayed.
    2. Select the metastore of interest.
      The metastore's HMS Summary page opens displaying information about the current state and activity of all your tables in the selected Environment.
    3. To open the HMS Tables view, click the Tables tab.
    4. Optional: Locate specific tables of interest with the filter and sort functions. For example:
      • Sort the tables by their name or by a table's column value, such as the highest number of Partitions.
      • Reduce and locate tables by a specific value, such as filtering by their Table Type.
      • Locate the tables with a specific number of rows by selecting the Rows filter, entering the minimum and maximum row values that you require, and clicking Apply.
    5. Analyze the details of those tables of interest and look for inconsistencies or issues that may interfere with optimal query performance.
    6. View more details about a specific table by doing the following:
      1. Click the table's name.

        The Overview Details side drawer panel opens.

      2. Review the table's metadata from the Details, Columns, and Property tabs.
      3. Close the Overview Details side drawer panel and analyze another table.