Collecting Data Lake diagnostics

Cloudera Manager can collect system status, configuration details, logs, and other information from a Data Lake cluster into a zip file to send to Cloudera Support.

To help you troubleshoot problems on your Data Lake cluster, Cloudera Manager can collect diagnostic data. You can trigger this collection on demand to help resolve a specific problem, or you can configure Cloudera Manager to send a diagnostic bundle to Cloudera on a regular schedule. Cloudera Support analyzes the diagnostic information to proactively identify problems.

Collecting a diagnostic bundle for a Data Lake

To trigger creation of a diagnostic bundle for a Data Lake, navigate to the Cloudera Manager UI that manages the Data Lake cluster. In the lower section of the left navigation pane, choose Support > Send Diagnostic Data and set the options in the dialog box that opens.

Specifically:

  1. Log in to CDP.
  2. Find and open the environment for the Data Lake.

    In the left navigation pane, click Environments, then search for the appropriate environment or scroll to find it. Click the environment name.

  3. On the environment details page, open the Data Lake tab.
  4. Open Cloudera Manager by clicking its link in the Services section.
  5. If prompted, log in to Cloudera Manager.
  6. At the bottom of the left navigation pane, click Support > Send Diagnostic Data.
  7. Choose whether to send the diagnostic bundle to Cloudera automatically or only to collect the bundle.

    If the Cloudera Manager host does not have an internet connection, you may want to collect the bundle, then move it to a host that has internet access.

  8. If appropriate, enter a Cloudera Support case number.
  9. Select the Data Lake cluster if this instance of Cloudera Manager is managing more than one cluster.
  10. If appropriate, limit the diagnostic collection to a single host, service, or role.

    Open Restrict log and metrics collection and choose the Data Lake host, service, or role for which you want to collect diagnostics.

  11. Choose how to scope the data to collect: by target size or by date range.

    If you choose a target size, set the end time to a few minutes after the event for which you are trying to capture diagnostics. The time range is based on the time zone of the host where Cloudera Manager Server is running.

  12. Add a comment to describe the reason for collecting the diagnostic data.
  13. Start the collection by clicking Collect Diagnostic Data or Collect and Upload Diagnostic Data.

    The Cloudera Manager task dialog appears to track the jobs involved in collecting diagnostics.

  14. When the collection tasks are complete, click Download to save the diagnostic bundle locally.
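
Because the time range in step 11 is interpreted in the time zone of the Cloudera Manager Server host, it can help to work out the end time before you open the dialog. The following is a minimal sketch, not part of Cloudera Manager: the function name bundle_end_time, the five-minute margin, and the example offsets are all assumptions chosen for illustration.

```python
from datetime import datetime, timedelta, timezone, tzinfo


def bundle_end_time(event_time: datetime, cm_host_tz: tzinfo,
                    margin_minutes: int = 5) -> datetime:
    """Return the event time plus a margin, expressed in the CM host's time zone.

    event_time must be timezone-aware so the conversion is unambiguous.
    margin_minutes is an assumed default; "a few minutes" is not defined more precisely.
    """
    if event_time.tzinfo is None:
        raise ValueError("event_time must be timezone-aware")
    return (event_time + timedelta(minutes=margin_minutes)).astimezone(cm_host_tz)


# Hypothetical scenario: the problem was observed at 14:30 on a workstation
# that is one hour ahead of UTC, and the Cloudera Manager host runs on UTC.
observer_tz = timezone(timedelta(hours=1))
event = datetime(2024, 3, 5, 14, 30, tzinfo=observer_tz)

end_time = bundle_end_time(event, timezone.utc)
print(end_time.strftime("%Y-%m-%d %H:%M"))  # prints 2024-03-05 13:35
```

Fixed offsets keep the sketch dependent only on the standard library. For regions with daylight saving time, a zoneinfo.ZoneInfo object can be passed as cm_host_tz instead.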