Understanding Data Lake details
To view information about your Data Lake cluster and to access cluster actions, navigate to the Management Console service > Data Lakes.
Each Data Lake cluster is represented by an entry on the Data Lakes page. To get more information about a specific Data Lake cluster, click the tile representing that cluster.
Environment Details
This section includes information related to the CDP environment in which the Data Lake cluster is running:
![](../images/mc-data-lake-overview-environment.png)
Item | Description |
---|---|
Cloud Provider | The logo of the cloud provider where the cluster is running. |
Name | The name of the environment used to create the cluster. |
Credential | The name of the credential used to create the cluster. |
Region | The region in which the cluster is running in the cloud provider infrastructure. |
Availability Zone | The availability zone within the region in which the cluster is running. |
Services
Click a logo in the Services section to open the user interface for the corresponding component running in the Data Lake cluster.
![](../images/mc-data-lake-overview-services.png)
Cloudera Manager Info
The Cloudera Manager Info section provides the following information:
![](../images/mc-data-lake-overview-cm-info.png)
Item | Description |
---|---|
CM URL | Link to the Cloudera Manager web UI. |
CM Version | The Cloudera Manager version that the cluster is currently running. |
Platform Version | The Cloudera Runtime platform version that the cluster is currently running. |
Event History and other tabs
The Data Lake page provides additional details organized in tabs, starting with the Event History tab:
![](../images/mc-data-lake-overview-tabs.png)
Item | Description |
---|---|
Event History | Events logged for the cluster, with the most recent event at the top. The Download option allows you to download the event history to a local file. The events are formatted in JSON and compressed. |
Hardware | Information about your cluster instances: instance names, instance IDs, instance types, their status, fully qualified domain names (FQDNs), and private and public IP addresses. |
Cloud Storage | External storage locations for the databases and files used by Data Lake services, such as the HMS database, the Ranger audit database, and HBase files (storage for Atlas metadata). |
Tags | User-defined tags, listed in the order they were added. |
Endpoints | Endpoints for various cluster services, such as the URL for the Ranger user interface for defining data access policies. |
Recipes | Future home of a list of custom scripts attached to this Data Lake. Each "recipe" lists its name, type, and the host group on which it was executed. |
Attached clusters | The workload clusters using the services of this Data Lake; this information repeats the list of clusters found in the other tabs of this CDP Environment. |
Repository Details | Cloudera Manager and Cloudera Runtime repository information, in more detail than shown in the Cloudera Manager Info section. |
Image Details | Cluster node base image details. |
Network | Names of the network and subnet in which the cluster is running, with links to the related cloud provider console. |
Telemetry | The instance profile and cloud storage location specified during environment setup under Log Storage and Audits for service logs. |
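
Because the Event History download is described as compressed JSON, a short script can make the file easier to inspect offline. The following is a minimal sketch, not a documented format: it assumes the file is gzip-compressed and contains a JSON array of event objects, and the `timestamp` key is a hypothetical field name that you would replace with whatever your downloaded file actually contains.

```python
import gzip
import json
from pathlib import Path


def load_event_history(path):
    """Read a gzip-compressed JSON file and return the parsed content.

    Assumption: the download is gzip-compressed; adjust if your file
    uses a different compression format.
    """
    with gzip.open(Path(path), mode="rt", encoding="utf-8") as handle:
        return json.load(handle)


def newest_first(events, timestamp_key="timestamp"):
    """Sort events so the most recent comes first, as in the Event History tab.

    `timestamp_key` is a hypothetical field name used for illustration.
    """
    return sorted(events, key=lambda event: event[timestamp_key], reverse=True)
```

Calling `newest_first(load_event_history("events.json.gz"))` would then return the events in the same order the tab displays them, assuming the file name and field name match your download.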
Show CLI Command
You can click the Show CLI Command button to review the CLI command used to create the Data Lake, and copy it if you want to create a similar Data Lake through the CDP CLI. Ensure that any Data Lake that you create has a unique name. For more information on the CDP CLI commands to create a Data Lake, review the CLI documentation for Data Lakes.
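
Since a reused command must specify a unique Data Lake name, it can help to check the name against existing Data Lakes before running the copied command. The sketch below is illustrative only: it assumes the CDP CLI is installed and configured, that `cdp datalake list-datalakes` prints JSON with a top-level `datalakes` array, and that each entry carries a `datalakeName` field. Verify these details against the CLI documentation for your version.

```python
import json
import subprocess


def is_name_available(name, listing):
    """Return True if `name` is not used by any Data Lake in `listing`.

    `listing` is the parsed JSON output of the list command; the
    `datalakes` and `datalakeName` keys are assumptions about its shape.
    """
    existing = {entry.get("datalakeName") for entry in listing.get("datalakes", [])}
    return name not in existing


def fetch_datalake_listing():
    """Run the CDP CLI and parse its JSON output (requires a configured CLI)."""
    result = subprocess.run(
        ["cdp", "datalake", "list-datalakes"],
        check=True,
        capture_output=True,
        text=True,
    )
    return json.loads(result.stdout)
```

For example, `is_name_available("my-new-datalake", fetch_datalake_listing())` would indicate whether the hypothetical name `my-new-datalake` is free before you paste it into the copied create command.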