About Cloudera Data Catalog
Cloudera Data Catalog is a service within Cloudera that enables you to understand, manage, secure, and govern data assets across the enterprise.
-
Organize and curate data globally:
-
- Organize data based on business classifications, purpose, protections needed, etc. For more information, see:
- Promote responsible collaboration across enterprise data workers. For more information, see Collaborate with other users.
-
-
Understand where relevant data is located:
-
- Catalog and search to locate relevant data of interest (sensitive data, commonly used, high risk data, etc.).
- Understand what types of sensitive personal data exists and where it is located
- For more information, see:
- VM-based environments:The Cluster Sensitivity Profiler
- Compute cluster enabled environments: The Data Compliance Profiler
-
-
Understand how data is interpreted for use:
-
-
View basic descriptions: schema, classifications (business cataloging), and encodings
-
View statistical models and parameters
-
View user annotations, wrangling scripts, view definitions etc.
-
- For more information, see Viewing Data Asset details.
- For more information, see Navigation in Asset Details.
- VM-based environments: The Hive Column Profiler / Compute cluster enabled environments: The Statistics Collector Profiler
-
-
Understand how data is created and modified:
-
-
Visualize upstream lineage and downstream impact
-
Understand how schema or data evolve
-
View and understand data supply chain (pipelines, versioning, and evolution)
-
- For more information, see Navigation support for hive entities within Lineage.
-
-
Understand how data access is secured, protected, and audited:
-
-
Understand who can see which data and metadata (for example, based on business classifications) and under what conditions (security policies, data protection, anonymization)
-
View who has accessed what data from a forensic audit or compliance perspective
-
Visualize access patterns and identify anomalies
-
- For more information, see
- VM-based environments: The Ranger Audit Profiler
- Compute cluster enabled environments: The Activity Profiler
- Viewing Ranger access audits
- Viewing Atlas entity audits
- Viewing Ranger policies
-
