About Cloudera Data Catalog
Cloudera Data Catalog is a service within Cloudera that enables you to understand, manage, secure, and govern data assets across the enterprise.
-
Organize and curate data globally:
-
- Organize data based on business classifications, purpose, protections needed, etc. For more information, see:
- Promote responsible collaboration across enterprise data workers. For more information, see Bookmarks overview.
-
-
Understand where relevant data is located:
-
- Catalog and search to locate relevant data of interest (sensitive data, commonly used, high risk data, etc.).
- Understand what types of sensitive personal data exists and where it is located
- For more information, see:
-
-
Understand how data is interpreted for use:
-
-
View basic descriptions: schema, classifications (business cataloging), and encodings
-
View statistical models and parameters
-
View user annotations, wrangling scripts, view definitions etc.
-
- For more information, see Viewing Data Asset details.
- For more information, see Navigation in Asset Details.
- The Hive Column Profiler
-
-
Understand how data is created and modified:
-
-
Visualize upstream lineage and downstream impact
-
Understand how schema or data evolve
-
View and understand data supply chain (pipelines, versioning, and evolution)
-
- For more information, see Navigation support for hive entities within Lineage.
-
-
Understand how data access is secured, protected, and audited:
-
-
Understand who can see which data and metadata (for example, based on business classifications) and under what conditions (security policies, data protection, anonymization)
-
View who has accessed what data from a forensic audit or compliance perspective
-
Visualize access patterns and identify anomalies
-
- For more information, see
-
