The Data Catalog profiler engine runs data profiling operations as a pipeline on data located in multiple data lakes. These profilers create metadata annotations that summarize the content and shape characteristics of the data assets.
|Cluster Sensitivity Profiler||A sensitive data profiler- PII, PCI, HIPAA, etc.|
|Ranger Audit Profiler||A Ranger audit log summarizer.|
|Hive Column Profiler||Provides summary statistics like Maximum, Minimum, Mean, Unique, and Null values at the Hive column level.|