Profiler tag rules

You can use preconfigured tag rules or create new rules based on regular expressions and values in your data to limit the number of assets to be profiled by the Cluster Sensitivity Profiler. When a tag rule is matching your data, the selected Apache Atlas classification (also known as a Cloudera Data Catalog tag) is applied. This way you can save compute resources instead of running the profiler on the whole of your data.

Tag rule types

Tag Rules are categorized based on their type into the following groups:
  • System Deployed: These are built-in rules that cannot be edited. You can only enable or disable them for your data.
  • Custom Deployed: Tag rules that you create, edit and deploy on clusters after validation will appear under this category. Hover your mouse over the tag rules to deploy or suspend them as needed. Click the icon in the Action column to enable your custom tag rules. You can also edit these tag rules.
  • Custom Draft: You can create new tag rules and save them for later validation and deployment on clusters. Such rules appear under this category.

After creating your rule, you have to validate them with test data and, then Deploy them from Custom Draft.