Auto Tagging workflow with Custom Cluster Sensitivity Profiler Rules

You can automatically tag Hive columns by creating custom rules for the Cluster Sensitivity Profiler.

Use the following steps to create a custom tag rule and assign it to the Custom Sensitivity Profiler.
  1. Go to Data Catalog > Profilers and select the Tag Rules tab.
  2. Click + New to open the Custom Rule window.
  3. Under the Resources pane, click + to open the Regular Expression Editor window.
  4. Enter a name and a regular expression value that matches the test string.
    For example, if your regular expression value is [a-z][a-z][a-z][a-z] and the test string is “baby”, there is a match.
  5. Click Save.
  6. On the Custom Rule window, enter the name and description.
  7. Enter the tags value and select the Column Expression Name from the drop-down.
    You must select the same regular expression that you created under the Resources pane.
  8. Enter the tags value and select the Column Value Expression from the drop-down.
    You must select the same regular expression that you created under the Resources pane.
  9. Click Save & Validate.
    The Data For Validation window appears.
  10. Enter sample values to validate whether the Column Expression Name and Column Value Expression entities match.
    Make sure that the correct data lake is selected for validating the entries.
  11. Click Submit Validation.
    The validation status of the newly created regular expression is displayed on the Tag Rules tab. After the validation succeeds, you can deploy the rule.
  12. Click Done.
    On the Rule Groups pane, verify that the rule is available under the Custom Deployed list. You can also suspend the tag by selecting it from the list.

    When the Cluster Sensitivity Profiler job or the On-Demand Profiler picks up the Hive asset for profiling, the newly created custom tag is applied to the Hive column, provided that the asset has one or more columns that meet the custom rule criteria.
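
Before entering a regular expression in the Regular Expression Editor, it can help to try it against a few sample strings locally. The following is a minimal sketch in Python that uses the pattern from step 4. It assumes that a value must match the pattern as a whole; the profiler's own matching behavior may differ, and the function name and sample values are hypothetical and not part of Data Catalog.

```python
import re

# Pattern taken from the example in step 4: four lowercase letters.
PATTERN = re.compile(r"[a-z][a-z][a-z][a-z]")


def matches(value: str) -> bool:
    """Return True if the entire string matches the pattern.

    Assumption: whole-string matching. The profiler's actual matching
    semantics are not described here and may differ.
    """
    return PATTERN.fullmatch(value) is not None


# Hypothetical sample values, similar to what you might enter
# in the Data For Validation window.
samples = ["baby", "Baby", "bab", "babies"]

for sample in samples:
    print(f"{sample!r}: {'match' if matches(sample) else 'no match'}")
```

Under the whole-string assumption, only “baby” matches. “Baby” contains an uppercase letter, and “bab” and “babies” do not have exactly four characters.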