Prerequisites to access Cloudera Data Catalog

To access the Cloudera Data Catalog, you must have the required credentials.

Follow these instructions to provide the required access to the Cloudera Data Catalog users.

The PowerUser must provide the requisite access to subscribers who plan to use Cloudera Data Catalog as per the requirement.

There are a couple of roles based on which you can use Cloudera Data Catalog:
  • DataCatalogCspRuleManager
  • DataCatalogCspRuleViewer

Using DataCatalogCspRuleManager role, you can create, deploy new Custom Sensitivity Profiler rules, create new Regex, and run validations on newly created rules.

Using DataCatalogCspRuleViewer role, you can list and view the existing Custom Sensitivity Profiler rules.

Additionally, using Cloudera Manager, you must configure Apache Atlas and Apache Ranger services. Use the following instructions to complete the process.

  1. Cloudera Manager > Clusters > Atlas > Configuration.
  2. Enable the Kerberos Authentication for the Apache Atlas service.
  3. In the search bar, look up the following property:conf/atlas-application.properties_role_safety_valve and enter the values for the Atlas service:

    atlas.proxyuser.dpprofiler.hosts=*

    atlas.proxyuser.dpprofiler.users=*

    atlas.proxyuser.dpprofiler.groups=*

  4. For the Apache Ranger configuration updates, go to Cloudera Manager > Clusters > Ranger > Configuration.
  5. In the search bar, look up the following property:conf/ranger-admin-site.xml and enter the values for the Ranger service:

    Name: ranger.proxyuser.dpprofiler.hosts

    Value: *

    Name: ranger.proxyuser.dpprofiler.users

    Value: *

    Name: ranger.proxyuser.dpprofiler.groups

    Value: *

Later, restart Atlas and Ranger services respectively.