Model Governance
To capture and view centralized information about your ML projects, models, and builds
in Apache Atlas (Data Catalog) for a specific environment, governance must be enabled.
Enabling model governance You must enable governance to capture and view information about your ML projects, models, and builds centrally from Apache Atlas (Data Catalog) for a given environment. If you do not select this option while provisioning Cloudera AI Workbenches , then integration with Atlas will not work.ML Governance Requirements You must ensure that the following requirements are satisfied in order to enable ML Governance on Private Cloud.Registering training data lineage using a linking file The Cloudera AI projects, model builds, model deployments, and associated metadata are tracked in Apache Atlas, which is available in the environment's SDX cluster. You can also specify additional metadata to be tracked for a given model build. For example, you can specify metadata that links training data to a project through a special file called the linking file (lineage.yaml
).Viewing lineage for a model deployment in Atlas You can view the lineage information for a particular model deployment and trace it back to the specific data that was used to train the model through the Atlas' Management Console.