Configure dbt Cloud Connector for Cloudera Octopai
Learn how to configure dbt Cloud for Cloudera Octopai Data Lineage.
Before you connect dbt Cloud to Cloudera Octopai, verify that the dbt Cloud environment, permissions, and credentials are prepared. The connector uses the dbt Cloud API and a service token to retrieve execution metadata and compiled SQL for lineage extraction.
dbt Cloud plan requirements
The connector requires API access.
You must have one of the following:
- A paid dbt Cloud plan (Starter or Enterprise).
- An active trial of a paid plan.
Git repository requirement
dbt Cloud requires a connected Git repository (for example, GitHub or GitLab). The connector depends on project runs generated from a version-controlled dbt project.
- Extraction scope
-
The connector extracts metadata related to dbt execution activity. The following objects are discovered:
- Jobs
- Environments
- Models
- Job-level SQL retrieval
-
For each job, Cloudera Octopai retrieves the SQL executed during the most recent successful run. Note the following behavior:
- The original model SQL source file is not used.
- Lineage is based on the compiled SQL generated by dbt.
- Lineage parsing scope
-
The connector parses SQL-based models only. Python-based dbt transformations do not generate lineage. Lineage parsing is limited to database engines supported by Cloudera Octopai.
- Expression representation
-
Expressions defined inside dbt models are not represented as expressions in the internal lineage visualization.
- Python models
- Python dbt models are not supported currently:
- Python models are not parsed.
- Column-level lineage is not generated for Python transformations.
- Supported platforms
-
The connector supports dbt lineage extraction when models run on the following databases:
- Snowflake
- PostgreSQL
- Google BigQuery
- Redshift
- Databricks (requires GSP license)
- Spark SQL
- Synapse (via dbvmssql)
- Teradata
- Unsupported platforms
-
Lineage parsing is limited to RDBMS platforms supported by Cloudera Octopai. The following platforms are not supported:
- Starburst
- Microsoft Fabric
- Athena
