Trino federation connectors

Trino is a distributed SQL query engine designed from the bottom up to be built around the concept of connectors and federation. Trino connectors help you connect to and access data from a variety of remote data sources, expose metadata (exposed within Trino as catalogs), and handle sending or receiving data from the remote source.

You can use Cloudera Data Warehouse to configure a connector for a data source, enabling a Trino Virtual Warehouse to access the data source. You can either create an Optimized connector (tested and certified by Cloudera) or a Community connector.

When you create an optimized connector, a template specific to the selected data source type is provided for you to specify the connector configuration details, such as connector URL and secrets that are required to access the data source. You can also choose community data source types that are powered by open-source Trino, however, these connectors are not directly supported by Cloudera and do not offer default configuration.

Example of a default template for a Cloudera certified Trino connector

Cloudera enables you to configure connectors for the following data sources:

Optimized Connectors

PostgreSQL
MySQL
Snowflake
AWSRedshift
Oracle
Hive
Iceberg
MariaDB
Teradata (Technical Preview)

Community Connectors

Google BigQuery
Apache Druid
Apache Ignite
Apache Kafka
Apache Pinot
Black Hole
Cassandra
Clickhouse
Delta Lake
DuckDB
Elasticsearch
Exasol
Google Sheets
Hudi (Apache Hudi)
Loki
Microsoft SQL Server
MongoDB
OpenSearch
Prometheus
Redis
SingleStore
TPC-DS
TPC-H
Vertica