Data Mart clusters

Learn about the default Data Mart and Real-Time Data Mart clusters, including cluster definition and template names, included services, and compatible Runtime version.

Data Mart is a massively parallel processing (MPP) SQL database, powered by Apache Impala, that is designed to support custom Data Mart applications at big data scale. Impala scales to petabytes of data, processes tables with trillions of rows, and lets users store, browse, query, and explore their data interactively.

Data Mart clusters

The Data Mart template provides a ready-to-use, fully capable, standalone deployment of Impala. Once deployed, the cluster can serve as a standalone Data Mart to which users point their BI dashboards through JDBC/ODBC endpoints. Users can also author SQL queries in Hue, Cloudera’s web-based SQL query editor, and execute them with Impala for an interactive SQL/BI experience.
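
As an illustration of what pointing a BI tool at a JDBC endpoint involves, the following minimal sketch assembles a connection URL in the general style of the Cloudera JDBC driver for Impala. The host name, the default port 21050, the database name, and the `AuthMech` and `SSL` property values are assumptions chosen for the example, and `impala_jdbc_url` is a hypothetical helper; the real endpoint, port, and authentication settings should be taken from the deployed cluster.

```python
def impala_jdbc_url(host: str, port: int = 21050, database: str = "default",
                    use_ssl: bool = True) -> str:
    """Build a JDBC URL string for an Impala endpoint (illustrative only).

    Assumptions: AuthMech=3 (user name and password) and the SSL flag are
    example driver properties; a real cluster may require different ones.
    """
    props = {"AuthMech": "3", "SSL": "1" if use_ssl else "0"}
    prop_str = ";".join(f"{key}={value}" for key, value in props.items())
    return f"jdbc:impala://{host}:{port}/{database};{prop_str}"


# Hypothetical host name, used only to show the resulting URL shape.
print(impala_jdbc_url("datamart-master0.example.com"))
```

The resulting string is what would be pasted into the connection settings of a BI dashboard or SQL client that uses the JDBC driver.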

Cluster definition names
  • Data Mart for AWS
  • Data Mart for Azure

Cluster template name
CDP - Data Mart: Apache Impala, Hue
Included services
  • HDFS
  • Hue
  • Impala
Compatible Runtime version
7.1.0, 7.2.0, 7.2.1

Real-Time Data Mart clusters

The Real-Time Data Mart template provides a ready-to-use, fully capable, standalone deployment of Impala. Once deployed, the cluster can serve as a standalone Data Mart to which users point their BI dashboards through JDBC/ODBC endpoints. Users can also author SQL queries in Hue, Cloudera’s web-based SQL query editor, and execute them with Impala for an interactive SQL/BI experience. In addition to Impala and Hue, the Real-Time Data Mart cluster includes Kudu and Spark.

Cluster definition names
  • Real-time Data Mart for AWS
  • Real-time Data Mart for Azure

Cluster template name
CDP - Real-time Data Mart: Apache Impala, Hue, Apache Kudu, Apache Spark
Included services
  • HDFS
  • Hue
  • Impala
  • Kudu
  • Spark
  • YARN
Compatible Runtime version
7.1.0, 7.2.0, 7.2.1, 7.2.2
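
The two templates can be compared side by side by encoding the details listed above as data. The sketch below is only an illustration: the `TEMPLATES` dictionary and the helper functions `templates_for_runtime` and `templates_with_service` are hypothetical names, not part of any Cloudera tool, and the values are copied from this page.

```python
# Template details transcribed from this page; the structure is illustrative.
TEMPLATES = {
    "CDP - Data Mart: Apache Impala, Hue": {
        "services": ["HDFS", "Hue", "Impala"],
        "runtime_versions": ["7.1.0", "7.2.0", "7.2.1"],
    },
    "CDP - Real-time Data Mart: Apache Impala, Hue, Apache Kudu, Apache Spark": {
        "services": ["HDFS", "Hue", "Impala", "Kudu", "Spark", "YARN"],
        "runtime_versions": ["7.1.0", "7.2.0", "7.2.1", "7.2.2"],
    },
}


def templates_for_runtime(version: str) -> list[str]:
    """Return the template names listed as compatible with a Runtime version."""
    return [name for name, info in TEMPLATES.items()
            if version in info["runtime_versions"]]


def templates_with_service(service: str) -> list[str]:
    """Return the template names that include the given service."""
    return [name for name, info in TEMPLATES.items()
            if service in info["services"]]


# Which templates include Kudu, and which are listed for Runtime 7.2.2?
print(templates_with_service("Kudu"))
print(templates_for_runtime("7.2.2"))
```

With the values above, both lookups return only the Real-time Data Mart template, since Kudu and Runtime 7.2.2 are listed for that template alone.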