Data Engineering

Instructions and examples for Apache Spark and Apache Zeppelin in Cloudera.

Managing Apache Spark

Configuring Apache Spark

Instructions for configuring Apache Spark in Cloudera.

Upgrading Apache Spark

Instructions for upgrading Apache Spark from older versions for Cloudera 7.3.1.

Using Apache Spark

Developing Apache Spark Applications

Instructions and examples for creating Apache Spark applications to run on Cloudera.

Running Apache Spark Applications

Instructions and examples for running Apache Spark applications on Cloudera.

Tuning Apache Spark

Advice and recommendations for optimizing the performance of Apache Spark and Spark applications for Cloudera.

Apache Spark integration with Schema Registry

Leverage Schema Registry for managing Spark schemas and to serialize and/or de-serialize messages.

Using Apache Iceberg with Spark

Use Spark to interact with Apache Iceberg tables.