Data Engineering

Instructions and examples for Apache Spark and Apache Zeppelin in Cloudera Data Platform.

Managing Apache Spark

Configuring Apache Spark

Instructions for configuring Apache Spark in Cloudera Data Platform.

Upgrading Apache Spark

Instructions for upgrading Apache Spark from older versions for Cloudera 7.3.1.

Using Apache Spark

Developing Apache Spark Applications

Instructions and examples for creating Apache Spark applications to run on Cloudera Data Platform.

Running Apache Spark Applications

Instructions and examples for running Apache Spark applications on Cloudera Data Platform.

Tuning Apache Spark

Advice and recommendations for optimizing the performance of Apache Spark and Spark applications for Cloudera Data Platform.

Apache Spark integration with Schema Registry

Leverage Schema Registry for managing Spark schemas and to serialize and/or de-serialize messages.

Using Apache Iceberg with Spark

Use Spark to interact with Apache Iceberg tables.