Data Engineering

Instructions and examples for Apache Spark and Apache Zeppelin in Cloudera.

Instructions for configuring Apache Spark in Cloudera.

Instructions for upgrading from older versions of Apache Spark on Cloudera 7.3.1.

Instructions and examples for creating Apache Spark applications to run on Cloudera.

Instructions and examples for running Apache Spark applications on Cloudera.

Recommendations for optimizing the performance of Apache Spark and Spark applications on Cloudera.

Use Schema Registry to manage Spark schemas and to serialize and deserialize messages.

Use Spark to interact with Apache Iceberg tables.