Data Engineering deployment architecture
Recommendations for scaling Cloudera Data Engineering deployments
Apache Airflow scaling and tuning considerations
General guidelines
Configuring Spark jobs for large shuffle data