Running Distributed ML Workloads on YARN
Cloudera Data Science Workbench 1.6 (and higher) allows you to run distributed machine learning workloads on the CDH/HDP cluster with frameworks such as TensorFlowOnSpark, H2O, XGBoost, and so on. This is similar to what you can already do with Spark workloads that run on the attached CDH/HDP cluster.
CDSW_IP_ADDRESSand the host's IP address is stored in
The information in these environmental variables can be used to make services running in the engine available to services running in the CDH cluster.