Ports Used By Cloudera Data Science Workbench
Cloudera Data Science Workbench runs on gateway nodes in a CDH cluster. As such, Cloudera Data Science Workbench acts as a gateway and requires full connectivity to CDH services (Impala, Spark 2, etc.) running on the cluster. Additionally, in the case of Spark 2, CDH cluster nodes will require access to the Spark driver running on a set of random ports (20050-32767) on Cloudera Data Science Workbench nodes.
Internally, the Cloudera Data Science Workbench master and worker nodes require full connectivity with no firewalls. Externally, end users connect to Cloudera Data Science Workbench exclusively through a web server running on the master node, and therefore do not need direct access to any other internal Cloudera Data Science Workbench or CDH services.
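As an illustration of the Spark 2 requirement above, the following minimal Python sketch checks whether a set of allowed inbound port ranges covers the whole Spark driver range, 20050-32767. The function name `covers` and the list-of-tuples representation of firewall rules are assumptions made for this example. They are not part of Cloudera Data Science Workbench or any firewall tool.

```python
from typing import Iterable, Tuple

# Spark driver port range on Cloudera Data Science Workbench nodes (inclusive),
# as stated in this document.
SPARK_DRIVER_PORTS = (20050, 32767)


def covers(
    allowed: Iterable[Tuple[int, int]],
    required: Tuple[int, int] = SPARK_DRIVER_PORTS,
) -> bool:
    """Return True if the union of the inclusive `allowed` ranges
    contains every port in the inclusive `required` range.

    `allowed` is a hypothetical representation of firewall allow rules,
    for example [(20050, 32767)].
    """
    low, high = required
    next_needed = low  # lowest required port not yet known to be allowed
    for start, end in sorted(allowed):
        if start > next_needed:
            break  # gap: next_needed is not covered by any range
        if end >= next_needed:
            next_needed = end + 1
        if next_needed > high:
            return True
    return next_needed > high
```

For example, `covers([(20050, 30000)])` returns `False` because ports 30001-32767 are not allowed, while `covers([(20000, 25000), (25001, 40000)])` returns `True`.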
Components | Details |
---|---|
Communication with the CDH cluster (CDH -> Cloudera Data Science Workbench) | The CDH cluster must have access to the Spark driver that runs on Cloudera Data Science Workbench nodes, on a set of randomized ports in the range 20050-32767. |
Communication with the CDH cluster (Cloudera Data Science Workbench -> CDH) | As a gateway service, Cloudera Data Science Workbench must have access to all the ports used by CDH and Cloudera Manager. |
Communication with the web browser | The Cloudera Data Science Workbench web application is available at port 80. HTTPS access is available over port 443. |
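To confirm from an end-user machine that the web ports listed above are reachable, a basic TCP connection test may be enough. The sketch below uses only the Python standard library. The helper names `port_is_open` and `check_web_ports` and the hostname `cdsw.example.com` are placeholders chosen for this example. A successful connection shows only that the port accepts TCP connections, not that the web application itself is healthy.

```python
import socket

# Web ports listed in this document: HTTP on 80, HTTPS on 443.
WEB_PORTS = {"http": 80, "https": 443}


def port_is_open(host: str, port: int, timeout: float = 3.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within `timeout` seconds."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and name resolution failures.
        return False


def check_web_ports(host: str) -> dict:
    """Map each web port name to whether it is reachable on `host`."""
    return {name: port_is_open(host, port) for name, port in WEB_PORTS.items()}


# Example (hypothetical hostname for the master node):
# check_web_ports("cdsw.example.com")
```

The result is a dictionary keyed by `"http"` and `"https"`, so a missing HTTPS listener shows up as `False` under `"https"` without affecting the HTTP check.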