Gateway OpenTelemetry Collector recommendation

Properly sizing the number of Gateway OpenTelemetry Collectors is crucial for optimal performance.

Telemetry data is collected from each node in the cluster using OpenTelemetry collectors and must be transmitted to the SaaS Observability solution. If outbound internet access is unavailable for all cluster nodes, specific Gateway nodes must be designated to centrally forward the telemetry data. These Gateway nodes are selected from the existing cluster nodes and must have outbound internet access. The OpenTelemetry collectors on these Gateway nodes will handle the transmission of the data.

High Availability enabled cluster

If you want to set up High Availability (HA) or have already set up HA on your cluster, Cloudera recommends a minimum of two Gateway nodes with OpenTelemetry Collectors, irrespective of the number of nodes.

Scaling recommendation

The table below lists the recommended number of Gateway OpenTelemetry Collectors for different cluster sizes.

Number of Nodes    Count of Gateway OpenTelemetry Collectors
100                2
500                2
1000               2
2000               3
4000               5

The number of Gateway OpenTelemetry Collectors is calculated as 1 + ⌈N/1000⌉, where N is the number of nodes on which the OpenTelemetry Collector is present.
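
The sizing formula can be sketched as a small Python helper. This is an illustrative sketch only: the function name gateway_collector_count and the ha_enabled parameter are hypothetical and not part of any Cloudera or OpenTelemetry tooling. The HA branch simply applies the two-Gateway minimum described above; for any cluster with at least one node the formula already yields two or more, so that branch acts as a safeguard.

```python
def gateway_collector_count(node_count: int, ha_enabled: bool = False) -> int:
    """Return the recommended number of Gateway OpenTelemetry Collectors.

    Implements 1 + ceil(N / 1000), where N is the number of nodes
    running an OpenTelemetry Collector. The name and signature are
    hypothetical and used only for illustration.
    """
    if node_count < 1:
        raise ValueError("node_count must be at least 1")

    # Integer ceiling division avoids floating-point rounding.
    count = 1 + (node_count + 999) // 1000

    # Assumption: HA clusters need at least two Gateway nodes.
    if ha_enabled:
        count = max(count, 2)
    return count


if __name__ == "__main__":
    # Reproduce the rows of the scaling table.
    for nodes in (100, 500, 1000, 2000, 4000):
        print(nodes, gateway_collector_count(nodes))
```

Running the script prints the same values as the scaling table (2, 2, 2, 3, and 5). Note that a cluster size just above a multiple of 1000, such as 1001 nodes, rounds up to the next tier and yields 3.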