Known issues for Cloudera Data Services on premises 1.5.5 SP3
Review the known issues for Cloudera Data Services on premises, the impact or changes to the functionality, and the applicable workaround.
Known issues identified in 1.5.5 SP3 release
- OPSAPS-77943: Cloudera Control Plane installation fails with an error message cdp-embedded-db-0 is not Ready
- Longhorn does not support using a
symlinkfor the default datapath/storagedirectory. The path must be a real directory on the host; symlinked paths can cause disk detection failures, replica scheduling issues, and volumeattach/mountfailures.
- OPSAPS-78075: In Cloudera Manager, selecting Use Default Configuration does not show the input for Administrator credentials
- In the Cloudera Embedded Container Service environment, when the Use Default Configuration checkbox is selected on the Cluster Basic UI, the Configure default login credentials for Control Plane panel does not display in Configure Data Services UI. Without configuring the Administrator credentials Cloudera Embedded Container Service installation for Cloudera Data Services on premises 1.5.5 SP3 release fails.
- OPSX-7794: The docker copy script fails checksum validation on Docker version 29.5.2
- While using the Custom Docker Repository Installation and Upgrade process, selecting the Docker version 27+ for the copy docker script (copy-docker.txt) fails checksum validation. An error message is thrown - does not match for each image it tries to copy.
- OPSAPS-77489: When calling rotateEcsCertificates API command directly, 500 Server error message is thrown
- Invoking the Cloudera Manager API,
/clusters/{clusterName}/services/{serviceName}/commands/rotateEcsCertificatesresults in a 500 server error.
- OPSX-6950: Data Recovery Service based Restore job fails because the services in cert-manager namespace is not created
-
Data Recovery Service Restore job fails due to ClusterIP Allocation Conflict. During a Data Recovery Service restore process, the operation may fail with a Kubernetes error indicating that a Service ClusterIP is already allocated. This occurs when the restore process attempts to recreate a service using a ClusterIP that is currently in use by another existing service in the cluster. A typical error message looks like: Service "cdp-release-cert-manager-cainjector" is invalid: spec.clusterIPs: failed to allocate IP <IP_ADDRESS>: provided IP is already allocated
- OPSX-7767: While performing an Cloudera Embedded Container Service upgrade from Cloudera Data Services on premises 1.5.5 SP2 → 155SP3, the upgrade process may fail during the restart step with a message - failed to reconcile with local datastore: context deadline exceeded
- During an Cloudera Embedded Container Service upgrade from Cloudera Data Services on premises 1.5.5 SP2 to 1.5.5 SP3, the upgrade process
may fail in the restart step with the following error: failed to bootstrap
cluster data: failed to reconcile with local datastore: context deadline
exceeded.
When the ECS_SERVER role is stopped as part of the restart during the upgrade workflow, not all RKE2-related processes are terminated. Some of the orphaned processes continue to run, including:
etcdkube-proxykube-apiserver
These processes are left under containerd static pods. With no active
rke2 serverprocess and ports 2379/6443 still in use, a subsequent Cloudera Manager start ofrke2 serverfails bootstrap reconcile with the error: failed to reconcile with local datastore: context deadline exceeded.
- OPSX-6858: Cloudera Embedded Container Service first run is stuck in loop at install-cp step (mke2fs command fails) on KCloud
-
During a few Cloudera Embedded Container Service installation processes, pods get stuck in the creating state as the associated longhorn mount on those pods fails.
This scenario is observed when longhorn PVC block devices contain stale
filesystem/partitionmetadata from the previous use. This event can be verified by runninglsof /dev/longhorn/pvc-2e2dc23b-82d6-45cd-9348-b40eba0fb4e1.Because of this scenario,
mke2fscommand fails which is required for setting up volume mounts on the pod
