Known Issues in MapReduce, Apache Hadoop YARN, and YARN Queue Manager

Learn about the known issues in Mapreduce, YARN and YARN Queue Manager, the impact or changes to the functionality, and the workaround.

Known Issues🔗

CDPD-46685 Nodemanager logs are filled with logs similar to: 2022-11-28 03:42:39,587 WARN org.apache.hadoop.ipc.Client: Address change detected. Old: deh-34631355-niv-master1.e2e-797.dze1-y40r.int.cldr.work/10.114.128.84:8031 New: deh-34631355-niv-master1.e2e-797.dze1-y40r.int.cldr.work/10.114.128.63:8031 2022-11-28 03:43:01,425 WARN org.apache.hadoop.ipc.Client: Address change detected. Old: deh-34631355-niv-master0.e2e-797.dze1-y40r.int.cldr.work/10.114.128.79:8031 New: deh-34631355-niv-master0.e2e-797.dze1-y40r.int.cldr.work/10.114.128.65:8031.: Restart all YARN NodeManagers, they should come up without issues and Cloudera Manager should recognize them as healthy nodes once the status of them is refreshed upon restart.
YARN cannot start if Kerberos principal name is changed: If the Kerberos principal name is changed in Cloudera Manager after launch, YARN will not be able to start. In such case the keytabs can be correctly generated but YARN cannot access ZooKeeper with the new Kerberos principal name and old ACLs.; There are two possible workarounds:

Delete the znode and restart the YARN service.

Use the reset ZK ACLs command. This also sets the znodes below /rmstore/ZKRMStateRoot to world:anyone:cdrwa which is less secure.
Third party applications do not launch if MapReduce framework path is not included in the client configuration: MapReduce application framework is loaded from HDFS instead of being present on the NodeManagers. By default the mapreduce.application.framework.path property is set to the appropriate value, but third party applications with their own configurations will not launch.; Set the mapreduce.application.framework.path property to the appropriate configuration for third party applications.
JobHistory URL mismatch after server relocation: After moving the JobHistory Server to a new host, the URLs listed for the JobHistory Server on the ResourceManager web UI still point to the old JobHistory Server. This affects existing jobs only. New jobs started after the move are not affected.; For any existing jobs that have the incorrect JobHistory Server URL, there is no option other than to allow the jobs to roll off the history over time. For new jobs, make sure that all clients have the updated mapred-site.xml that references the correct JobHistory Server.
CDH-6808: Routable IP address required by ResourceManager: ResourceManager requires routable host:port addresses for yarn.resourcemanager.scheduler.address, and does not support using the wildcard 0.0.0.0 address.; Set the address, in the form host:port, either in the client-side configuration, or on the command line when you submit the job.
CDH-49165: History link in ResourceManager web UI broken for killed Spark applications: When a Spark application is killed, the history link in the ResourceManager web UI does not work.; To view the history for a killed Spark application, see the Spark HistoryServer web UI instead.
CDPD-2936: Application logs are not accessible in WebUI2 or Cloudera Manager: Running Containers Logs from NodeManager local directory cannot be accessed either in Cloudera Manager or in WebUI2 due to log aggregation.; Use the YARN log CLI to access application logs. For example:
yarn logs -applicationId <Application ID>; Apache Issue: YARN-9725
COMPX-1445: Queue Manager operations are failing when Queue Manager is installed separately from YARN: If Queue Manager is not selected during YARN installation, Queue Manager operation are failing. Queue Manager says 0 queues are configured and several failures are present. That is because ZooKeeper configuration store is not enabled.; In Cloudera Manager, select the YARN service.

Click the Configuration tab.

Find the Queue Manager Service property.

Select the Queue Manager service that the YARN service instance depends on.

Click Save Changes.

Restart all services that are marked stale in Cloudera Manager.
COMPX-3329: Autorestart is not enabled for Queue Manager in Data Hub: In a Data Hub cluster, Queue Manager is installed with autorestart disabled. Hence, if Queue Manager goes down, it will not restart automatically.; If Queue Manager goes down in a Data Hub cluster, you must go to the Cloudera Manager Dashboard and restart the Queue Manager service.
COMPX-5817: Queue Manager UI will not be able to present a view of pre-upgrade queue structure. CM Store is not supported and therefore Yarn will not have any of the pre-upgrade queue structure preserved.: When a Data Hub cluster is deleted, all saved configurations are also deleted. All YARN configurations are saved in CM Store and this is yet to be supported in Data Hub and Cloudera Manager. Hence, the YARN queue structure also will be lost when a Data Hub cluster is deleted or upgraded or restored.

Technical Service Bulletins🔗

TSB 2023-641: InvalidClassException while editing queue configurations in YARN Queue Manager UI: Under situations described below, a user may encounter the following error message while editing queue configurations in the Apache Hadoop YARN (YARN) Queue Manager UI:; Failed to update queue configuration; Modify queue operation failed. Queue configuration in Cloudera Manager is inconsistent with YARN. Try restarting YARN.
Knowledge article: For the latest update on this issue see the corresponding Knowledge article: TSB 2022-641: InvalidClassException while editing queue configurations in YARN Queue Manager UI.

Unsupported Features🔗

: The following YARN features are currently not supported in Cloudera Data Platform:

Application Timeline Server (ATSv2 and ATSv1)

Container Resizing

Distributed or Centralized Allocation of Opportunistic Containers

Distributed Scheduling

Docker on YARN (DockerContainerExecutor) on Data Hub clusters

Fair Scheduler

GPU support for Docker

Hadoop Pipes

Native Services

Pluggable Scheduler Configuration

Queue Priority Support

Reservation REST APIs

Resource Estimator Service

Resource Profiles

(non-Zookeeper) ResourceManager State Store

Rolling Log Aggregation

Shared Cache

YARN Federation

Moving jobs between queues

Known Issues in MapReduce, Apache Hadoop YARN, and YARN Queue Manager

Known Issues🔗

Technical Service Bulletins🔗

Unsupported Features🔗

We want your opinion

How can we improve this page?