Check Job Status
If a job takes longer than expected or does not finish successfully, check the following to understand more about where the job stalled or failed:
-
To list running applications by ID from the command line, use
yarn application –list
.
-
To see a description of a resilient distributed dataset (RDD) and its recursive dependencies (useful for understanding how jobs are executed) use
toDebugString()
on the RDD. -
To check the query plan when using the DataFrame API, use
DataFrame#explain()
.