Running a Flink job
After developing your application, you can submit your Flink job in YARN per-job, session or application mode. To submit the Flink job, you need to run the Flink client in the command line including security parameters and other configurations with the run command.
Submitting a job means uploading the job’s JAR and related dependencies to the Flink cluster and initiating the job execution.
- Per-job mode
Per-job mode means that you run the Flink job in a dedicated YARN application. In this case each submitted Flink job has its own Flink cluster in YARN, with its own Job Manager and Task Managers. When you run Flink jobs in per-job mode, every job submission creates a new cluster. As the cluster deployment has to be created with every submission, the execution of the job can take up time.
- Session mode
Session mode means that you run multiple Flink jobs in the same YARN sessions. In this case every Flink job shares the cluster, the allocated resources, the Job Manager and Task Managers. When you run Flink jobs in session mode, the submitted jobs are created in one cluster and are long-lived. The execution time is shorter than in per-job mode, however you need to consider that in a session mode a cluster failure affects every Flink job, and recreation from a savepoint can take up time.
- Application mode
Regarding resource management, Application mode is much like Per-job mode. That means every deployed application has its dedicated Flink cluster. The key difference is the
main()
method of the deployed application is executed by the Job Manager rather than the client.
You can set how to run your Flink job with the execution.target
setting in the Flink configuration file. By default,
execution.target
is set to yarn-per-job
, but
you can change it to yarn-application
or
yarn-session
. Alternatively, you can add the corresponding
arguments to the flink run
command when submitting the Flink
job.