Configuring the Activity Profiler
In addition to the generic configuration, there are additional parameters for the Activity Profiler that can optionally be edited. You can configure the scheduling and the available resources for your profiler.
- Go to Profilers and select your data lake.
- Go to Profilers > Activity Profiler > Profiler Details > Configuration > All Configurations
-
Select a schedule to run profiler using either UNIX Cron
Expression or the Basic scheduler.
Figure 1. Profiler schedule with cron expression
Figure 2. Profiler schedule with natural language
-
Continue with the resource settings:
-
Set the Maximum number of executors
Indicates the number of processes that are used by the distributed computing framework. The recommended value is at least four executors.
-
Set the Maximum cores per executor
Indicates the maximum number of cores that can be allocated to an executor.
- Set the Executor memory limit in GBs
-
Set the Number of driver cores
Indicates the maximum number of driver cores. Increase the number of cores to improve the speed of profiler job scheduling.
-
Set the Maximum driver memory in GBs
Indicates the maximum amount of memory that can be allocated to an driver core. Increasing the available memory accelerates the profiling of larger and more complex tables and prevents out-of-memory errors.
-
Set the Maximum number of executors
- Click Save to apply the configuration changes to the selected profiler.
