Configuration Properties ReferencePDF version

Tez Properties in Cloudera Runtime 7.1.7

Role groups:

Description
The directory where the client configs will be deployed
Related Name
Default Value
/etc/tez
API Name
client_config_root_dir
Required
true
Description
For advanced use only, a string to be inserted into the client configuration for tez-conf/tez-site.xml.
Related Name
Default Value
API Name
tez-conf/tez-site.xml_client_config_safety_valve
Required
false

Description
When set, Cloudera Manager will send alerts when this entity's configuration changes.
Related Name
Default Value
false
API Name
enable_config_alerts
Required
false

Description
The priority level that the client configuration will have in the Alternatives system on the hosts. Higher priority levels will cause Alternatives to prefer this configuration over any others.
Related Name
Default Value
50
API Name
client_config_priority
Required
true

Description
Whether to suppress configuration warnings produced by the CDH Version Validator configuration validator.
Related Name
Default Value
false
API Name
role_config_suppression_cdh_version_validator
Required
true
Description
Whether to suppress configuration warnings produced by the built-in parameter validation for the Deploy Directory parameter.
Related Name
Default Value
false
API Name
role_config_suppression_client_config_root_dir
Required
true
Description
Whether to suppress configuration warnings produced by the built-in parameter validation for the Tez Client Advanced Configuration Snippet (Safety Valve) for tez-conf/tez-site.xml parameter.
Related Name
Default Value
false
API Name
role_config_suppression_tez-conf/tez-site.xml_client_config_safety_valve
Required
true

Description
The group that this service's processes should run as.
Related Name
Default Value
hadoop
API Name
process_groupname
Required
true
Description
The user that this service's processes should run as.
Related Name
Default Value
tez
API Name
process_username
Required
true
Description
For advanced use only, key-value pairs (one on each line) to be inserted into a role's environment. Applies to configurations of all roles in this service except client configuration.
Related Name
Default Value
API Name
TEZ_service_env_safety_valve
Required
false

Description
When set, Cloudera Manager will send alerts when the health of this service reaches the threshold specified by the EventServer setting eventserver_health_events_alert_threshold
Related Name
Default Value
true
API Name
enable_alerts
Required
false
Description
When set, Cloudera Manager will send alerts when this entity's configuration changes.
Related Name
Default Value
false
API Name
enable_config_alerts
Required
false
Description
The configured triggers for this service. This is a JSON-formatted list of triggers. These triggers are evaluated as part as the health system. Every trigger expression is parsed, and if the trigger condition is met, the list of actions provided in the trigger expression is executed. Each trigger has the following fields:
  • triggerName (mandatory) - The name of the trigger. This value must be unique for the specific service.
  • triggerExpression (mandatory) - A tsquery expression representing the trigger.
  • streamThreshold (optional) - The maximum number of streams that can satisfy a condition of a trigger before the condition fires. By default set to 0, and any stream returned causes the condition to fire.
  • enabled (optional) - By default set to 'true'. If set to 'false', the trigger is not evaluated.
  • expressionEditorConfig (optional) - Metadata for the trigger editor. If present, the trigger should only be edited from the Edit Trigger page; editing the trigger here can lead to inconsistencies.
For example, the following JSON formatted trigger fires if there are more than 10 DataNodes with more than 500 file descriptors opened:[{"triggerName": "sample-trigger", "triggerExpression": "IF (SELECT fd_open WHERE roleType = DataNode and last(fd_open) > 500) DO health:bad", "streamThreshold": 10, "enabled": "true"}]See the trigger rules documentation for more details on how to write triggers using tsquery.The JSON format is evolving and may change and, as a result, backward compatibility is not guaranteed between releases.
Related Name
Default Value
[]
API Name
service_triggers
Required
true
Description
For advanced use only, a list of derived configuration properties that will be used by the Service Monitor instead of the default ones.
Related Name
Default Value
API Name
smon_derived_configs_safety_valve
Required
false

Description
The maximum heartbeat interval between the Application Master and RM in milliseconds.
Related Name
tez.am.am-rm.heartbeat.interval-ms.max
Default Value
250 millisecond(s)
API Name
tez.am.am-rm.heartbeat.interval-ms.max
Required
true
Description
The maximum amount of time to hold on to a container if no task can be assigned to it immediately. Only active when reuse is enabled.
Related Name
tez.am.container.idle.release-timeout-max.millis
Default Value
20 second(s)
API Name
tez.am.container.idle.release-timeout-max.millis
Required
true
Description
The minimum amount of time to hold on to a container that is idle. Only active when reuse is enabled.
Related Name
tez.am.container.idle.release-timeout-min.millis
Default Value
10 second(s)
API Name
tez.am.container.idle.release-timeout-min.millis
Required
true
Description
Configuration to specify whether container should be reused.
Related Name
tez.am.container.reuse.enabled
Default Value
true
API Name
tez.am.container.reuse.enabled
Required
true
Description
The amount of time to wait before assigning a container to the next level of locality. NODE > RACK > NON_LOCAL
Related Name
tez.am.container.reuse.locality.delay-allocation-millis
Default Value
250 millisecond(s)
API Name
tez.am.container.reuse.locality.delay-allocation-millis
Required
true
Description
Whether to reuse containers for non-local tasks. Active only if reuse is enabled.
Related Name
tez.am.container.reuse.non-local-fallback.enabled
Default Value
false
API Name
tez.am.container.reuse.non-local-fallback.enabled
Required
true
Description
Whether to reuse containers for rack local tasks. Active only if reuse is enabled.
Related Name
tez.am.container.reuse.rack-fallback.enabled
Default Value
true
API Name
tez.am.container.reuse.rack-fallback.enabled
Required
true
Description
Cluster default Java options for the Tez Application Master process. These will be prepended to the properties specified via tez.am.launch.cmd-opts.
Related Name
tez.am.launch.cluster-default.cmd-opts
Default Value
-server -Djava.net.preferIPv4Stack=true
API Name
tez.am.launch.cluster-default.cmd-opts
Required
true
Description
Java options for the Tez Application Master process. The Xmx value is derived based on tez.am.resource.memory.mb and is 80% of the value by default. Used only if the value is not specified explicitly by the DAG definition.
Related Name
tez.am.launch.cmd-opts
Default Value
-XX:+PrintGCDetails -verbose:gc -XX:+UseNUMA -XX:+UseG1GC -XX:+ResizeTLAB -XX:+HeapDumpOnOutOfMemoryError -XX:HeapDumpPath=/tmp
API Name
tez.am.launch.cmd-opts
Required
true
Description
Additional execution environment entries for tez. This is not an additive property. You must preserve the original value if you want to have access to native libraries. Used only if the value is not specified explicitly by the DAG definition.
Related Name
tez.am.launch.env
Default Value
LD_LIBRARY_PATH=$PARCELS_ROOT/CDH/lib/hadoop/lib/native
API Name
tez.am.launch.env
Required
true
Description
Root Logging level passed to the Tez Application Master.
Related Name
tez.am.log.level
Default Value
INFO
API Name
tez.am.log.level
Required
true
Description
Specifies the total number of time the Application Master will run in case recovery is triggered.
Related Name
tez.am.max.app.attempts
Default Value
2
API Name
tez.am.max.app.attempts
Required
true
Description
The maximum number of allowed task attempt failures on a node before it gets marked as blacklisted.
Related Name
tez.am.maxtaskfailures.per.node
Default Value
10
API Name
tez.am.maxtaskfailures.per.node
Required
true
Description
The amount of memory to be used by the Application Master. Used only if the value is not specified explicitly by the DAG definition.
Related Name
tez.am.resource.memory.mb
Default Value
2 GiB
API Name
tez.am.resource.memory.mb
Required
true
Description
Template to generate the History URL for a particular Tez Application. Template replaces __APPLICATION_ID__ with the actual applicationId and __HISTORY_URL_BASE__ with the value from the tez.tez-ui.history-url.base config property
Related Name
tez.am.tez-ui.history-url.template
Default Value
__HISTORY_URL_BASE__?viewPath=%2F%23%2Ftez-app%2F__APPLICATION_ID__
API Name
tez.am.tez-ui.history-url.template
Required
true
Description
Application Master view ACLs. This allows the specified users/groups to view the status of the Application Master and all DAGs that run within this Appliation Master. Value format: Comma separated list of users, followed by whitespace, followed by a comma separated list of groups.
Related Name
tez.am.view-acls
Default Value
*
API Name
tez.am.view-acls
Required
false
Description
Specify additional classpath information to be used for Tez AM and all containers.
Related Name
tez.cluster.additional.classpath.prefix
Default Value
API Name
tez.cluster.additional.classpath.prefix
Required
false
Description
The number of allowed counters for the executing DAG.
Related Name
tez.counters.max
Default Value
10000
API Name
tez.counters.max
Required
true
Description
The number of allowed counter groups for the executing DAG.
Related Name
tez.counters.max.groups
Default Value
3000
API Name
tez.counters.max.groups
Required
true
Description
Generate debug artifacts such as a text representation of the submitted DAG plan.
Related Name
tez.generate.debug.artifacts
Default Value
false
API Name
tez.generate.debug.artifacts
Required
true
Description
Upper bound on the size (in bytes) of a grouped split, to avoid generating excessively large split.
Related Name
tez.grouping.max-size
Default Value
1 GiB
API Name
tez.grouping.max-size
Required
true
Description
Lower bound on the size (in bytes) of a grouped split, to avoid generating too many splits.
Related Name
tez.grouping.min-size
Default Value
16 MiB
API Name
tez.grouping.min-size
Required
true
Description
The multiplier for available queue capacity when determining number of tasks for a Vertex. 1.7 with 100% queue available implies generating a number of tasks roughly equal to 170% of the available containers on the queue.
Related Name
tez.grouping.split-waves
Default Value
1.7
API Name
tez.grouping.split-waves
Required
true
Description
Directory where proto logger writes the history events, should generally be sys.db database directory.
Related Name
tez.history.logging.proto-base-dir
Default Value
/warehouse/tablespace/managed/hive/sys.db
API Name
tez.history.logging.proto-base-dir
Required
true
Description
DAGs per group.
Related Name
tez.history.logging.timeline-cache-plugin.old-num-dags-per-group
Default Value
5
API Name
tez.history.logging.timeline-cache-plugin.old-num-dags-per-group
Required
true
Description
Whether intermediate data should be compressed or not.
Related Name
tez.runtime.compress
Default Value
true
API Name
tez.runtime.compress
Required
true
Description
The codec to be used if compressing intermediate data. Only applicable if tez.runtime.compress is enabled.
Related Name
tez.runtime.compress.codec
Default Value
org.apache.hadoop.io.compress.SnappyCodec
API Name
tez.runtime.compress.codec
Required
true
Description
Whether to publish configuration information to History logger.
Related Name
tez.runtime.convert.user-payload.to.history-text
Default Value
false
API Name
tez.runtime.convert.user-payload.to.history-text
Required
true
Description
The size of the sort buffer when output needs to be sorted.
Related Name
tez.runtime.io.sort.mb
Default Value
272 MiB
API Name
tez.runtime.io.sort.mb
Required
true
Description
If the shuffle input is on the local host bypass the http fetch and access the files directly.
Related Name
tez.runtime.optimize.local.fetch
Default Value
true
API Name
tez.runtime.optimize.local.fetch
Required
true
Description
Tez runtime pipelined sorter sort threads.
Related Name
tez.runtime.pipelined.sorter.sort.threads
Default Value
2
API Name
tez.runtime.pipelined.sorter.sort.threads
Required
true
Description
Fraction (0-1) of the available memory which can be used to retain shuffled data.
Related Name
tez.runtime.shuffle.fetch.buffer.percent
Default Value
0.6
API Name
tez.runtime.shuffle.fetch.buffer.percent
Required
true
Description
This property determines if the shuffle connection should be kept alive. If not, then the connection needs to be reestablished.
Related Name
tez.runtime.shuffle.keep-alive.enabled
Default Value
true
API Name
tez.runtime.shuffle.keep-alive.enabled
Required
true
Description
This property determines the maximum size of a shuffle segment which can be fetched to memory. Fraction (0-1) of shuffle memory (after applying tez.runtime.shuffle.fetch.buffer.percent).
Related Name
tez.runtime.shuffle.memory.limit.percent
Default Value
0.25
API Name
tez.runtime.shuffle.memory.limit.percent
Required
true
Description
The size of the buffer when output does not require to be sorted.
Related Name
tez.runtime.unordered.output.buffer.size-mb
Default Value
100 MiB
API Name
tez.runtime.unordered.output.buffer.size-mb
Required
true
Description
Time (in seconds) for which the Tez Application Master should wait for a DAG to be submitted before shutting down.
Related Name
tez.session.am.dag.submit.timeout.secs
Default Value
5 minute(s)
API Name
tez.session.am.dag.submit.timeout.secs
Required
true
Description
Time (in seconds) to wait for Application Master to come up when trying to submit a DAG from the client.
Related Name
tez.session.client.timeout.secs
Default Value
-1 second(s)
API Name
tez.session.client.timeout.secs
Required
true
Description
In case of a ScatterGather connection, once this fraction of source tasks have completed, all tasks on the current vertex can be scheduled. Number of tasks ready for scheduling on the current vertex scales linearly between min-fraction and max-fraction.
Related Name
tez.shuffle-vertex-manager.max-src-fraction
Default Value
0.4
API Name
tez.shuffle-vertex-manager.max-src-fraction
Required
true
Description
In case of a ScatterGather connection, the fraction of source tasks which should complete before tasks for the current vertex are schedule.
Related Name
tez.shuffle-vertex-manager.min-src-fraction
Default Value
0.2
API Name
tez.shuffle-vertex-manager.min-src-fraction
Required
true
Description
The staging dir used while submitting DAGs.
Related Name
tez.staging-dir
Default Value
/tmp/$user.name/staging
API Name
tez.staging-dir
Required
true
Description
Time interval at which task counters are sent to the Application Master.
Related Name
tez.task.am.heartbeat.counter.interval-ms.max
Default Value
4 second(s)
API Name
tez.task.am.heartbeat.counter.interval-ms.max
Required
true
Description
Whether to generate counters on a per-edge basis for a Tez DAG. Helpful for in-depth analysis.
Related Name
tez.task.generate.counters.per.io
Default Value
true
API Name
tez.task.generate.counters.per.io
Required
true
Description
The maximum amount of time, in seconds, to wait before a task asks an Application Master for another task.
Related Name
tez.task.get-task.sleep.interval-ms.max
Default Value
200 millisecond(s)
API Name
tez.task.get-task.sleep.interval-ms.max
Required
true
Description
Cluster default Java options for tasks. These will be prepended to the properties specified via tez.task.launch.cmd-opts.
Related Name
tez.task.launch.cluster-default.cmd-opts
Default Value
-server -Djava.net.preferIPv4Stack=true
API Name
tez.task.launch.cluster-default.cmd-opts
Required
true
Description
Java options for tasks. The Xmx value is derived based on tez.task.resource.memory.mb and is 80% of this value by default. Used only if the value is not specified explicitly by the DAG definition.
Related Name
tez.task.launch.cmd-opts
Default Value
-XX:+PrintGCDetails -verbose:gc -XX:+UseNUMA -XX:+UseG1GC -XX:+ResizeTLAB -XX:+HeapDumpOnOutOfMemoryError -XX:HeapDumpPath=/tmp
API Name
tez.task.launch.cmd-opts
Required
true
Description
Additional execution environment entries for tez. This is not an additive property. You must preserve the original value if you want to have access to native libraries. Used only if the value is not specified explicitly by the DAG definition.
Related Name
tez.task.launch.env
Default Value
LD_LIBRARY_PATH=$PARCELS_ROOT/CDH/lib/hadoop/lib/native
API Name
tez.task.launch.env
Required
true
Description
Maximum number of events to fetch from the Application Master by the tasks in a single heartbeat.
Related Name
tez.task.max-events-per-heartbeat
Default Value
500
API Name
tez.task.max-events-per-heartbeat
Required
true
Description
The amount of memory to be used by launched tasks. Used only if the value is not specified explicitly by the DAG definition.
Related Name
tez.task.resource.memory.mb
Default Value
1536 MiB
API Name
tez.task.resource.memory.mb
Required
true
Description
The base of the Tez UI URL.
Related Name
tez.tez-ui.history-url.base
Default Value
API Name
tez.tez-ui.history-url.base
Required
false
Description
This being true implies that the deployment is relying on hadoop jars being available on the cluster on all nodes.
Related Name
tez.use.cluster.hadoop-libs
Default Value
false
API Name
tez.use.cluster.hadoop-libs
Required
true
Description
Timeline service version we're currently using.
Related Name
yarn.timeline-service.enabled
Default Value
false
API Name
yarn.timeline-service.enabled
Required
true
Description
Name of the YARN service that this Tez service instance depends on
Related Name
Default Value
API Name
yarn_service
Required
true

Description
Whether to suppress configuration warnings produced by the CDH Version Validator configuration validator.
Related Name
Default Value
false
API Name
role_config_suppression_cdh_version_validator
Required
true
Description
Whether to suppress configuration warnings produced by the Deploy Directory configuration validator.
Related Name
Default Value
false
API Name
role_config_suppression_client_config_root_dir
Required
true
Description
Whether to suppress configuration warnings produced by the Tez Client Advanced Configuration Snippet (Safety Valve) for tez-conf/tez-site.xml configuration validator.
Related Name
Default Value
false
API Name
role_config_suppression_tez-conf/tez-site.xml_client_config_safety_valve
Required
true
Description
Whether to suppress configuration warnings produced by the Gateway Count Validator configuration validator.
Related Name
Default Value
false
API Name
service_config_suppression_gateway_count_validator
Required
true
Description
Whether to suppress configuration warnings produced by the built-in parameter validation for the System Group parameter.
Related Name
Default Value
false
API Name
service_config_suppression_process_groupname
Required
true
Description
Whether to suppress configuration warnings produced by the built-in parameter validation for the System User parameter.
Related Name
Default Value
false
API Name
service_config_suppression_process_username
Required
true
Description
Whether to suppress configuration warnings produced by the built-in parameter validation for the Service Triggers parameter.
Related Name
Default Value
false
API Name
service_config_suppression_service_triggers
Required
true
Description
Whether to suppress configuration warnings produced by the built-in parameter validation for the Service Monitor Derived Configs Advanced Configuration Snippet (Safety Valve) parameter.
Related Name
Default Value
false
API Name
service_config_suppression_smon_derived_configs_safety_valve
Required
true
Description
Whether to suppress configuration warnings produced by the built-in parameter validation for the Tez Application Master Default Command Line Options parameter.
Related Name
Default Value
false
API Name
service_config_suppression_tez.am.launch.cluster-default.cmd-opts
Required
true
Description
Whether to suppress configuration warnings produced by the built-in parameter validation for the Tez Application Master Command Line Options parameter.
Related Name
Default Value
false
API Name
service_config_suppression_tez.am.launch.cmd-opts
Required
true
Description
Whether to suppress configuration warnings produced by the built-in parameter validation for the Tez Application Master Environment Settings parameter.
Related Name
Default Value
false
API Name
service_config_suppression_tez.am.launch.env
Required
true
Description
Whether to suppress configuration warnings produced by the built-in parameter validation for the Number of Recovery Runs parameter.
Related Name
Default Value
false
API Name
service_config_suppression_tez.am.max.app.attempts
Required
true
Description
Whether to suppress configuration warnings produced by the built-in parameter validation for the History URL Template parameter.
Related Name
Default Value
false
API Name
service_config_suppression_tez.am.tez-ui.history-url.template
Required
true
Description
Whether to suppress configuration warnings produced by the built-in parameter validation for the Tez Application Master View ACLs parameter.
Related Name
Default Value
false
API Name
service_config_suppression_tez.am.view-acls
Required
true
Description
Whether to suppress configuration warnings produced by the built-in parameter validation for the Tez Additional Classpath parameter.
Related Name
Default Value
false
API Name
service_config_suppression_tez.cluster.additional.classpath.prefix
Required
true
Description
Whether to suppress configuration warnings produced by the built-in parameter validation for the Tez history events directory parameter.
Related Name
Default Value
false
API Name
service_config_suppression_tez.history.logging.proto-base-dir
Required
true
Description
Whether to suppress configuration warnings produced by the built-in parameter validation for the DAGs per Group parameter.
Related Name
Default Value
false
API Name
service_config_suppression_tez.history.logging.timeline-cache-plugin.old-num-dags-per-group
Required
true
Description
Whether to suppress configuration warnings produced by the built-in parameter validation for the Codec for Compressing Intermediate Data parameter.
Related Name
Default Value
false
API Name
service_config_suppression_tez.runtime.compress.codec
Required
true
Description
Whether to suppress configuration warnings produced by the built-in parameter validation for the TEZ Staging directory parameter.
Related Name
Default Value
false
API Name
service_config_suppression_tez.staging-dir
Required
true
Description
Whether to suppress configuration warnings produced by the built-in parameter validation for the Tez Task Default Command Line Options parameter.
Related Name
Default Value
false
API Name
service_config_suppression_tez.task.launch.cluster-default.cmd-opts
Required
true
Description
Whether to suppress configuration warnings produced by the built-in parameter validation for the Tez Task Command Line Options parameter.
Related Name
Default Value
false
API Name
service_config_suppression_tez.task.launch.cmd-opts
Required
true
Description
Whether to suppress configuration warnings produced by the built-in parameter validation for the Tez Task Environment Settings parameter.
Related Name
Default Value
false
API Name
service_config_suppression_tez.task.launch.env
Required
true
Description
Whether to suppress configuration warnings produced by the built-in parameter validation for the Tez UI URL Base parameter.
Related Name
Default Value
false
API Name
service_config_suppression_tez.tez-ui.history-url.base
Required
true
Description
Whether to suppress configuration warnings produced by the built-in parameter validation for the Tez Service Environment Advanced Configuration Snippet (Safety Valve) parameter.
Related Name
Default Value
false
API Name
service_config_suppression_tez_service_env_safety_valve
Required
true