Hive LLAP Properties in Cloudera Runtime 7.0.3

Role groups:

Gateway

Categories:

Advanced

Display Name Description Related Name Default Value API Name Required
Deploy Directory The directory where the client configs will be deployed /etc/hive client_config_root_dir true
Hive Client Advanced Configuration Snippet (Safety Valve) for hive-site.xml For advanced use only, a string to be inserted into the client configuration for hive-site.xml. hive_client_config_safety_valve false
Gateway Client Environment Advanced Configuration Snippet (Safety Valve) for hive-env.sh For advanced use only, key-value pairs (one on each line) to be inserted into the client configuration for hive-env.sh hive_client_env_safety_valve false
Client Java Configuration Options These are Java command-line arguments. Commonly, garbage collection flags, PermGen, or extra debugging flags would be passed here. -Djava.net.preferIPv4Stack=true hive_client_java_opts false
Hive Metastore Connection Timeout Timeout for requests to the Hive Metastore Server. Consider increasing this if you have tables with a lot of metadata and see timeout errors. Used by most Hive Metastore clients such as Hive CLI and HiveServer2, but not by Impala. Impala has a separately configured timeout. hive.metastore.client.socket.timeout 5 minute(s) hive_metastore_timeout false
Gateway Logging Advanced Configuration Snippet (Safety Valve) For advanced use only, a string to be inserted into log4j.properties for this role only. log4j_safety_valve false

Logs

Display Name Description Related Name Default Value API Name Required
Gateway Logging Threshold The minimum log level for Gateway logs INFO log_threshold false

Monitoring

Display Name Description Related Name Default Value API Name Required
Enable Configuration Change Alerts When set, Cloudera Manager will send alerts when this entity's configuration changes. false enable_config_alerts false

Other

Display Name Description Related Name Default Value API Name Required
Alternatives Priority The priority level that the client configuration will have in the Alternatives system on the hosts. Higher priority levels will cause Alternatives to prefer this configuration over any others. 91 client_config_priority true

Resource Management

Display Name Description Related Name Default Value API Name Required
Client Java Heap Size in Bytes Maximum size in bytes for the Java process heap memory. Passed to Java -Xmx. 2 GiB hive_client_java_heapsize false

Suppressions

Display Name Description Related Name Default Value API Name Required
Suppress Configuration Validator: CDH Version Validator Whether to suppress configuration warnings produced by the CDH Version Validator configuration validator. false role_config_suppression_cdh_version_validator true
Suppress Parameter Validation: Deploy Directory Whether to suppress configuration warnings produced by the built-in parameter validation for the Deploy Directory parameter. false role_config_suppression_client_config_root_dir true
Suppress Parameter Validation: Hive Client Advanced Configuration Snippet (Safety Valve) for hive-site.xml Whether to suppress configuration warnings produced by the built-in parameter validation for the Hive Client Advanced Configuration Snippet (Safety Valve) for hive-site.xml parameter. false role_config_suppression_hive_client_config_safety_valve true
Suppress Parameter Validation: Gateway Client Environment Advanced Configuration Snippet (Safety Valve) for hive-env.sh Whether to suppress configuration warnings produced by the built-in parameter validation for the Gateway Client Environment Advanced Configuration Snippet (Safety Valve) for hive-env.sh parameter. false role_config_suppression_hive_client_env_safety_valve true
Suppress Parameter Validation: Client Java Configuration Options Whether to suppress configuration warnings produced by the built-in parameter validation for the Client Java Configuration Options parameter. false role_config_suppression_hive_client_java_opts true
Suppress Parameter Validation: Gateway Logging Advanced Configuration Snippet (Safety Valve) Whether to suppress configuration warnings produced by the built-in parameter validation for the Gateway Logging Advanced Configuration Snippet (Safety Valve) parameter. false role_config_suppression_log4j_safety_valve true

HiveServer2

Categories:

Advanced

Display Name Description Related Name Default Value API Name Required
HiveServer2 Environment Advanced Configuration Snippet (Safety Valve) For advanced use only, key-value pairs (one on each line) to be inserted into a role's environment. Applies to configurations of this role except client configuration. hive_hs2_env_safety_valve false
Hive Metastore Connection Retries Count Number of retries while opening a connection to the Hive Metastore Server hive.metastore.connect.retries 10 hive_metastore_connection_retries false
Enable Metrics Subsystem Controls whether the Hive metrics subsystem is enabled for the role. hive.server2.metrics.enabled true hive_metrics_enabled false
Metrics Sample File Location The full path to a file with a sample of metrics exposed by the role. The sample is updated at the frequency configured by Metrics Sample File Logging Frequency. By default, the sample file is logged to a directory under the role log directory, e.g., /var/log/hive/metrics-hivemetastore/metrics.log. The setting only has an effect if "Enable Metrics Subsystem" is set to true. hive.service.metrics.file.location hive_metrics_sample_file_location false
Metrics Sample File Logging Frequency The frequency at which the metrics are logged to the sample file. The setting only has an effect if "Enable Metrics Subsystem" is set to true. hive.service.metrics.file.frequency 30 second(s) hive_metrics_sample_logging_frequency false
Allow User Substitution Allow alternate user to be specified as part of HiveServer2 open connection request. hive.server2.allow.user.substitution true hive_server2_allow_user_substitution false
HiveServer2 Transport Mode The server transport mode. hive.server2.transport.mode binary hive_server2_transport_mode false
HiveServer2 Advanced Configuration Snippet (Safety Valve) for hive-site.xml For advanced use only. A string to be inserted into hive-site.xml for this role only. hivellap_hs2_config_safety_valve false
Hive Downloaded Resources Directory Local directory where Hive stores jars downloaded for remote file systems (HDFS). If not specified, Hive uses a default location. hive.downloaded.resources.dir hiveserver2_downloaded_resources_dir false
Enable Explain Logging When enabled, HiveServer2 logs EXPLAIN EXTENDED output for every query at INFO log4j level. hive.log.explain.output false hiveserver2_enable_explain_output false
Hive Local Scratch Directory Local Directory where Hive stores jars and data when performing a MapJoin optimization. If not specified, Hive uses a default location. hive.exec.local.scratchdir hiveserver2_exec_local_scratchdir false
Hive HDFS Scratch Directory Directory in HDFS where Hive writes intermediate data between MapReduce jobs. If not specified, Hive uses a default location. hive.exec.scratchdir hiveserver2_exec_scratchdir false
Fair Scheduler XML Advanced Configuration Snippet (Safety Valve) An XML string that will be inserted verbatim into the Fair Scheduler allocations file. This configuration only has effect in CDH 5.8 or later. hiveserver2_fair_scheduler_safety_valve false
Idle Operation Timeout Operation will be closed when not accessed for this duration of time, in milliseconds; disable by setting to zero. For a positive value, checked for operations in terminal state only (FINISHED, CANCELED, CLOSED, ERROR). For a negative value, checked for all of the operations regardless of state. hive.server2.idle.operation.timeout 6 hour(s) hiveserver2_idle_operation_timeout false
Idle Session Timeout Session will be closed when not accessed for this duration of time, in milliseconds; disable by setting to zero or a negative value. hive.server2.idle.session.timeout 1 day(s) hiveserver2_idle_session_timeout false
Exclude Live Operations From Session Idle Time Session will be considered to be idle only if there is no activity, and there is no pending operation. This setting takes effect only if session idle timeout (hive.server2.idle.session.timeout) and checking (hive.server2.session.check.interval) are enabled. hive.server2.idle.session.check.operation true hiveserver2_idle_session_timeout_check_operation false
Java Configuration Options for HiveServer2 These arguments will be passed as part of the Java command line. Commonly, garbage collection flags, PermGen, or extra debugging flags would be passed here. Note: When CM version is 6.3.0 or greater, {{JAVA_GC_ARGS}} will be replaced by JVM Garbage Collection arguments based on the runtime Java JVM version. JAVA_GC_ARGS hiveserver2_java_opts false
Maximum Query String Length for Show Locks The maximum length allowed for the query string when the SHOW LOCKS EXTENDED command is executed. Important: The query string is truncated at the length set for this property. Setting this property to a large value puts pressure on ZooKeeper and might cause out-of-memory issues. hive.lock.query.string.max.length 10000 hiveserver2_lock_query_string_max_length false
Max HiveServer2 Threads Maximum number of worker threads in HiveServer2's thread pool hive.server2.thrift.max.worker.threads 500 hiveserver2_max_threads true
Min HiveServer2 Threads Minimum number of worker threads in HiveServer2's thread pool hive.server2.thrift.min.worker.threads 5 hiveserver2_min_threads true
Session Check Interval The check interval for session/operation timeout, in milliseconds, which can be disabled by setting to zero or a negative value. hive.server2.session.check.interval 15 minute(s) hiveserver2_session_check_interval false
HiveServer2 WebUI Max Threads The max threads for the HiveServer2 WebUI. hive.server2.webui.max.threads 50 hiveserver2_webui_max_threads false
HiveServer2 Advanced Configuration Snippet (Safety Valve) for core-site.xml For advanced use only. A string to be inserted into core-site.xml for this role only. hs2_core_site_safety_valve false
HiveServer2 Logging Advanced Configuration Snippet (Safety Valve) For advanced use only, a string to be inserted into log4j.properties for this role only. log4j_safety_valve false
Heap Dump Directory Path to directory where heap dumps are generated when java.lang.OutOfMemoryError error is thrown. This directory is automatically created if it does not exist. If this directory already exists, role user must have write access to this directory. If this directory is shared among multiple roles, it should have 1777 permissions. The heap dump files are created with 600 permissions and are owned by the role user. The amount of free space in this directory should be greater than the maximum Java Process heap size configured for this role. oom_heap_dump_dir /tmp oom_heap_dump_dir false
Dump Heap When Out of Memory When set, generates heap dump file when java.lang.OutOfMemoryError is thrown. true oom_heap_dump_enabled true
Kill When Out of Memory When set, a SIGKILL signal is sent to the role process when java.lang.OutOfMemoryError is thrown. true oom_sigkill_enabled true
Automatically Restart Process When set, this role's process is automatically (and transparently) restarted in the event of an unexpected failure. This configuration applies in the time after the Start Wait Timeout period. false process_auto_restart true
Enable Metric Collection Cloudera Manager agent monitors each service and each of its role by publishing metrics to the Cloudera Manager Service Monitor. Setting it to false will stop Cloudera Manager agent from publishing any metric for corresponding service/roles. This is usually helpful for services that generate large amount of metrics which Service Monitor is not able to process. true process_should_monitor true
Process Start Retry Attempts Number of times to try starting a role's process when the process exits before the Start Wait Timeout period. After a process is running beyond the Start Wait Timeout, the retry count is reset. Setting this configuration to zero will prevent restart of the process during the Start Wait Timeout period. 3 process_start_retries false
Process Start Wait Timeout The time in seconds to wait for a role's process to start successfully on a host. Processes which exit/crash before this time will be restarted until reaching the limit specified by the Start Retry Attempts count parameter. Setting this configuration to zero will turn off this feature. 20 process_start_secs false

Logs

Display Name Description Related Name Default Value API Name Required
HiveServer2 Log Directory Directory where HiveServer2 will place its log files. /var/log/hive hive_log_dir false
Enable HiveServer2 Operations Logging When enabled, HiveServer2 will temporarily save logs associated with ongoing operations. This enables clients like beeline and Hue to request and display logs for a particular ongoing operation. Logs are removed upon completion of operation. hive.server2.logging.operation.enabled true hive_server2_logging_operation_enabled false
HiveServer2 Operations Log Directory Top level directory where operation logs are temporarily stored if Enable HiveServer2 Operations Logging is true. Logs are stored in session and operation level subdirectories under this location and are removed on completion of operation. hive.server2.logging.operation.log.location /var/log/hive/operation_logs hive_server2_logging_operation_log_location false
HiveServer2 Logging Threshold The minimum log level for HiveServer2 logs INFO log_threshold false
HiveServer2 Maximum Log File Backups The maximum number of rolled log files to keep for HiveServer2 logs. Typically used by log4j or logback. 10 max_log_backup_index false
HiveServer2 Max Log Size The maximum size, in megabytes, per log file for HiveServer2 logs. Typically used by log4j or logback. 200 MiB max_log_size false

Monitoring

Display Name Description Related Name Default Value API Name Required
Enable Health Alerts for this Role When set, Cloudera Manager will send alerts when the health of this role reaches the threshold specified by the EventServer setting eventserver_health_events_alert_threshold true enable_alerts false
Enable Configuration Change Alerts When set, Cloudera Manager will send alerts when this entity's configuration changes. false enable_config_alerts false
Heap Dump Directory Free Space Monitoring Absolute Thresholds The health test thresholds for monitoring of free space on the filesystem that contains this role's heap dump directory. Warning: 10 GiB, Critical: 5 GiB heap_dump_directory_free_space_absolute_thresholds false
Heap Dump Directory Free Space Monitoring Percentage Thresholds The health test thresholds for monitoring of free space on the filesystem that contains this role's heap dump directory. Specified as a percentage of the capacity on that filesystem. This setting is not used if a Heap Dump Directory Free Space Monitoring Absolute Thresholds setting is configured. Warning: Never, Critical: Never heap_dump_directory_free_space_percentage_thresholds false
Hive Downloaded Resources Directory Free Space Monitoring Absolute Thresholds The health test thresholds for monitoring of free space on the filesystem that contains this role's Hive Downloaded Resources Directory. Warning: 10 GiB, Critical: 5 GiB hive_llap_hs2_downloaded_resources_directory_free_space_absolute_thresholds false
Hive Downloaded Resources Directory Free Space Monitoring Percentage Thresholds The health test thresholds for monitoring of free space on the filesystem that contains this role's Hive Downloaded Resources Directory. Specified as a percentage of the capacity on that filesystem. This setting is not used if a Hive Downloaded Resources Directory Free Space Monitoring Absolute Thresholds setting is configured. Warning: Never, Critical: Never hive_llap_hs2_downloaded_resources_directory_free_space_percentage_thresholds false
Hive Local Scratch Directory Free Space Monitoring Absolute Thresholds The health test thresholds for monitoring of free space on the filesystem that contains this role's Hive Local Scratch Directory. Warning: 10 GiB, Critical: 5 GiB hive_llap_hs2_exec_local_scratch_directory_free_space_absolute_thresholds false
Hive Local Scratch Directory Free Space Monitoring Percentage Thresholds The health test thresholds for monitoring of free space on the filesystem that contains this role's Hive Local Scratch Directory. Specified as a percentage of the capacity on that filesystem. This setting is not used if a Hive Local Scratch Directory Free Space Monitoring Absolute Thresholds setting is configured. Warning: Never, Critical: Never hive_llap_hs2_exec_local_scratch_directory_free_space_percentage_thresholds false
File Descriptor Monitoring Thresholds The health test thresholds of the number of file descriptors used. Specified as a percentage of file descriptor limit. Warning: 50.0 %, Critical: 70.0 % hiveserver2_fd_thresholds false
HiveServer2 Host Health Test When computing the overall HiveServer2 health, consider the host's health. true hiveserver2_host_health_enabled false
Pause Duration Thresholds The health test thresholds for the weighted average extra time the pause monitor spent paused. Specified as a percentage of elapsed wall clock time. Warning: 30.0, Critical: 60.0 hiveserver2_pause_duration_thresholds false
Pause Duration Monitoring Period The period to review when computing the moving average of extra time the pause monitor spent paused. 5 minute(s) hiveserver2_pause_duration_window false
HiveServer2 Process Health Test Enables the health test that the HiveServer2's process state is consistent with the role configuration true hiveserver2_scm_health_enabled false
Log Directory Free Space Monitoring Absolute Thresholds The health test thresholds for monitoring of free space on the filesystem that contains this role's log directory. Warning: 10 GiB, Critical: 5 GiB log_directory_free_space_absolute_thresholds false
Log Directory Free Space Monitoring Percentage Thresholds The health test thresholds for monitoring of free space on the filesystem that contains this role's log directory. Specified as a percentage of the capacity on that filesystem. This setting is not used if a Log Directory Free Space Monitoring Absolute Thresholds setting is configured. Warning: Never, Critical: Never log_directory_free_space_percentage_thresholds false
Navigator Audit Failure Thresholds The health test thresholds for failures encountered when monitoring audits within a recent period specified by the mgmt_navigator_failure_window configuration for the role. The value that can be specified for this threshold is the number of bytes of audits data that is left to be sent to audit server. mgmt.navigator.failure.thresholds Warning: Never, Critical: Any mgmt_navigator_failure_thresholds false
Monitoring Period For Audit Failures The period to review when checking if audits are blocked and not getting processed. mgmt.navigator.failure.window 20 minute(s) mgmt_navigator_failure_window false
Navigator Audit Pipeline Health Check Enable test of audit events processing pipeline. This will test if audit events are not getting processed by Audit Server for a role that generates audit. mgmt.navigator.status.check.enabled true mgmt_navigator_status_check_enabled false
Metric Filter Defines a Metric Filter for this role. Cloudera Manager Agents will not send filtered metrics to the Service Monitor. Define the following fields:
  • Health Test Metric Set - Select this parameter to collect only metrics required for health tests.
  • Default Dashboard Metric Set - Select this parameter to collect only metrics required for the default dashboards. For user-defined charts, you must add the metrics you require for the chart using the Custom Metrics parameter.
  • Include/Exclude Custom Metrics - Select Include to specify metrics that should be collected. Select Exclude to specify metrics that should not be collected. Enter the metric names to be included or excluded using the Metric Name parameter.
  • Metric Name - The name of a metric that will be included or excluded during metric collection.
If you do not select Health Test Metric Set or Default Dashboard Metric Set, or specify metrics by name, metric filtering will be turned off (this is the default behavior).For example, the following configuration enables the collection of metrics required for Health Tests and the jvm_heap_used_mb metric:
  • Include only Health Test Metric Set: Selected.
  • Include/Exclude Custom Metrics: Set to Include.
  • Metric Name: jvm_heap_used_mb
You can also view the JSON representation for this parameter by clicking View as JSON. In this example, the JSON looks like this:{ "includeHealthTestMetricSet": true, "filterType": "whitelist", "metrics": ["jvm_heap_used_mb"] }
monitoring_metric_filter false
Process Swap Memory Thresholds The health test thresholds on the swap memory usage of the process. This takes precedence over the host level threshold. Warning: 200 B, Critical: Never process_swap_memory_thresholds false
Role Triggers The configured triggers for this role. This is a JSON-formatted list of triggers. These triggers are evaluated as part as the health system. Every trigger expression is parsed, and if the trigger condition is met, the list of actions provided in the trigger expression is executed. Each trigger has the following fields:
  • triggerName (mandatory) - The name of the trigger. This value must be unique for the specific role.
  • triggerExpression (mandatory) - A tsquery expression representing the trigger.
  • streamThreshold (optional) - The maximum number of streams that can satisfy a condition of a trigger before the condition fires. By default set to 0, and any stream returned causes the condition to fire.
  • enabled (optional) - By default set to 'true'. If set to 'false', the trigger is not evaluated.
  • expressionEditorConfig (optional) - Metadata for the trigger editor. If present, the trigger should only be edited from the Edit Trigger page; editing the trigger here can lead to inconsistencies.
For example, the following JSON formatted trigger configured for a DataNode fires if the DataNode has more than 1500 file descriptors opened:[{"triggerName": "sample-trigger", "triggerExpression": "IF (SELECT fd_open WHERE roleName=$ROLENAME and last(fd_open) > 1500) DO health:bad", "streamThreshold": 0, "enabled": "true"}]See the trigger rules documentation for more details on how to write triggers using tsquery.The JSON format is evolving and may change and, as a result, backward compatibility is not guaranteed between releases.
[] role_triggers true
Unexpected Exits Thresholds The health test thresholds for unexpected exits encountered within a recent period specified by the unexpected_exits_window configuration for the role. Warning: Never, Critical: Any unexpected_exits_thresholds false
Unexpected Exits Monitoring Period The period to review when computing unexpected exits. 5 minute(s) unexpected_exits_window false

Other

Display Name Description Related Name Default Value API Name Required
Restrict Cross Joins (Cartesian Products) Whether to allow queries with cross joins. If set to true, queries that contain this pattern throw a compile-time error. hive.strict.checks.cartesian.product false hive_restrict_cross_joins false
Restrict LOAD Queries Against Bucketed Tables Whether to allow LOAD queries against bucketed tables. If set to true, queries that contain this pattern throw a compile-time error. hive.strict.checks.bucketing true hive_restrict_load_bucketed_table false
Restrict Queries with ORDER BY but no LIMIT clause Whether to allow queries with an ORDER BY clause, but no LIMIT clause. If set to true, queries that contain this pattern throw a compile-time error. hive.strict.checks.orderby.no.limit false hive_restrict_orderby_with_no_limit false
Restrict Partitioned Table Scans with no Partitioned Column Filter Whether to allow queries that scan a partitioned table but don't filter on the partition column. If set to true, queries that contain this pattern throw a compile-time error. hive.strict.checks.no.partition.filter false hive_restrict_partitioned_scans_no_filter false
Restrict Unsafe Data Type Comparisons Whether to allow queries that compare bigints to strings or doubles. If set to true, queries that contain this pattern throw a compile-time error. hive.strict.checks.type.safety true hive_restrict_unsafe_comparison false
Support Dynamic Service Discovery Whether HiveServer2 supports dynamic service discovery for its clients. To support this, each instance of HiveServer2 currently uses ZooKeeper to register itself, when it is brought up. JDBC/ODBC clients should use the ZooKeeper ensemble: hive.zookeeper.quorum in their connection string. hive.server2.support.dynamic.service.discovery true hive_server2_support_dynamic_service_discovery false
Hive Server Zookeeper Namespace The parent node in ZooKeeper used by HiveServer2 when supporting dynamic service discovery. hive.server2.zookeeper.namespace hiveserver2-interactive hive_server2_zookeeper_namespace false
Default query queues A list of comma separated values corresponding to YARN queues of the same name. When HiveServer2 is launched in Tez mode, this configuration needs to be set for multiple Tez sessions to run in parallel on the cluster. hive.server2.tez.default.queues llap hivellap_default_query_queues false
hive.prewarm.enabled Enables container prewarm for Tez hive.prewarm.enabled false hivellap_prewarm_enabled false
Number of Containers Held Controls the number of containers to prewarm for Tez hive.prewarm.numcontainers 2 hivellap_prewarm_numcontainers false
Start Tez session at Initialization This flag is used in HiveServer2 to enable a user to use HiveServer2 without turning on Tez for HiveServer2. The user could potentially want to run queries over Tez without the pool of sessions. hive.server2.tez.initialize.default.sessions true hivellap_tez_initialize_default_sessions false
Allow custom queues Whether to allow the users of this HS2 to specify custom queues - yes, no (fail if specified), ignore (use the default queues even if a custom one is specified) hive.server2.tez.sessions.custom.queue.allowed ignore hivellap_tez_sessions_custom_queue_allowed false
HiveServer2 Enable Impersonation HiveServer2 will impersonate the beeline client user when talking to other services such as MapReduce and HDFS. hive.server2.enable.doAs true hiveserver2_enable_impersonation false
HiveServer2 Load Balancer Address of the load balancer used for HiveServer2 roles, specified in host:port format. If port is not specified, the port used by HiveServer2 is used. Note: Changing this property regenerates Kerberos keytabs for all HiveServer2 roles. hiveserver2_load_balancer false

Performance

Display Name Description Related Name Default Value API Name Required
Enable Dynamic Partitions Whether or not to allow dynamic partitions in DML/DDL. hive.exec.dynamic.partition true hive_exec_dynamic_partition false
Hive Auto Convert Join Noconditional Size If Hive auto convert join is on, and the sum of the size for n-1 of the tables/partitions for a n-way join is smaller than the specified size, the join is directly converted to a MapJoin (there is no conditional task). hive.auto.convert.join.noconditionaltask.size 50 MiB hiveserver2_auto_convert_join_noconditionaltask_size false
Store Intermediate Data on Blobstore When writing data to a table on a blobstore (such as S3), whether or not the blobstore should be used to store intermediate data during Hive query execution. Setting this to true can degrade performance for queries that spawn multiple MR / Spark jobs, but is useful for queries whose intermediate data cannot fit in the allocated HDFS cluster. hive.blobstore.use.blobstore.as.scratchdir false hiveserver2_blobstore_use_blobstore_as_scratchdir false
Enable Stats Optimization Enable optimization that checks if a query can be answered using statistics. If so, answers the query using only statistics stored in metastore. hive.compute.query.using.stats true hiveserver2_compute_query_using_stats false
Enable Cost-Based Optimizer for Hive Enabled the Calcite-based Cost-Based Optimizer for HiveServer2. hive.cbo.enable true hiveserver2_enable_cbo false
Enable MapJoin Optimization Enable optimization that converts common join into MapJoin based on input file size. hive.auto.convert.join true hiveserver2_enable_mapjoin false
Fetch Task Query Conversion Some select queries can be converted to a single FETCH task instead of a MapReduce task, minimizing latency. A value of none disables all conversion, minimal converts simple queries such as SELECT * and filter on partition columns, and more converts SELECT queries including FILTERS. hive.fetch.task.conversion more hiveserver2_fetch_task_conversion false
Fetch Task Query Conversion Threshold Above this size, queries are converted to fetch tasks. hive.fetch.task.conversion.threshold 1 GiB hiveserver2_fetch_task_conversion_threshold false
Input Listing Max Threads Maximum number of threads that Hive uses to list input files. Increasing this value can improve performance when there are a lot of partitions being read, or when running on blobstores. hive.exec.input.listing.max.threads 15 hiveserver2_input_listing_max_threads false
Maximum ReduceSink Top-K Memory Usage The maximum percentage of heap to be used for hash in ReduceSink operator for Top-K selection. 0 means the optimization is disabled. Accepted values are between 0 and 1. hive.limit.pushdown.memory.usage 0.04 hiveserver2_limit_pushdown_memory_usage false
Load Dynamic Partitions Thread Count Number of threads used to load dynamically generated partitions. Loading requires renaming the file its final location, and updating some metadata about the new partition. Increasing this can improve performance when there are a lot of partitions dynamically generated. hive.load.dynamic.partitions.thread 15 hiveserver2_load_dynamic_partitions_thread_count false
Enable Map-Side Aggregation Enable map-side partial aggregation, which cause the mapper to generate fewer rows. This reduces the data to be sorted and distributed to reducers. hive.map.aggr true hiveserver2_map_aggr false
Ratio of Memory Usage for Map-Side Aggregation Portion of total memory used in map-side partial aggregation. When exceeded, the partially aggregated results will be flushed from the map task to the reducers. hive.map.aggr.hash.percentmemory 0.5 hiveserver2_map_aggr_hash_memory_ratio false
Enable Merging Small Files - Map-Only Job Merge small files at the end of a map-only job. When enabled, a map-only job is created to merge the files in the destination table/partitions. hive.merge.mapfiles true hiveserver2_merge_mapfiles false
Enable Merging Small Files - Map-Reduce Job Merge small files at the end of a map-reduce job. When enabled, a map-only job is created to merge the files in the destination table/partitions. hive.merge.mapredfiles false hiveserver2_merge_mapredfiles false
Desired File Size After Merging The desired file size after merging. This should be larger than hive.merge.smallfiles.avgsize. hive.merge.size.per.task 256 MiB hiveserver2_merge_size_per_task false
Small File Average Size Merge Threshold When the average output file size of a job is less than the value of this property, Hive will start an additional map-only job to merge the output files into bigger files. This is only done for map-only jobs if hive.merge.mapfiles is true, for map-reduce jobs if hive.merge.mapredfiles is true, and for Spark jobs if hive.merge.sparkfiles is true. hive.merge.smallfiles.avgsize 16 MiB hiveserver2_merge_smallfiles_avgsize false
MSCK Repair Batch Size Batch size for the msck repair command (recover partitions command). If the value is greater than zero, new partition information will be sent from HiveServer2 to the Metastore in batches, which can potentially improve memory usage in the Metastore and avoid client read timeout exceptions. If this value is 0, all partition information will sent in a single Thrift call. hive.msck.repair.batch.size 0 hiveserver2_msck_repair_batch_size false
Move Files Thread Count The number of threads used by HiveServer2 to move data from the staging directory to another location (typically to the final table location). A separate thread pool of workers of this size is used for each query, which means this configuration can be set on a per-query basis too. hive.mv.files.thread 15 hiveserver2_mv_files_thread false
Hive Optimize Sorted Merge Bucket Join Whether to try sorted merge bucket (SMB) join. hive.optimize.bucketmapjoin.sortedmerge true hiveserver2_optimize_bucketmapjoin_sortedmerge false
Enable Automatic Use of Indexes Whether to use the indexing optimization for all queries. hive.optimize.index.filter true hiveserver2_optimize_index_filter false
Enable ReduceDeDuplication Optimization Remove extra map-reduce jobs if the data is already clustered by the same key, eliminating the need to repartition the dataset again. hive.optimize.reducededuplication true hiveserver2_optimize_reducededuplication false
Mininum Reducers for ReduceDeDuplication Optimization When the number of ReduceSink operators after merging is less than this number, the ReduceDeDuplication optimization will be disabled. hive.optimize.reducededuplication.min.reducer 4 hiveserver2_optimize_reducededuplication_min_reducer false
Enable Sorted Dynamic Partition Optimizer When dynamic partition is enabled, reducers keep only one record writer at all times, which lowers the memory pressure on reducers. hive.optimize.sort.dynamic.partition false hiveserver2_optimize_sort_dynamic_partition false
Enable Parallel Compilation of Queries When activated, individual sessions can compile queries simultaneously. Within each session, queries compile one at a time. hive.driver.parallel.compilation true hiveserver2_parallel_compilation_enabled false
Query Compilation Degree of Parallelism Determines the maximum number of queries that can compile in parallel on a HiveServer2 instance. Use negative values or zero to set unlimited parallelism. Use a positive value to set the number of queries that can compile simultaneously. This setting can be fine-tuned based on the current cluster load. Monitor cluster load using the 'waiting_compile_ops' metric and the 'Waiting Compile Operations' graph in the HiveServer2 graph library. hive.driver.parallel.compilation.global.limit 3 hiveserver2_parallel_compilation_global_limit false
Hive SMB Join Cache Rows The number of rows with the same key value to be cached in memory per SMB-joined table. hive.smbjoin.cache.rows 10000 hiveserver2_smbjoin_cache_rows false
Load Column Statistics Whether column stats for a table are fetched during explain. hive.stats.fetch.column.stats true hiveserver2_stats_fetch_column_stats false
Vectorized Adapter Usage Mode Vectorized Adaptor Usage Mode specifies the extent to which the vectorization engine tries to vectorize UDFs that do not have native vectorized versions available. Selecting the "none" option specifies that only queries using native vectorized UDFs are vectorized. Selecting the "chosen" option specifies that Hive choses to vectorize a subset of the UDFs based on performance benefits using the Vectorized Adaptor. Selecting the "all" option specifies that the Vectorized Adaptor be used for all UDFs even when native vectorized versions are not available. hive.vectorized.adaptor.usage.mode chosen hiveserver2_vectorized_adaptor_usage_mode false
Enable Vectorization Optimization Enable optimization that vectorizes query execution by streamlining operations by processing a block of 1024 rows at a time. hive.vectorized.execution.enabled true hiveserver2_vectorized_enabled false
Vectorized GroupBy Check Interval In vectorized group-by, the number of row entries added to the hash table before re-checking average variable size for memory usage estimation. hive.vectorized.groupby.checkinterval 4096 hiveserver2_vectorized_groupby_checkinterval false
Vectorized GroupBy Flush Ratio Ratio between 0.0 and 1.0 of entries in the vectorized group-by aggregation hash that is flushed when the memory threshold is exceeded. hive.vectorized.groupby.flush.percent 0.1 hiveserver2_vectorized_groupby_flush_ratio false
Enable Vectorized Input Format If enabled, Hive uses the native vectorized input format for vectorized query execution when it is available. hive.vectorized.use.vectorized.input.format true hiveserver2_vectorized_input_format_enabled false
Exclude Vectorized Input Formats Specifies a list of file input format classnames to exclude from vectorized query execution using the vectorized input format. Note that vectorized execution can still occur for an excluded input format based on whether row SerDes or vector SerDes are enabled. hive.vectorized.input.format.excludes org.apache.hadoop.hive.ql.io.parquet.MapredParquetInputFormat hiveserver2_vectorized_input_format_excludes false
Enable Reduce-Side Vectorization Whether to vectorize the reduce side of query execution. hive.vectorized.execution.reduce.enabled true hiveserver2_vectorized_reduce_enabled false
Enable Overflow-checked Vector Expressions To enhance performance, vectorized expressions operate using wide data types like long and double. When wide data types are used, numeric overflows can occur during expression evaluation in a different manner for vectorized expressions than they do for non-vectorized expressions. Consequently, different query results can be returned for vectorized expressions compared to results returned for non-vectorized expressions. When this configuration is enabled, Hive uses vectorized expressions that handle numeric overflows in the same way as non-vectorized expressions are handled. hive.vectorized.use.checked.expressions true hiveserver2_vectorized_use_checked_expressions false
Vectorize Using Vector SerDes If enabled, Hive uses built-in vector SerDes to process text and sequencefile tables for vectorized query execution. hive.vectorized.use.vector.serde.deserialize false hiveserver2_vectorized_use_vector_serde_deserialize false
Maximum Process File Descriptors If configured, overrides the process soft and hard rlimits (also called ulimits) for file descriptors to the configured value. rlimit_fds false

Ports and Addresses

Display Name Description Related Name Default Value API Name Required
Thrift port TCP port number to listen on. hive.server2.thrift.http.port 10501 hive_server2_thrift_http_port false
Bind HiveServer2 to Wildcard Address If enabled, the HiveServer2 binds to the wildcard address ("0.0.0.0") on all of its ports. hive.server2.webui.host true hiveserver2_webui_bind_wildcard false
HiveServer2 WebUI Port The port the HiveServer2 WebUI will listen on. This can be set to 0 to disable the WebUI. hive.server2.webui.port 10502 hiveserver2_webui_port false
HiveServer2 Port Port on which HiveServer2 will listen for connections. hive.server2.thrift.port 10500 hs2_thrift_address_port false

Resource Management

Display Name Description Related Name Default Value API Name Required
Java Heap Size of HiveServer2 in Bytes Maximum size in bytes for the Java Process heap memory. Passed to Java -Xmx. 4 GiB hiveserver2_java_heapsize false
Cgroup CPU Shares Number of CPU shares to assign to this role. The greater the number of shares, the larger the share of the host's CPUs that will be given to this role when the host experiences CPU contention. Must be between 2 and 262144. Defaults to 1024 for processes not managed by Cloudera Manager. cpu.shares 1024 rm_cpu_shares true
Custom Control Group Resources (overrides Cgroup settings) Custom control group resources to assign to this role, which will be enforced by the Linux kernel. These resources should exist on the target hosts, otherwise an error will occur when the process starts. Use the same format as used for arguments to the cgexec command: resource1,resource2:path1 or resource3:path2 For example: 'cpu,memory:my/path blkio:my2/path2' ***These settings override other cgroup settings.*** custom.cgroups rm_custom_resources false
Cgroup I/O Weight Weight for the read I/O requests issued by this role. The greater the weight, the higher the priority of the requests when the host experiences I/O contention. Must be between 100 and 1000. Defaults to 1000 for processes not managed by Cloudera Manager. blkio.weight 500 rm_io_weight true
Cgroup Memory Hard Limit Hard memory limit to assign to this role, enforced by the Linux kernel. When the limit is reached, the kernel will reclaim pages charged to the process. If reclaiming fails, the kernel may kill the process. Both anonymous as well as page cache pages contribute to the limit. Use a value of -1 to specify no limit. By default processes not managed by Cloudera Manager will have no limit. If the value is -1, Cloudera Manager will not monitor Cgroup memory usage therefore some of the charts will show 'No Data' memory.limit_in_bytes -1 MiB rm_memory_hard_limit true
Cgroup Memory Soft Limit Soft memory limit to assign to this role, enforced by the Linux kernel. When the limit is reached, the kernel will reclaim pages charged to the process if and only if the host is facing memory pressure. If reclaiming fails, the kernel may kill the process. Both anonymous as well as page cache pages contribute to the limit. Use a value of -1 to specify no limit. By default processes not managed by Cloudera Manager will have no limit. If the value is -1, Cloudera Manager will not monitor Cgroup memory usage therefore some of the charts will show 'No Data' memory.soft_limit_in_bytes -1 MiB rm_memory_soft_limit true

Security

Display Name Description Related Name Default Value API Name Required
Enable TLS/SSL for HiveServer2 WebUI Encrypt communication between clients and HiveServer2 WebUI using Transport Layer Security (TLS) (formerly known as Secure Socket Layer (SSL)). hive.server2.webui.use.ssl false ssl_enabled false
HiveServer2 WebUI TLS/SSL Server JKS Keystore File Location The path to the TLS/SSL keystore file containing the server certificate and private key used for TLS/SSL. Used when HiveServer2 WebUI is acting as a TLS/SSL server. The keystore must be in JKS format. hive.server2.webui.keystore.path ssl_server_keystore_location false
HiveServer2 WebUI TLS/SSL Server JKS Keystore File Password The password for the HiveServer2 WebUI JKS keystore file. hive.server2.webui.keystore.password ssl_server_keystore_password false

Stacks Collection

Display Name Description Related Name Default Value API Name Required
Stacks Collection Data Retention The amount of stacks data that is retained. After the retention limit is reached, the oldest data is deleted. stacks_collection_data_retention 100 MiB stacks_collection_data_retention false
Stacks Collection Directory The directory in which stacks logs are placed. If not set, stacks are logged into a stacks subdirectory of the role's log directory. stacks_collection_directory stacks_collection_directory false
Stacks Collection Enabled Whether or not periodic stacks collection is enabled. stacks_collection_enabled false stacks_collection_enabled true
Stacks Collection Frequency The frequency with which stacks are collected. stacks_collection_frequency 5.0 second(s) stacks_collection_frequency false
Stacks Collection Method The method used to collect stacks. The jstack option involves periodically running the jstack command against the role's daemon process. The servlet method is available for those roles that have an HTTP server endpoint exposing the current stacks traces of all threads. When the servlet method is selected, that HTTP endpoint is periodically scraped. stacks_collection_method jstack stacks_collection_method false

Suppressions

Display Name Description Related Name Default Value API Name Required
Suppress Configuration Validator: CDH Version Validator Whether to suppress configuration warnings produced by the CDH Version Validator configuration validator. false role_config_suppression_cdh_version_validator true
Suppress Parameter Validation: HiveServer2 Environment Advanced Configuration Snippet (Safety Valve) Whether to suppress configuration warnings produced by the built-in parameter validation for the HiveServer2 Environment Advanced Configuration Snippet (Safety Valve) parameter. false role_config_suppression_hive_hs2_env_safety_valve true
Suppress Parameter Validation: HiveServer2 Log Directory Whether to suppress configuration warnings produced by the built-in parameter validation for the HiveServer2 Log Directory parameter. false role_config_suppression_hive_log_dir true
Suppress Parameter Validation: Metrics Sample File Location Whether to suppress configuration warnings produced by the built-in parameter validation for the Metrics Sample File Location parameter. false role_config_suppression_hive_metrics_sample_file_location true
Suppress Configuration Validator: Restrict Load Bucketed Table Validator Whether to suppress configuration warnings produced by the Restrict Load Bucketed Table Validator configuration validator. false role_config_suppression_hive_restrict_load_bucketed_table_validator true
Suppress Configuration Validator: Restrict Unsafe Comparison Validator Whether to suppress configuration warnings produced by the Restrict Unsafe Comparison Validator configuration validator. false role_config_suppression_hive_restrict_unsafe_comparison_validator true
Suppress Parameter Validation: HiveServer2 Operations Log Directory Whether to suppress configuration warnings produced by the built-in parameter validation for the HiveServer2 Operations Log Directory parameter. false role_config_suppression_hive_server2_logging_operation_log_location true
Suppress Parameter Validation: Thrift port Whether to suppress configuration warnings produced by the built-in parameter validation for the Thrift port parameter. false role_config_suppression_hive_server2_thrift_http_port true
Suppress Parameter Validation: Hive Server Zookeeper Namespace Whether to suppress configuration warnings produced by the built-in parameter validation for the Hive Server Zookeeper Namespace parameter. false role_config_suppression_hive_server2_zookeeper_namespace true
Suppress Parameter Validation: Default query queues Whether to suppress configuration warnings produced by the built-in parameter validation for the Default query queues parameter. false role_config_suppression_hivellap_default_query_queues true
Suppress Parameter Validation: HiveServer2 Advanced Configuration Snippet (Safety Valve) for hive-site.xml Whether to suppress configuration warnings produced by the built-in parameter validation for the HiveServer2 Advanced Configuration Snippet (Safety Valve) for hive-site.xml parameter. false role_config_suppression_hivellap_hs2_config_safety_valve true
Suppress Parameter Validation: Allow custom queues Whether to suppress configuration warnings produced by the built-in parameter validation for the Allow custom queues parameter. false role_config_suppression_hivellap_tez_sessions_custom_queue_allowed true
Suppress Parameter Validation: Hive Downloaded Resources Directory Whether to suppress configuration warnings produced by the built-in parameter validation for the Hive Downloaded Resources Directory parameter. false role_config_suppression_hiveserver2_downloaded_resources_dir true
Suppress Parameter Validation: Hive Local Scratch Directory Whether to suppress configuration warnings produced by the built-in parameter validation for the Hive Local Scratch Directory parameter. false role_config_suppression_hiveserver2_exec_local_scratchdir true
Suppress Parameter Validation: Hive HDFS Scratch Directory Whether to suppress configuration warnings produced by the built-in parameter validation for the Hive HDFS Scratch Directory parameter. false role_config_suppression_hiveserver2_exec_scratchdir true
Suppress Parameter Validation: Fair Scheduler XML Advanced Configuration Snippet (Safety Valve) Whether to suppress configuration warnings produced by the built-in parameter validation for the Fair Scheduler XML Advanced Configuration Snippet (Safety Valve) parameter. false role_config_suppression_hiveserver2_fair_scheduler_safety_valve true
Suppress Parameter Validation: Java Configuration Options for HiveServer2 Whether to suppress configuration warnings produced by the built-in parameter validation for the Java Configuration Options for HiveServer2 parameter. false role_config_suppression_hiveserver2_java_opts true
Suppress Parameter Validation: HiveServer2 Load Balancer Whether to suppress configuration warnings produced by the built-in parameter validation for the HiveServer2 Load Balancer parameter. false role_config_suppression_hiveserver2_load_balancer true
Suppress Parameter Validation: Exclude Vectorized Input Formats Whether to suppress configuration warnings produced by the built-in parameter validation for the Exclude Vectorized Input Formats parameter. false role_config_suppression_hiveserver2_vectorized_input_format_excludes true
Suppress Parameter Validation: HiveServer2 WebUI Port Whether to suppress configuration warnings produced by the built-in parameter validation for the HiveServer2 WebUI Port parameter. false role_config_suppression_hiveserver2_webui_port true
Suppress Parameter Validation: HiveServer2 Advanced Configuration Snippet (Safety Valve) for core-site.xml Whether to suppress configuration warnings produced by the built-in parameter validation for the HiveServer2 Advanced Configuration Snippet (Safety Valve) for core-site.xml parameter. false role_config_suppression_hs2_core_site_safety_valve true
Suppress Parameter Validation: HiveServer2 Port Whether to suppress configuration warnings produced by the built-in parameter validation for the HiveServer2 Port parameter. false role_config_suppression_hs2_thrift_address_port true
Suppress Parameter Validation: HiveServer2 Logging Advanced Configuration Snippet (Safety Valve) Whether to suppress configuration warnings produced by the built-in parameter validation for the HiveServer2 Logging Advanced Configuration Snippet (Safety Valve) parameter. false role_config_suppression_log4j_safety_valve true
Suppress Parameter Validation: Heap Dump Directory Whether to suppress configuration warnings produced by the built-in parameter validation for the Heap Dump Directory parameter. false role_config_suppression_oom_heap_dump_dir true
Suppress Parameter Validation: Custom Control Group Resources (overrides Cgroup settings) Whether to suppress configuration warnings produced by the built-in parameter validation for the Custom Control Group Resources (overrides Cgroup settings) parameter. false role_config_suppression_rm_custom_resources true
Suppress Parameter Validation: Role Triggers Whether to suppress configuration warnings produced by the built-in parameter validation for the Role Triggers parameter. false role_config_suppression_role_triggers true
Suppress Parameter Validation: HiveServer2 WebUI TLS/SSL Server JKS Keystore File Location Whether to suppress configuration warnings produced by the built-in parameter validation for the HiveServer2 WebUI TLS/SSL Server JKS Keystore File Location parameter. false role_config_suppression_ssl_server_keystore_location true
Suppress Parameter Validation: HiveServer2 WebUI TLS/SSL Server JKS Keystore File Password Whether to suppress configuration warnings produced by the built-in parameter validation for the HiveServer2 WebUI TLS/SSL Server JKS Keystore File Password parameter. false role_config_suppression_ssl_server_keystore_password true
Suppress Parameter Validation: Stacks Collection Directory Whether to suppress configuration warnings produced by the built-in parameter validation for the Stacks Collection Directory parameter. false role_config_suppression_stacks_collection_directory true
Suppress Parameter Validation: tez.history.logging.taskattempt-filters Whether to suppress configuration warnings produced by the built-in parameter validation for the tez.history.logging.taskattempt-filters parameter. false role_config_suppression_tez_interactive_history_logging_taskattempt_filters true
Suppress Health Test: Audit Pipeline Test Whether to suppress the results of the Audit Pipeline Test heath test. The results of suppressed health tests are ignored when computing the overall health of the associated host, role or service, so suppressed health tests will not generate alerts. false role_health_suppression_hive_llap_hiveserver2_audit_health true
Suppress Health Test: File Descriptors Whether to suppress the results of the File Descriptors heath test. The results of suppressed health tests are ignored when computing the overall health of the associated host, role or service, so suppressed health tests will not generate alerts. false role_health_suppression_hive_llap_hiveserver2_file_descriptor true
Suppress Health Test: Heap Dump Directory Free Space Whether to suppress the results of the Heap Dump Directory Free Space heath test. The results of suppressed health tests are ignored when computing the overall health of the associated host, role or service, so suppressed health tests will not generate alerts. false role_health_suppression_hive_llap_hiveserver2_heap_dump_directory_free_space true
Suppress Health Test: Host Health Whether to suppress the results of the Host Health heath test. The results of suppressed health tests are ignored when computing the overall health of the associated host, role or service, so suppressed health tests will not generate alerts. false role_health_suppression_hive_llap_hiveserver2_host_health true
Suppress Health Test: Log Directory Free Space Whether to suppress the results of the Log Directory Free Space heath test. The results of suppressed health tests are ignored when computing the overall health of the associated host, role or service, so suppressed health tests will not generate alerts. false role_health_suppression_hive_llap_hiveserver2_log_directory_free_space true
Suppress Health Test: Pause Duration Whether to suppress the results of the Pause Duration heath test. The results of suppressed health tests are ignored when computing the overall health of the associated host, role or service, so suppressed health tests will not generate alerts. false role_health_suppression_hive_llap_hiveserver2_pause_duration true
Suppress Health Test: Process Status Whether to suppress the results of the Process Status heath test. The results of suppressed health tests are ignored when computing the overall health of the associated host, role or service, so suppressed health tests will not generate alerts. false role_health_suppression_hive_llap_hiveserver2_scm_health true
Suppress Health Test: Swap Memory Usage Whether to suppress the results of the Swap Memory Usage heath test. The results of suppressed health tests are ignored when computing the overall health of the associated host, role or service, so suppressed health tests will not generate alerts. false role_health_suppression_hive_llap_hiveserver2_swap_memory_usage true
Suppress Health Test: Unexpected Exits Whether to suppress the results of the Unexpected Exits heath test. The results of suppressed health tests are ignored when computing the overall health of the associated host, role or service, so suppressed health tests will not generate alerts. false role_health_suppression_hive_llap_hiveserver2_unexpected_exits true
Suppress Health Test: Hive Downloaded Resources Directory Free Space Whether to suppress the results of the Hive Downloaded Resources Directory Free Space heath test. The results of suppressed health tests are ignored when computing the overall health of the associated host, role or service, so suppressed health tests will not generate alerts. false role_health_suppression_hive_llap_hs2_downloaded_resources_directory_free_space true
Suppress Health Test: Hive Local Scratch Directory Free Space Whether to suppress the results of the Hive Local Scratch Directory Free Space heath test. The results of suppressed health tests are ignored when computing the overall health of the associated host, role or service, so suppressed health tests will not generate alerts. false role_health_suppression_hive_llap_hs2_exec_local_scratch_directory_free_space true

Tez interactive

Display Name Description Related Name Default Value API Name Required
tez.am.am-rm.heartbeat.interval-ms.max The heartbeat interval between the tez AM and YARN RM tez.am.am-rm.heartbeat.interval-ms.max 10 second(s) tez_interactive_am_am_rm_heartbeat_interval_ms_max false
tez.am.client.heartbeat.poll.interval.millis The interval at which the AM checks for a client heartbeat timeout tez.am.client.heartbeat.poll.interval.millis 6 second(s) tez_interactive_am_client_heartbeat_poll_interval_millis false
tez.am.client.heartbeat.timeout.secs The time interval, after which an AM will kill itself, if it does not receive a heartbeat from the client. tez.am.client.heartbeat.timeout.secs 1 minute(s), 30 second(s) tez_interactive_am_client_heartbeat_timeout_secs false
tez.am.node-blacklisting.enabled Whether to enable blacklisting in Tez AMs. Disable for LLAP tez.am.node-blacklisting.enabled false tez_interactive_am_node_blacklisting_enabled false
tez.am.resource.memory.mb The amount of memory to be used by the AppMaster tez.am.resource.memory.mb 1 GiB tez_interactive_am_resource_memory_mb false
tez.am.task.listener.thread-count Number of IPC server thread for Tez task listener. Should be minimized for LLAP tez.am.task.listener.thread-count 1 tez_interactive_am_task_listener_thread_count false
tez.am.task.reschedule.higher.priority Whether rescheduled tasks should be treated at higher priority tez.am.task.reschedule.higher.priority false tez_interactive_am_task_reschedule_higher_priority false
tez.container.max.java.heap.fraction Setting this to -1 so that Tez can auto determine different Xmx for different container size tez.container.max.java.heap.fraction -1.0 tez_interactive_container_max_java_heap_fraction false
tez.dag.recovery.enabled tez.dag.recovery.enabled tez.dag.recovery.enabled false tez_interactive_dag_recovery_enabled false
tez.grouping.node.local.only tez.grouping.node.local.only tez.grouping.node.local.only true tez_interactive_grouping_node_local_only false
tez.history.logging.log.level Set the log level to TASK_ATTEMPT. tez.history.logging.log.level TASK_ATTEMPT tez_interactive_history_logging_log_level false
tez.history.logging.taskattempt-filters TASK_ATTEMPT events to be ignored. tez.history.logging.taskattempt-filters SERVICE_BUSY, EXTERNAL_PREEMPTION tez_interactive_history_logging_taskattempt_filters false
tez.history.logging.timeline.num-dags-per-group Maximum number of dags per group. tez.history.logging.timeline.num-dags-per-group 5 tez_interactive_history_logging_timeline_num_dags_per_group false
tez.runtime.enable.final-merge.in.output Whether to enable a map side merge of outputs tez.runtime.enable.final-merge.in.output false tez_interactive_runtime_enable_final_merge_in_output false
tez.runtime.io.sort.mb The size of the sort buffer when output needs to be sorted tez.runtime.io.sort.mb 512 MiB tez_interactive_runtime_io_sort_mb false
tez.runtime.pipelined-shuffle.enabled tez.runtime.pipelined-shuffle.enabled tez.runtime.pipelined-shuffle.enabled false tez_interactive_runtime_pipelined_shuffle_enabled false
tez.runtime.pipelined.sorter.lazy-allocate.memory tez.runtime.pipelined.sorter.lazy-allocate.memory tez.runtime.pipelined.sorter.lazy-allocate.memory true tez_interactive_runtime_pipelined_sorter_lazy_allocate_memory false
tez.runtime.report.partition.stats tez.runtime.report.partition.stats tez.runtime.report.partition.stats true tez_interactive_runtime_report_partition_stats false
tez.runtime.shuffle.connect.timeout Shuffle connect timeouts (ms) tez.runtime.shuffle.connect.timeout 30 second(s) tez_interactive_runtime_shuffle_connect_timeout false
tez.runtime.shuffle.fetch.buffer.percent Fraction (0-1) of the available memory which can be used to retain shuffled data tez.runtime.shuffle.fetch.buffer.percent 0.6 tez_interactive_runtime_shuffle_fetch_buffer_percent false
tez.runtime.shuffle.fetch.verify-disk-checksum tez.runtime.shuffle.fetch.verify-disk-checksum tez.runtime.shuffle.fetch.verify-disk-checksum false tez_interactive_runtime_shuffle_fetch_verify_disk_checksum false
tez.runtime.shuffle.keep-alive.enabled Connection keep-alive for shuffle tez.runtime.shuffle.keep-alive.enabled true tez_interactive_runtime_shuffle_keep_alive_enabled false
tez.runtime.shuffle.memory.limit.percent This property determines the maximum size of a shuffle segment which can be fetched to memory. Fraction (0-1) of shuffle memory (after applying tez.runtime.shuffle.fetch.buffer.percent) tez.runtime.shuffle.memory.limit.percent 0.25 tez_interactive_runtime_shuffle_memory_limit_percent false
tez.runtime.shuffle.parallel.copies tez.runtime.shuffle.parallel.copies tez.runtime.shuffle.parallel.copies 8 tez_interactive_runtime_shuffle_parallel_copies false
tez.runtime.shuffle.read.timeout Shuffle read timeout (ms) tez.runtime.shuffle.read.timeout 30 second(s) tez_interactive_runtime_shuffle_read_timeout false
tez.runtime.shuffle.ssl.enable tez.runtime.shuffle.ssl.enable tez.runtime.shuffle.ssl.enable false tez_interactive_runtime_shuffle_ssl_enable false
tez.runtime.unordered.output.buffer.size-mb The size of the buffer when output does not require to be sorted tez.runtime.unordered.output.buffer.size-mb 100 MiB tez_interactive_runtime_unordered_output_buffer_size_mb false
tez.runtime.unordered.output.max-per-buffer.size-bytes tez.runtime.unordered.output.max-per-buffer.size-bytes tez.runtime.unordered.output.max-per-buffer.size-bytes 128 MiB tez_interactive_runtime_unordered_output_max_per_buffer_size_bytes false
tez.session.am.dag.submit.timeout.secs The amount of time an AM will wait, before killing itself, if not DAG is submitted. tez.session.am.dag.submit.timeout.secs 14 day(s) tez_interactive_session_am_dag_submit_timeout_secs false
tez.task.heartbeat.timeout.check-ms The time interval, in milliseconds, at which the AM will check for timed out tasks tez.task.heartbeat.timeout.check-ms 15 second(s) tez_interactive_task_heartbeat_timeout_check_ms false
tez.task.timeout-ms mount of time the Tez AM waits before marking a task which has not sent in a heartbeat, as timed out tez.task.timeout-ms 1 minute(s), 30 second(s) tez_interactive_task_timeout_ms false

LLAP Proxy

Categories:

Advanced

Display Name Description Related Name Default Value API Name Required
Metrics Sample File Location The full path to a file with a sample of metrics exposed by the role. The sample is updated at the frequency configured by Metrics Sample File Logging Frequency. By default, the sample file is logged to a directory under the role log directory, e.g., /var/log/hive/metrics-hivemetastore/metrics.log. The setting only has an effect if "Enable Metrics Subsystem" is set to true. hive.service.metrics.file.location hive_metrics_sample_file_location false
LLAP Daemon Java Options Extra Java options that will be applied on the JVM processes of LLAP Daemons. hivellap_daemon_opts false
LLAP Proxy Advanced Configuration Snippet (Safety Valve) for hive-site.xml For advanced use only. A string to be inserted into hive-site.xml for this role only. hivellap_llapproxy_config_safety_valve false
LLAP Proxy Environment Advanced Configuration Snippet (Safety Valve) For advanced use only, key-value pairs (one on each line) to be inserted into a role's environment. Applies to configurations of this role except client configuration. LLAPPROXY_role_env_safety_valve false
Heap Dump Directory Path to directory where heap dumps are generated when java.lang.OutOfMemoryError error is thrown. This directory is automatically created if it does not exist. If this directory already exists, role user must have write access to this directory. If this directory is shared among multiple roles, it should have 1777 permissions. The heap dump files are created with 600 permissions and are owned by the role user. The amount of free space in this directory should be greater than the maximum Java Process heap size configured for this role. oom_heap_dump_dir /tmp oom_heap_dump_dir false
Dump Heap When Out of Memory When set, generates heap dump file when java.lang.OutOfMemoryError is thrown. true oom_heap_dump_enabled true
Kill When Out of Memory When set, a SIGKILL signal is sent to the role process when java.lang.OutOfMemoryError is thrown. true oom_sigkill_enabled true
Automatically Restart Process When set, this role's process is automatically (and transparently) restarted in the event of an unexpected failure. This configuration applies in the time after the Start Wait Timeout period. false process_auto_restart true
Enable Metric Collection Cloudera Manager agent monitors each service and each of its role by publishing metrics to the Cloudera Manager Service Monitor. Setting it to false will stop Cloudera Manager agent from publishing any metric for corresponding service/roles. This is usually helpful for services that generate large amount of metrics which Service Monitor is not able to process. true process_should_monitor true
Process Start Retry Attempts Number of times to try starting a role's process when the process exits before the Start Wait Timeout period. After a process is running beyond the Start Wait Timeout, the retry count is reset. Setting this configuration to zero will prevent restart of the process during the Start Wait Timeout period. 3 process_start_retries false
Process Start Wait Timeout The time in seconds to wait for a role's process to start successfully on a host. Processes which exit/crash before this time will be restarted until reaching the limit specified by the Start Retry Attempts count parameter. Setting this configuration to zero will turn off this feature. 20 process_start_secs false

Logs

Display Name Description Related Name Default Value API Name Required
LLAP Proxy Log Directory Directory where LLAP Proxy will place its log files. /var/log/hive hive_log_dir false

Monitoring

Display Name Description Related Name Default Value API Name Required
Enable Health Alerts for this Role When set, Cloudera Manager will send alerts when the health of this role reaches the threshold specified by the EventServer setting eventserver_health_events_alert_threshold true enable_alerts false
Enable Configuration Change Alerts When set, Cloudera Manager will send alerts when this entity's configuration changes. false enable_config_alerts false
Heap Dump Directory Free Space Monitoring Absolute Thresholds The health test thresholds for monitoring of free space on the filesystem that contains this role's heap dump directory. Warning: 10 GiB, Critical: 5 GiB heap_dump_directory_free_space_absolute_thresholds false
Heap Dump Directory Free Space Monitoring Percentage Thresholds The health test thresholds for monitoring of free space on the filesystem that contains this role's heap dump directory. Specified as a percentage of the capacity on that filesystem. This setting is not used if a Heap Dump Directory Free Space Monitoring Absolute Thresholds setting is configured. Warning: Never, Critical: Never heap_dump_directory_free_space_percentage_thresholds false
Hive LLAP Daemons Ready Status Startup Tolerance Bad health state will be shown if, after this amount of time (after starting LLAP proxy role), 0 LLAP daemons are running. 5 minute(s) llapd_ready_status_check_daemon_startup_tolerance false
Hive LLAP Proxy Role Ready Status Startup Tolerance Startup Tolerance time of the LLAP proxy role in which no health test is run. This is recommended to be kept minimal. 1 second(s) llapd_ready_status_check_role_startup_tolerance false
Hive LLAP Daemons Ready Status Thresholds The health test thresholds for monitoring the number of active LLAP daemons. Warning: 75.0, Critical: 25.0 llapd_ready_status_thresholds false
File Descriptor Monitoring Thresholds The health test thresholds of the number of file descriptors used. Specified as a percentage of file descriptor limit. Warning: 50.0 %, Critical: 70.0 % llapproxy_fd_thresholds false
LLAP Proxy Host Health Test When computing the overall LLAP Proxy health, consider the host's health. true llapproxy_host_health_enabled false
Pause Duration Thresholds The health test thresholds for the weighted average extra time the pause monitor spent paused. Specified as a percentage of elapsed wall clock time. Warning: 30.0, Critical: 60.0 llapproxy_pause_duration_thresholds false
Pause Duration Monitoring Period The period to review when computing the moving average of extra time the pause monitor spent paused. 5 minute(s) llapproxy_pause_duration_window false
LLAP Proxy Process Health Test Enables the health test that the LLAP Proxy's process state is consistent with the role configuration true llapproxy_scm_health_enabled false
Log Directory Free Space Monitoring Absolute Thresholds The health test thresholds for monitoring of free space on the filesystem that contains this role's log directory. Warning: 10 GiB, Critical: 5 GiB log_directory_free_space_absolute_thresholds false
Log Directory Free Space Monitoring Percentage Thresholds The health test thresholds for monitoring of free space on the filesystem that contains this role's log directory. Specified as a percentage of the capacity on that filesystem. This setting is not used if a Log Directory Free Space Monitoring Absolute Thresholds setting is configured. Warning: Never, Critical: Never log_directory_free_space_percentage_thresholds false
Navigator Audit Failure Thresholds The health test thresholds for failures encountered when monitoring audits within a recent period specified by the mgmt_navigator_failure_window configuration for the role. The value that can be specified for this threshold is the number of bytes of audits data that is left to be sent to audit server. mgmt.navigator.failure.thresholds Warning: Never, Critical: Any mgmt_navigator_failure_thresholds false
Monitoring Period For Audit Failures The period to review when checking if audits are blocked and not getting processed. mgmt.navigator.failure.window 20 minute(s) mgmt_navigator_failure_window false
Navigator Audit Pipeline Health Check Enable test of audit events processing pipeline. This will test if audit events are not getting processed by Audit Server for a role that generates audit. mgmt.navigator.status.check.enabled true mgmt_navigator_status_check_enabled false
Metric Filter Defines a Metric Filter for this role. Cloudera Manager Agents will not send filtered metrics to the Service Monitor. Define the following fields:
  • Health Test Metric Set - Select this parameter to collect only metrics required for health tests.
  • Default Dashboard Metric Set - Select this parameter to collect only metrics required for the default dashboards. For user-defined charts, you must add the metrics you require for the chart using the Custom Metrics parameter.
  • Include/Exclude Custom Metrics - Select Include to specify metrics that should be collected. Select Exclude to specify metrics that should not be collected. Enter the metric names to be included or excluded using the Metric Name parameter.
  • Metric Name - The name of a metric that will be included or excluded during metric collection.
If you do not select Health Test Metric Set or Default Dashboard Metric Set, or specify metrics by name, metric filtering will be turned off (this is the default behavior).For example, the following configuration enables the collection of metrics required for Health Tests and the jvm_heap_used_mb metric:
  • Include only Health Test Metric Set: Selected.
  • Include/Exclude Custom Metrics: Set to Include.
  • Metric Name: jvm_heap_used_mb
You can also view the JSON representation for this parameter by clicking View as JSON. In this example, the JSON looks like this:{ "includeHealthTestMetricSet": true, "filterType": "whitelist", "metrics": ["jvm_heap_used_mb"] }
monitoring_metric_filter false
Process Swap Memory Thresholds The health test thresholds on the swap memory usage of the process. This takes precedence over the host level threshold. Warning: 200 B, Critical: Never process_swap_memory_thresholds false
Role Triggers The configured triggers for this role. This is a JSON-formatted list of triggers. These triggers are evaluated as part as the health system. Every trigger expression is parsed, and if the trigger condition is met, the list of actions provided in the trigger expression is executed. Each trigger has the following fields:
  • triggerName (mandatory) - The name of the trigger. This value must be unique for the specific role.
  • triggerExpression (mandatory) - A tsquery expression representing the trigger.
  • streamThreshold (optional) - The maximum number of streams that can satisfy a condition of a trigger before the condition fires. By default set to 0, and any stream returned causes the condition to fire.
  • enabled (optional) - By default set to 'true'. If set to 'false', the trigger is not evaluated.
  • expressionEditorConfig (optional) - Metadata for the trigger editor. If present, the trigger should only be edited from the Edit Trigger page; editing the trigger here can lead to inconsistencies.
For example, the following JSON formatted trigger configured for a DataNode fires if the DataNode has more than 1500 file descriptors opened:[{"triggerName": "sample-trigger", "triggerExpression": "IF (SELECT fd_open WHERE roleName=$ROLENAME and last(fd_open) > 1500) DO health:bad", "streamThreshold": 0, "enabled": "true"}]See the trigger rules documentation for more details on how to write triggers using tsquery.The JSON format is evolving and may change and, as a result, backward compatibility is not guaranteed between releases.
[] role_triggers true
Unexpected Exits Thresholds The health test thresholds for unexpected exits encountered within a recent period specified by the unexpected_exits_window configuration for the role. Warning: Never, Critical: Any unexpected_exits_thresholds false
Unexpected Exits Monitoring Period The period to review when computing unexpected exits. 5 minute(s) unexpected_exits_window false

Other

Display Name Description Related Name Default Value API Name Required
hive.llap.auto.allow.uber Whether or not to allow the planner to run vertices in the AM hive.llap.auto.allow.uber false hivellap_auto_allow_uber false
In-Memory Cache per Daemon The amount of memory reserved for Hive's optimized in-memory cache. hive.llap.io.memory.size 512 MiB hivellap_cache_per_llap_daemon false
hive.llap.client.consistent.splits Whether to setup split locations to match nodes on which llap daemons are running, instead of using the locations provided by the split itself. hive.llap.client.consistent.splits true hivellap_client_consistent_splits false
hive.llap.daemon.am.liveness.heartbeat.interval.ms Tez AM-LLAP heartbeat interval. This should be below the task timeout. hive.llap.daemon.am.liveness.heartbeat.interval.ms 10 second(s) hivellap_daemon_am_heartbeat_interval false
Number of nodes used by Hive's LLAP Number of nodes used by Hive's LLAP, which includes LLAP nodes running. Yarn native service container and Tez App Master(s) are not part of this. num_llap_nodes 1 hivellap_daemon_count false
hive.llap.daemon.logger Logger to be used by LLAP. (query-routing, RFA) hive.llap.daemon.logger llap hivellap_daemon_logger false
hive.llap.daemon.rpc.port The LLAP daemon RPC port. hive.llap.daemon.rpc.port 0 hivellap_daemon_rpc_port false
hive.llap.daemon.task.scheduler.enable.preemption hive.llap.daemon.task.scheduler.enable.preemption hive.llap.daemon.task.scheduler.enable.preemption true hivellap_daemon_task_scheduler_enable_preemption false
hive.llap.daemon.vcpus.per.instance The total number of vcpus to use for the executors inside LLAP. hive.llap.daemon.vcpus.per.instance 0 hivellap_daemon_vcpus_per_instance false
hive.llap.daemon.yarn.shuffle.port YARN shuffle port for LLAP-daemon-hosted shuffle. hive.llap.daemon.yarn.shuffle.port 15551 hivellap_daemon_yarn_shuffle_port false
dfs.client.mmap.enabled Disable HDFS caching fo LLAP dfs.client.mmap.enabled false hivellap_dfs_client_mmap_enabled false
dfs.short.circuit.shared.memory.watcher.interrupt.check.ms Disable HDFS caching fo LLAP dfs.short.circuit.shared.memory.watcher.interrupt.check.ms 0 second(s) hivellap_dfs_shared_mem_watcher_interrupt false
hive.llap.enable.grace.join.in.llap Override if grace join should be allowed to run in llap for regular map joins. Dynamic partitioned joins will honor the hive.mapjoin.hybridgrace.hashtable property in LLAP hive.llap.enable.grace.join.in.llap false hivellap_enable_grace_join_in_llap false
hive.execution.mode Chooses whether query fragments will run in container or in llap hive.execution.mode llap hivellap_execution_mode false
Number of executors per LLAP Daemon The number of fragments that a single LLAP daemon will run concurrently. Usually, this will be the same as the number of available CPUs hive.llap.daemon.num.executors 2 hivellap_executors_per_llap_daemon false
LLAP Daemon Heap Size in MB. LLAP Daemon Heap Size in MB. llap_heap_size 1280 MiB hivellap_heap_per_llap_daemon false
hive.llap.io.enabled Whether the LLAP IO layer is enabled. hive.llap.io.enabled true hivellap_io_enabled false
hive.llap.io.memory.mode LLAP IO memory usage; 'cache' (the default) uses data and metadata cache with a custom off-heap allocator, 'allocator' uses the custom allocator without the caches, 'none' doesn't use either (this mode may result in significant performance degradation) hive.llap.io.memory.mode cache hivellap_io_memory_mode false
hive.llap.io.threadpool.size Specify the number of threads to use for low-level IO thread pool. hive.llap.io.threadpool.size 2 hivellap_io_threadpool_size false
hive.llap.io.use.lrfu Whether ORC low-level cache should use LRFU cache policy instead of default (FIFO). hive.llap.io.use.lrfu true hivellap_io_use_lrfu false
hive.llap.execution.mode Chooses which fragments of a query will run in llap hive.llap.execution.mode only hivellap_llap_execution_mode false
hive.llap.management.rpc.port RPC port for LLAP daemon management service. hive.llap.management.rpc.port 15004 hivellap_management_rpc_port false
hive.llap.mapjoin.memory.oversubscribe.factor hive.llap.mapjoin.memory.oversubscribe.factor hive.llap.mapjoin.memory.oversubscribe.factor 0.3 hivellap_mapjoin_memory_oversubscribe_factor false
Maximum Total Concurrent Queries The maximum number of queries the Hive Interactive cluster will be able to handle concurrently. hive.server2.tez.sessions.per.default.queue 1 hivellap_max_total_concurrent_queries false
Memory per Daemon Total memory used by individual LLAP daemons (YARN Container size). This includes memory for the cache as well as for the query execution. Should be larger than the sum of the Daemon cache size and the daemon heap size, and should leave some headroom after this (In most cases: cache size + heap size + headroom = Memory Per Daemon). hive.llap.daemon.yarn.container.mb 2 GiB hivellap_memory_per_llap_daemon false
hive.llap.object.cache.enabled Cache objects (plans, hashtables, etc) in llap. hive.llap.object.cache.enabled true hivellap_object_cache_enabled false
Interactive Query Queue Choose the YARN queue in this cluster that is dedicated to interactive query. hive.llap.daemon.queue.name llap hivellap_queue_name false
llap.shuffle.connection-keep-alive.enable llap.shuffle.connection-keep-alive.enable llap.shuffle.connection-keep-alive.enable true hivellap_shuffle_connection_keep_alive_enable false
llap.shuffle.connection-keep-alive.timeout llap.shuffle.connection-keep-alive.timeout llap.shuffle.connection-keep-alive.timeout 1 minute(s) hivellap_shuffle_connection_keep_alive_timeout false
Turn SSD Cache On? Turn SSD Cache On? hive.llap.io.allocator.mmap false hivellap_ssd_cache_on false
hive.llap.task.scheduler.locality.delay Amount of time to wait before allocating a request which contains location information, to a location other than the ones requested. Set to -1 for an infinite delay, 0 for no delay. hive.llap.task.scheduler.locality.delay 1 hivellap_task_scheduler_locality_delay false

Performance

Display Name Description Related Name Default Value API Name Required
Maximum Process File Descriptors If configured, overrides the process soft and hard rlimits (also called ulimits) for file descriptors to the configured value. rlimit_fds false

Resource Management

Display Name Description Related Name Default Value API Name Required
Cgroup CPU Shares Number of CPU shares to assign to this role. The greater the number of shares, the larger the share of the host's CPUs that will be given to this role when the host experiences CPU contention. Must be between 2 and 262144. Defaults to 1024 for processes not managed by Cloudera Manager. cpu.shares 1024 rm_cpu_shares true
Custom Control Group Resources (overrides Cgroup settings) Custom control group resources to assign to this role, which will be enforced by the Linux kernel. These resources should exist on the target hosts, otherwise an error will occur when the process starts. Use the same format as used for arguments to the cgexec command: resource1,resource2:path1 or resource3:path2 For example: 'cpu,memory:my/path blkio:my2/path2' ***These settings override other cgroup settings.*** custom.cgroups rm_custom_resources false
Cgroup I/O Weight Weight for the read I/O requests issued by this role. The greater the weight, the higher the priority of the requests when the host experiences I/O contention. Must be between 100 and 1000. Defaults to 1000 for processes not managed by Cloudera Manager. blkio.weight 500 rm_io_weight true
Cgroup Memory Hard Limit Hard memory limit to assign to this role, enforced by the Linux kernel. When the limit is reached, the kernel will reclaim pages charged to the process. If reclaiming fails, the kernel may kill the process. Both anonymous as well as page cache pages contribute to the limit. Use a value of -1 to specify no limit. By default processes not managed by Cloudera Manager will have no limit. If the value is -1, Cloudera Manager will not monitor Cgroup memory usage therefore some of the charts will show 'No Data' memory.limit_in_bytes -1 MiB rm_memory_hard_limit true
Cgroup Memory Soft Limit Soft memory limit to assign to this role, enforced by the Linux kernel. When the limit is reached, the kernel will reclaim pages charged to the process if and only if the host is facing memory pressure. If reclaiming fails, the kernel may kill the process. Both anonymous as well as page cache pages contribute to the limit. Use a value of -1 to specify no limit. By default processes not managed by Cloudera Manager will have no limit. If the value is -1, Cloudera Manager will not monitor Cgroup memory usage therefore some of the charts will show 'No Data' memory.soft_limit_in_bytes -1 MiB rm_memory_soft_limit true

Stacks Collection

Display Name Description Related Name Default Value API Name Required
Stacks Collection Data Retention The amount of stacks data that is retained. After the retention limit is reached, the oldest data is deleted. stacks_collection_data_retention 100 MiB stacks_collection_data_retention false
Stacks Collection Directory The directory in which stacks logs are placed. If not set, stacks are logged into a stacks subdirectory of the role's log directory. stacks_collection_directory stacks_collection_directory false
Stacks Collection Enabled Whether or not periodic stacks collection is enabled. stacks_collection_enabled false stacks_collection_enabled true
Stacks Collection Frequency The frequency with which stacks are collected. stacks_collection_frequency 5.0 second(s) stacks_collection_frequency false
Stacks Collection Method The method used to collect stacks. The jstack option involves periodically running the jstack command against the role's daemon process. The servlet method is available for those roles that have an HTTP server endpoint exposing the current stacks traces of all threads. When the servlet method is selected, that HTTP endpoint is periodically scraped. stacks_collection_method jstack stacks_collection_method false

Suppressions

Display Name Description Related Name Default Value API Name Required
Suppress Configuration Validator: CDH Version Validator Whether to suppress configuration warnings produced by the CDH Version Validator configuration validator. false role_config_suppression_cdh_version_validator true
Suppress Parameter Validation: LLAP Proxy Log Directory Whether to suppress configuration warnings produced by the built-in parameter validation for the LLAP Proxy Log Directory parameter. false role_config_suppression_hive_log_dir true
Suppress Parameter Validation: Metrics Sample File Location Whether to suppress configuration warnings produced by the built-in parameter validation for the Metrics Sample File Location parameter. false role_config_suppression_hive_metrics_sample_file_location true
Suppress Parameter Validation: hive.llap.daemon.logger Whether to suppress configuration warnings produced by the built-in parameter validation for the hive.llap.daemon.logger parameter. false role_config_suppression_hivellap_daemon_logger true
Suppress Parameter Validation: LLAP Daemon Java Options Whether to suppress configuration warnings produced by the built-in parameter validation for the LLAP Daemon Java Options parameter. false role_config_suppression_hivellap_daemon_opts true
Suppress Parameter Validation: hive.execution.mode Whether to suppress configuration warnings produced by the built-in parameter validation for the hive.execution.mode parameter. false role_config_suppression_hivellap_execution_mode true
Suppress Parameter Validation: hive.llap.io.memory.mode Whether to suppress configuration warnings produced by the built-in parameter validation for the hive.llap.io.memory.mode parameter. false role_config_suppression_hivellap_io_memory_mode true
Suppress Parameter Validation: hive.llap.execution.mode Whether to suppress configuration warnings produced by the built-in parameter validation for the hive.llap.execution.mode parameter. false role_config_suppression_hivellap_llap_execution_mode true
Suppress Parameter Validation: LLAP Proxy Advanced Configuration Snippet (Safety Valve) for hive-site.xml Whether to suppress configuration warnings produced by the built-in parameter validation for the LLAP Proxy Advanced Configuration Snippet (Safety Valve) for hive-site.xml parameter. false role_config_suppression_hivellap_llapproxy_config_safety_valve true
Suppress Parameter Validation: Interactive Query Queue Whether to suppress configuration warnings produced by the built-in parameter validation for the Interactive Query Queue parameter. false role_config_suppression_hivellap_queue_name true
Suppress Parameter Validation: LLAP Proxy Environment Advanced Configuration Snippet (Safety Valve) Whether to suppress configuration warnings produced by the built-in parameter validation for the LLAP Proxy Environment Advanced Configuration Snippet (Safety Valve) parameter. false role_config_suppression_llapproxy_role_env_safety_valve true
Suppress Parameter Validation: Heap Dump Directory Whether to suppress configuration warnings produced by the built-in parameter validation for the Heap Dump Directory parameter. false role_config_suppression_oom_heap_dump_dir true
Suppress Parameter Validation: Custom Control Group Resources (overrides Cgroup settings) Whether to suppress configuration warnings produced by the built-in parameter validation for the Custom Control Group Resources (overrides Cgroup settings) parameter. false role_config_suppression_rm_custom_resources true
Suppress Parameter Validation: Role Triggers Whether to suppress configuration warnings produced by the built-in parameter validation for the Role Triggers parameter. false role_config_suppression_role_triggers true
Suppress Parameter Validation: Stacks Collection Directory Whether to suppress configuration warnings produced by the built-in parameter validation for the Stacks Collection Directory parameter. false role_config_suppression_stacks_collection_directory true
Suppress Health Test: Audit Pipeline Test Whether to suppress the results of the Audit Pipeline Test heath test. The results of suppressed health tests are ignored when computing the overall health of the associated host, role or service, so suppressed health tests will not generate alerts. false role_health_suppression_hive_llap_llapproxy_audit_health true
Suppress Health Test: File Descriptors Whether to suppress the results of the File Descriptors heath test. The results of suppressed health tests are ignored when computing the overall health of the associated host, role or service, so suppressed health tests will not generate alerts. false role_health_suppression_hive_llap_llapproxy_file_descriptor true
Suppress Health Test: Heap Dump Directory Free Space Whether to suppress the results of the Heap Dump Directory Free Space heath test. The results of suppressed health tests are ignored when computing the overall health of the associated host, role or service, so suppressed health tests will not generate alerts. false role_health_suppression_hive_llap_llapproxy_heap_dump_directory_free_space true
Suppress Health Test: Host Health Whether to suppress the results of the Host Health heath test. The results of suppressed health tests are ignored when computing the overall health of the associated host, role or service, so suppressed health tests will not generate alerts. false role_health_suppression_hive_llap_llapproxy_host_health true
Suppress Health Test: Log Directory Free Space Whether to suppress the results of the Log Directory Free Space heath test. The results of suppressed health tests are ignored when computing the overall health of the associated host, role or service, so suppressed health tests will not generate alerts. false role_health_suppression_hive_llap_llapproxy_log_directory_free_space true
Suppress Health Test: Pause Duration Whether to suppress the results of the Pause Duration heath test. The results of suppressed health tests are ignored when computing the overall health of the associated host, role or service, so suppressed health tests will not generate alerts. false role_health_suppression_hive_llap_llapproxy_pause_duration true
Suppress Health Test: Process Status Whether to suppress the results of the Process Status heath test. The results of suppressed health tests are ignored when computing the overall health of the associated host, role or service, so suppressed health tests will not generate alerts. false role_health_suppression_hive_llap_llapproxy_scm_health true
Suppress Health Test: Swap Memory Usage Whether to suppress the results of the Swap Memory Usage heath test. The results of suppressed health tests are ignored when computing the overall health of the associated host, role or service, so suppressed health tests will not generate alerts. false role_health_suppression_hive_llap_llapproxy_swap_memory_usage true
Suppress Health Test: Unexpected Exits Whether to suppress the results of the Unexpected Exits heath test. The results of suppressed health tests are ignored when computing the overall health of the associated host, role or service, so suppressed health tests will not generate alerts. false role_health_suppression_hive_llap_llapproxy_unexpected_exits true
Suppress Health Test: LLAP Daemons Ready Check Whether to suppress the results of the LLAP Daemons Ready Check heath test. The results of suppressed health tests are ignored when computing the overall health of the associated host, role or service, so suppressed health tests will not generate alerts. false role_health_suppression_llapd_ready_status true

Service-Wide

Categories:

Advanced

Display Name Description Related Name Default Value API Name Required
Hive Auxiliary JARs Directory Directory containing auxiliary JARs used by Hive. This should be a directory location and not a classpath containing one or more JARs. This directory must be created and managed manually on hosts that run the Hive Metastore Server, HiveServer2, or the Hive CLI. The directory location is set in the environment as HIVE_AUX_JARS_PATH and will generally override the hive.aux.jars.path property set in XML files, even if hive.aux.jars.path is set in an advanced configuration snippet. hive_aux_jars_path_dir false
Bypass Hive Metastore Server Instead of talking to Hive Metastore Server for Metastore information, Hive clients will talk directly to the Metastore database. false hive_bypass_metastore_server false
Aborted Transaction Threshold Number of aborted transactions involving a particular table or partition before major compaction is initiated. hive.compactor.abortedtxn.threshold 1000 hive_compactor_abortedtxn_threshold true
Number of Threads Used by Compactor Number of compactor worker threads to run on this metastore instance. Can be different values on different Metastore instances. hive.compactor.worker.threads 5 hive_compactor_worker_threads true
Hive Service Advanced Configuration Snippet (Safety Valve) for core-site.xml For advanced use only, a string to be inserted into core-site.xml. Applies to configurations of all roles in this service except client configuration. hive_core_site_safety_valve false
Create Tables as Full ACID Whether the eligible tables should be created as full ACID by default. Does not apply to external tables, the ones using storage handlers, etc. hive.create.as.acid true hive_create_as_acid false
Create Tables as ACID Insert Only Whether the eligible tables should be created as ACID insert-only by default. Does not apply to external tables, the ones using storage handlers, etc. hive.create.as.insert.only true hive_create_as_insert_only false
Hive Copy Large File Size Smaller than this size, Hive uses a single-threaded copy; larger than this size, Hive uses DistCp. hive.exec.copyfile.maxsize 32 MiB hive_exec_copyfile_maxsize false
Base Directory for Hive Proto Hook The directory where hive proto hooks should write the events, should generally be location of query_data table under sys.db database. hive.hook.proto.base-directory /warehouse/tablespace/managed/hive/sys.db/query_data/ hive_hook_proto_base_directory false
Hive LLAP Service Environment Advanced Configuration Snippet (Safety Valve) For advanced use only, key-value pairs (one on each line) to be inserted into a role's environment. Applies to configurations of all roles in this service except client configuration. HIVE_LLAP_service_env_safety_valve false
Metastore Transactional Listener List A comma separated list of Java classes that implement the org.apache.hadoop.hive.metastore.MetaStoreEventListener interface. Both the metastore event and corresponding listener method will be invoked in the same JDO transaction. hive.metastore.transactional.event.listeners org.apache.hive.hcatalog.listener.DbNotificationListener hive_metastore_transactional_event_listeners false
Hive Replication Environment Advanced Configuration Snippet (Safety Valve) For advanced use only, key-value pairs (one on each line) to be inserted into the environment of Hive replication jobs. hive_replication_env_safety_valve false
Hive Service Advanced Configuration Snippet (Safety Valve) for hive-site.xml For advanced use only, a string to be inserted into hive-site.xml. Applies to configurations of all roles in this service except client configuration. hive_service_config_safety_valve false
Hive Replication Advanced Configuration Snippet (Safety Valve) for hive-site.xml For advanced use only, a string to be inserted into hive-site.xml. Applies to all Hive Replication jobs. hive_service_replication_config_safety_valve false
Use Locking Support concurrency and use locks, needed for Transactions. Requires Zookeeper. hive.support.concurrency true hive_support_concurrency false
Transaction Manager HiveTxnManager implementation used for managing transactions. hive.txn.manager org.apache.hadoop.hive.ql.lockmgr.DbTxnManager hive_txn_manager true
Transaction Maximum Open Batch Maximum number of transactions that can be fetched in one call to open_txns(). Increasing this will decrease the number of delta files created when streaming data into Hive. But it will also increase the number of open transactions at any given time, possibly impacting read performance. hive.txn.max.open.batch 1000 hive_txn_max_open_batch true
Transaction Strict Locking Mode In strict locking non-ACID resources use standard R/W lock semantics. hive.txn.strict.locking.mode false hive_txn_strict_locking_mode false
Transaction Timeout Time after which transactions are declared aborted if the client has not sent a heartbeat. hive.txn.timeout 5 minute(s) hive_txn_timeout true
Hive Client Advanced Configuration Snippet (Safety Valve) for navigator.client.properties For advanced use only, a string to be inserted into the client configuration for navigator.client.properties. navigator_client_config_safety_valve false
Hive Client Advanced Configuration Snippet (Safety Valve) for navigator.lineage.client.properties For advanced use only, a string to be inserted into the client configuration for navigator.lineage.client.properties. navigator_lineage_client_config_safety_valve false
System Group The group that this service's processes should run as. hive process_groupname true
System User The user that this service's processes should run as. hive process_username true
Hive Service Advanced Configuration Snippet (Safety Valve) for ranger-hive-audit.xml For advanced use only, a string to be inserted into ranger-hive-audit.xml. Applies to configurations of all roles in this service except client configuration. ranger_audit_safety_valve false
Hive Service Advanced Configuration Snippet (Safety Valve) for ranger-hive-security.xml For advanced use only, a string to be inserted into ranger-hive-security.xml. Applies to configurations of all roles in this service except client configuration. ranger_security_safety_valve false

Cloudera Navigator

Display Name Description Related Name Default Value API Name Required
Enable Audit Collection Enable collection of audit events from the service's roles. navigator.audit.enabled true navigator_audit_enabled false
Audit Event Filter Event filters are defined in a JSON object like the following: { "defaultAction" : ("accept", "discard"), "rules" : [ { "action" : ("accept", "discard"), "fields" : [ { "name" : "fieldName", "match" : "regex" } ] } ] } A filter has a default action and a list of rules, in order of precedence. Each rule defines an action, and a list of fields to match against the audit event. A rule is "accepted" if all the listed field entries match the audit event. At that point, the action declared by the rule is taken. If no rules match the event, the default action is taken. Actions default to "accept" if not defined in the JSON object. The following is the list of fields that can be filtered for Hive events:
  • userName: the user performing the action.
  • ipAddress: the IP from where the request originated.
  • operation: the Hive operation being performed.
  • databaseName: the databaseName for the operation.
  • tableName: the tableName for the operation.
The default Hive audit event filter discards HDFS directory events generated by Hive jobs that reference the /tmp directory.
navigator.event.filter comment: [ The default Hive audit event filter discards HDFS directory events , generated by Hive jobs that reference the /tmp directory. ], defaultAction: accept, rules: [ action: discard, fields: [ name: operation, match: QUERY , name: objectType, match: DFS_DIR , name: resourcePath, match: /tmp/hive-(?:.+)?/hive_(?:.+)?/-mr-.* ] ] navigator_audit_event_filter false
Audit Queue Policy Action to take when the audit event queue is full. Drop the event or shutdown the affected process. navigator.batch.queue_policy DROP navigator_audit_queue_policy false
Audit Event Tracker Configures the rules for event tracking and coalescing. This feature is used to define equivalency between different audit events. When events match, according to a set of configurable parameters, only one entry in the audit list is generated for all the matching events. Tracking works by keeping a reference to events when they first appear, and comparing other incoming events against the "tracked" events according to the rules defined here. Event trackers are defined in a JSON object like the following: { "timeToLive" : [integer], "fields" : [ { "type" : [string], "name" : [string] } ] } Where:
  • timeToLive: maximum amount of time an event will be tracked, in milliseconds. Must be provided. This defines how long, since it's first seen, an event will be tracked. A value of 0 disables tracking.
  • fields: list of fields to compare when matching events against tracked events.
Each field has an evaluator type associated with it. The evaluator defines how the field data is to be compared. The following evaluators are available:
  • value: uses the field value for comparison.
  • userName: treats the field value as a userNname, and ignores any host-specific data. This is useful for environment using Kerberos, so that only the principal name and realm are compared.
The following is the list of fields that can be used to compare Hive events:
  • operation: the Hive operation being performed.
  • username: the user performing the action.
  • ipAddress: the IP from where the request originated.
  • allowed: whether the operation was allowed or denied.
  • databaseName: the database affected by the operation.
  • tableName: the table or view affected by the operation.
  • objectType: the type of object affected by the operation.
  • resourcePath: the path of the resource affected by the operation.
navigator_event_tracker navigator_event_tracker false
Enable Lineage Collection Enable collection of lineage from the service's roles. true navigator_lineage_enabled false

Logs

Display Name Description Related Name Default Value API Name Required
Audit Log Directory Path to the directory where audit logs will be written. The directory will be created if it doesn't exist. audit_event_log_dir /var/log/hive/audit audit_event_log_dir false
Hive Lineage Log Directory The directory in which Hive lineage log files are written. lineage_event_log_dir /var/log/hive/lineage lineage_event_log_dir true
Hive Maximum Lineage Log File Size The maximum size, in megabytes, per log file for Hive lineage logs. Typically used by log4j or logback. max_lineage_log_file_size 100 MiB max_lineage_log_file_size false
Maximum Audit Log File Size Maximum size of audit log file in MB before it is rolled over. navigator.audit_log_max_file_size 100 MiB navigator_audit_log_max_file_size false
Number of Audit Logs to Retain Maximum number of rolled-over audit logs to retain. The logs are not deleted if they contain audit events that have not yet been propagated to the Audit Server. navigator.client.max_num_audit_log 10 navigator_client_max_num_audit_log false

Monitoring

Display Name Description Related Name Default Value API Name Required
Enable Service Level Health Alerts When set, Cloudera Manager will send alerts when the health of this service reaches the threshold specified by the EventServer setting eventserver_health_events_alert_threshold true enable_alerts false
Enable Configuration Change Alerts When set, Cloudera Manager will send alerts when this entity's configuration changes. false enable_config_alerts false
Healthy HiveServer2 Monitoring Thresholds The health test thresholds of the overall HiveServer2 health. The check returns "Concerning" health if the percentage of "Healthy" HiveServer2s falls below the warning threshold. The check is unhealthy if the total percentage of "Healthy" and "Concerning" HiveServer2s falls below the critical threshold. Warning: 99.0 %, Critical: 51.0 % hive_llap_hiveserver2s_healthy_thresholds false
LLAP Proxy Role Health Test When computing the overall HIVE_LLAP health, consider LLAP Proxy's health true hive_llap_llapproxy_health_enabled false
Service Triggers The configured triggers for this service. This is a JSON-formatted list of triggers. These triggers are evaluated as part as the health system. Every trigger expression is parsed, and if the trigger condition is met, the list of actions provided in the trigger expression is executed. Each trigger has the following fields:
  • triggerName (mandatory) - The name of the trigger. This value must be unique for the specific service.
  • triggerExpression (mandatory) - A tsquery expression representing the trigger.
  • streamThreshold (optional) - The maximum number of streams that can satisfy a condition of a trigger before the condition fires. By default set to 0, and any stream returned causes the condition to fire.
  • enabled (optional) - By default set to 'true'. If set to 'false', the trigger is not evaluated.
  • expressionEditorConfig (optional) - Metadata for the trigger editor. If present, the trigger should only be edited from the Edit Trigger page; editing the trigger here can lead to inconsistencies.
For example, the following JSON formatted trigger fires if there are more than 10 DataNodes with more than 500 file descriptors opened:[{"triggerName": "sample-trigger", "triggerExpression": "IF (SELECT fd_open WHERE roleType = DataNode and last(fd_open) > 500) DO health:bad", "streamThreshold": 10, "enabled": "true"}]See the trigger rules documentation for more details on how to write triggers using tsquery.The JSON format is evolving and may change and, as a result, backward compatibility is not guaranteed between releases.
[] service_triggers true
Service Monitor Client Config Overrides For advanced use only, a list of configuration properties that will be used by the Service Monitor instead of the current client configuration for the service. <property> <name>hive.metastore.client.socket.timeout</name> <value>60</value> </property> smon_client_config_overrides false
Service Monitor Derived Configs Advanced Configuration Snippet (Safety Valve) For advanced use only, a list of derived configuration properties that will be used by the Service Monitor instead of the default ones. smon_derived_configs_safety_valve false

Other

Display Name Description Related Name Default Value API Name Required
HBase Service Name of the HBase service that this Hive service instance depends on. hbase_service false
HDFS Service Name of the HDFS service that this HS2 service instance depends on hdfs_service true
Hive Bytes Per Reducer Size per reducer. If the input size is 10GiB and this is set to 1GiB, Hive will use 10 reducers. hive.exec.reducers.bytes.per.reducer 64 MiB hive_bytes_per_reducer false
Default File Format Default file format for CREATE TABLE statement. hive.default.fileformat TextFile hive_default_fileformat true
Default File Format for Managed Tables Default file format for CREATE TABLE statement applied to managed tables only. External tables will be created with default file format. Leaving this null will result in using the default file format for all tables. hive.default.fileformat.managed ORC hive_default_fileformat_managed false
Hive Max Reducers Max number of reducers to use. If the configuration parameter Hive Reduce Tasks is negative, Hive will limit the number of reducers to the value of this parameter. hive.exec.reducers.max 1009 hive_max_reducers false
Hive Reduce Tasks Default number of reduce tasks per job. Usually set to a prime number close to the number of available hosts. Ignored when mapred.job.tracker is "local". Hadoop sets this to 1 by default, while Hive uses -1 as the default. When set to -1, Hive will automatically determine an appropriate number of reducers for each job. mapred.reduce.tasks -1 hive_reduce_tasks false
Set User and Group Information In unsecure mode, setting this property to true will cause the Metastore Server to execute DFS operations using the client's reported user and group permissions. Cloudera Manager will set this for all clients and servers. hive.metastore.execute.setugi true hive_set_ugi true
LLAP app name LLAP app name llap_app_name llap0 hivellap_app_name false
hive.llap.daemon.service.hosts Explicitly specified hosts to use for LLAP scheduling. If it's left empty, LLAP app name will be used instead with a ZooKeeper based registry. hive.llap.daemon.service.hosts hivellap_daemon_service_hosts false
Hive Metastore Connector Name of the Hive Metastore Connector from the data context that this service instance depends on. hms_connector true
MapReduce Service MapReduce jobs are run against this service. mapreduce_yarn_service true
Ranger Plugin Trusted Proxy IP Address Accepts a list of IP addresses of proxy servers for trusting. ranger.plugin.hive.trusted.proxy.ipaddress ranger_plugin_trusted_proxy_ipaddress false
Ranger Plugin Use X-Forwarded for IP Address The parameter is used for identifying the originating IP address of a user connecting to a component through proxy for audit logs. ranger.plugin.hive.use.x-forwarded-for.ipaddress false ranger_plugin_use_x_forwarded_for_ipaddress false
Ranger Service Name of the Ranger service that this Hive service instance depends on ranger_service false
Tez Service Tez Service that the Hive on Tez uses for execution tez_service true
ZooKeeper Service Name of the ZooKeeper service that this Hive service instance depends on. zookeeper_service false

Proxy

Display Name Description Related Name Default Value API Name Required
Hive Metastore Access Control and Proxy User Groups Override This configuration overrides the value set for Hive Proxy User Groups configuration in HDFS service for use by Hive Metastore Server. Specify a comma-delimited list of groups that you want to allow access to Hive Metastore metadata and allow the Hive user to impersonate. A value of '*' allows all groups. The default value of empty inherits the value set for Hive Proxy User Groups configuration in the HDFS service. hadoop.proxyuser.hive.groups hive_proxy_user_groups_list false

Replication

Display Name Description Related Name Default Value API Name Required
Replica functions root directory Root directory on the replica warehouse where the repl sub-system will store jars from the primary warehouse hive.repl.replica.functions.root.dir hive_repl_replica_functions_root_dir false

Security

Display Name Description Related Name Default Value API Name Required
Enable LDAP Authentication for HiveServer2 When checked, LDAP-based authentication for users is enabled. false hiveserver2_enable_ldap_auth false
Enable TLS/SSL for HiveServer2 Encrypt communication between clients and HiveServer2 using Transport Layer Security (TLS) (formerly known as Secure Socket Layer (SSL)). hive.server2.use.SSL false hiveserver2_enable_ssl false
HiveServer2 TLS/SSL Server JKS Keystore File Password The password for the HiveServer2 JKS keystore file. hive.server2.keystore.password hiveserver2_keystore_password false
HiveServer2 TLS/SSL Server JKS Keystore File Location The path to the TLS/SSL keystore file containing the server certificate and private key used for TLS/SSL. Used when HiveServer2 is acting as a TLS/SSL server. The keystore must be in JKS format. hive.server2.keystore.path hiveserver2_keystore_path false
LDAP BaseDN This parameter is useful when authenticating against a non-Active Directory server, such as OpenLDAP. When set, this parameter is used to convert the username into the LDAP Distinguished Name (DN), so that the resulting DN looks like uid=username,this parameter. For example, if this parameter is set to "ou=People,dc=cloudera,dc=com", and the username passed in is "mike", the resulting authentication passed to the LDAP server look like "uid=mike,ou=People,dc=cloudera,dc=com". This parameter is mutually exclusive with Active Directory Domain. hive.server2.authentication.ldap.baseDN hiveserver2_ldap_basedn false
Active Directory Domain Use this field for Active Directory configurations only, when combined with a simple username value in the "LDAP Bind User Distinguished Name" field, it will result in a UPM of user@example.com used for search/bind operations for authenticated user lookups. hive.server2.authentication.ldap.Domain hiveserver2_ldap_domain false
LDAP URL The URL of the LDAP Server. The URL must be prefixed with ldap:// or ldaps:// . The URL can optionally specify a custom port if necessary, but by default the ldap:// will connect to port 389, and the ldaps:// will connect to port 636. Note that passwords will be in the clear if ldap:// is used, and by fall 2020 Active directory servers will no longer allow non LDAPS connections to bind to AD hosts with LDAP signing enabled. See microsoft knowledge document 935834 for more information. hive.server2.authentication.ldap.url hiveserver2_ldap_uri false
HiveServer2 TLS/SSL Client Trust Store File The location on disk of the trust store, in .jks format, used to confirm the authenticity of TLS/SSL servers that HiveServer2 might connect to. This is used when HiveServer2 is the client in a TLS/SSL connection. This trust store must contain the certificate(s) used to sign the service(s) connected to. If this parameter is not provided, the default list of well-known certificate authorities is used instead. hiveserver2_truststore_file false
HiveServer2 TLS/SSL Client Trust Store Password The password for the HiveServer2 TLS/SSL Certificate Trust Store File. This password is not required to access the trust store; this field can be left blank. This password provides optional integrity checking of the file. The contents of trust stores are certificates, and certificates are public information. hiveserver2_truststore_password false
Kerberos Principal Kerberos principal short name used by all roles of this service. hive kerberos_princ_name true
Ranger DFS Audit Path The DFS path on which Ranger audits are written. The special placeholder '${ranger_base_audit_url}' should be used as the prefix, in order to use the centralized location defined in the Ranger service. xasecure.audit.destination.hdfs.dir $ranger_base_audit_url/hive ranger_audit_hdfs_dir false
Ranger Audit DFS Spool Dir Spool directory for Ranger audits being written to DFS. xasecure.audit.destination.hdfs.batch.filespool.dir /var/log/hive/audit/hdfs/spool ranger_audit_hdfs_spool_dir false
Ranger Audit Solr Spool Dir Spool directory for Ranger audits being written to Solr. xasecure.audit.destination.solr.batch.filespool.dir /var/log/hive/audit/solr/spool ranger_audit_solr_spool_dir false
Ranger Policy Cache Directory The directory where Ranger security policies are cached locally. ranger.plugin.hive.policy.cache.dir /var/lib/ranger/hive/policy-cache ranger_policy_cache_dir false

Suppressions

Display Name Description Related Name Default Value API Name Required
Suppress Configuration Validator: CDH Version Validator Whether to suppress configuration warnings produced by the CDH Version Validator configuration validator. false role_config_suppression_cdh_version_validator true
Suppress Configuration Validator: Deploy Directory Whether to suppress configuration warnings produced by the Deploy Directory configuration validator. false role_config_suppression_client_config_root_dir true
Suppress Configuration Validator: Hive Client Advanced Configuration Snippet (Safety Valve) for hive-site.xml Whether to suppress configuration warnings produced by the Hive Client Advanced Configuration Snippet (Safety Valve) for hive-site.xml configuration validator. false role_config_suppression_hive_client_config_safety_valve true
Suppress Configuration Validator: Gateway Client Environment Advanced Configuration Snippet (Safety Valve) for hive-env.sh Whether to suppress configuration warnings produced by the Gateway Client Environment Advanced Configuration Snippet (Safety Valve) for hive-env.sh configuration validator. false role_config_suppression_hive_client_env_safety_valve true
Suppress Configuration Validator: Client Java Configuration Options Whether to suppress configuration warnings produced by the Client Java Configuration Options configuration validator. false role_config_suppression_hive_client_java_opts true
Suppress Configuration Validator: HiveServer2 Environment Advanced Configuration Snippet (Safety Valve) Whether to suppress configuration warnings produced by the HiveServer2 Environment Advanced Configuration Snippet (Safety Valve) configuration validator. false role_config_suppression_hive_hs2_env_safety_valve true
Suppress Configuration Validator: HiveServer2 Log Directory Whether to suppress configuration warnings produced by the HiveServer2 Log Directory configuration validator. false role_config_suppression_hive_log_dir true
Suppress Configuration Validator: Metrics Sample File Location Whether to suppress configuration warnings produced by the Metrics Sample File Location configuration validator. false role_config_suppression_hive_metrics_sample_file_location true
Suppress Configuration Validator: Restrict Load Bucketed Table Validator Whether to suppress configuration warnings produced by the Restrict Load Bucketed Table Validator configuration validator. false role_config_suppression_hive_restrict_load_bucketed_table_validator true
Suppress Configuration Validator: Restrict Unsafe Comparison Validator Whether to suppress configuration warnings produced by the Restrict Unsafe Comparison Validator configuration validator. false role_config_suppression_hive_restrict_unsafe_comparison_validator true
Suppress Configuration Validator: HiveServer2 Operations Log Directory Whether to suppress configuration warnings produced by the HiveServer2 Operations Log Directory configuration validator. false role_config_suppression_hive_server2_logging_operation_log_location true
Suppress Configuration Validator: Thrift port Whether to suppress configuration warnings produced by the Thrift port configuration validator. false role_config_suppression_hive_server2_thrift_http_port true
Suppress Configuration Validator: Hive Server Zookeeper Namespace Whether to suppress configuration warnings produced by the Hive Server Zookeeper Namespace configuration validator. false role_config_suppression_hive_server2_zookeeper_namespace true
Suppress Configuration Validator: hive.llap.daemon.logger Whether to suppress configuration warnings produced by the hive.llap.daemon.logger configuration validator. false role_config_suppression_hivellap_daemon_logger true
Suppress Configuration Validator: LLAP Daemon Java Options Whether to suppress configuration warnings produced by the LLAP Daemon Java Options configuration validator. false role_config_suppression_hivellap_daemon_opts true
Suppress Configuration Validator: Default query queues Whether to suppress configuration warnings produced by the Default query queues configuration validator. false role_config_suppression_hivellap_default_query_queues true
Suppress Configuration Validator: hive.execution.mode Whether to suppress configuration warnings produced by the hive.execution.mode configuration validator. false role_config_suppression_hivellap_execution_mode true
Suppress Configuration Validator: HiveServer2 Advanced Configuration Snippet (Safety Valve) for hive-site.xml Whether to suppress configuration warnings produced by the HiveServer2 Advanced Configuration Snippet (Safety Valve) for hive-site.xml configuration validator. false role_config_suppression_hivellap_hs2_config_safety_valve true
Suppress Configuration Validator: hive.llap.io.memory.mode Whether to suppress configuration warnings produced by the hive.llap.io.memory.mode configuration validator. false role_config_suppression_hivellap_io_memory_mode true
Suppress Configuration Validator: hive.llap.execution.mode Whether to suppress configuration warnings produced by the hive.llap.execution.mode configuration validator. false role_config_suppression_hivellap_llap_execution_mode true
Suppress Configuration Validator: LLAP Proxy Advanced Configuration Snippet (Safety Valve) for hive-site.xml Whether to suppress configuration warnings produced by the LLAP Proxy Advanced Configuration Snippet (Safety Valve) for hive-site.xml configuration validator. false role_config_suppression_hivellap_llapproxy_config_safety_valve true
Suppress Configuration Validator: Interactive Query Queue Whether to suppress configuration warnings produced by the Interactive Query Queue configuration validator. false role_config_suppression_hivellap_queue_name true
Suppress Configuration Validator: Allow custom queues Whether to suppress configuration warnings produced by the Allow custom queues configuration validator. false role_config_suppression_hivellap_tez_sessions_custom_queue_allowed true
Suppress Configuration Validator: Hive Downloaded Resources Directory Whether to suppress configuration warnings produced by the Hive Downloaded Resources Directory configuration validator. false role_config_suppression_hiveserver2_downloaded_resources_dir true
Suppress Configuration Validator: Hive Local Scratch Directory Whether to suppress configuration warnings produced by the Hive Local Scratch Directory configuration validator. false role_config_suppression_hiveserver2_exec_local_scratchdir true
Suppress Configuration Validator: Hive HDFS Scratch Directory Whether to suppress configuration warnings produced by the Hive HDFS Scratch Directory configuration validator. false role_config_suppression_hiveserver2_exec_scratchdir true
Suppress Configuration Validator: Fair Scheduler XML Advanced Configuration Snippet (Safety Valve) Whether to suppress configuration warnings produced by the Fair Scheduler XML Advanced Configuration Snippet (Safety Valve) configuration validator. false role_config_suppression_hiveserver2_fair_scheduler_safety_valve true
Suppress Configuration Validator: Java Configuration Options for HiveServer2 Whether to suppress configuration warnings produced by the Java Configuration Options for HiveServer2 configuration validator. false role_config_suppression_hiveserver2_java_opts true
Suppress Configuration Validator: HiveServer2 Load Balancer Whether to suppress configuration warnings produced by the HiveServer2 Load Balancer configuration validator. false role_config_suppression_hiveserver2_load_balancer true
Suppress Configuration Validator: Exclude Vectorized Input Formats Whether to suppress configuration warnings produced by the Exclude Vectorized Input Formats configuration validator. false role_config_suppression_hiveserver2_vectorized_input_format_excludes true
Suppress Configuration Validator: HiveServer2 WebUI Port Whether to suppress configuration warnings produced by the HiveServer2 WebUI Port configuration validator. false role_config_suppression_hiveserver2_webui_port true
Suppress Configuration Validator: HiveServer2 Advanced Configuration Snippet (Safety Valve) for core-site.xml Whether to suppress configuration warnings produced by the HiveServer2 Advanced Configuration Snippet (Safety Valve) for core-site.xml configuration validator. false role_config_suppression_hs2_core_site_safety_valve true
Suppress Configuration Validator: HiveServer2 Port Whether to suppress configuration warnings produced by the HiveServer2 Port configuration validator. false role_config_suppression_hs2_thrift_address_port true
Suppress Configuration Validator: LLAP Proxy Environment Advanced Configuration Snippet (Safety Valve) Whether to suppress configuration warnings produced by the LLAP Proxy Environment Advanced Configuration Snippet (Safety Valve) configuration validator. false role_config_suppression_llapproxy_role_env_safety_valve true
Suppress Configuration Validator: HiveServer2 Logging Advanced Configuration Snippet (Safety Valve) Whether to suppress configuration warnings produced by the HiveServer2 Logging Advanced Configuration Snippet (Safety Valve) configuration validator. false role_config_suppression_log4j_safety_valve true
Suppress Configuration Validator: Heap Dump Directory Whether to suppress configuration warnings produced by the Heap Dump Directory configuration validator. false role_config_suppression_oom_heap_dump_dir true
Suppress Configuration Validator: Custom Control Group Resources (overrides Cgroup settings) Whether to suppress configuration warnings produced by the Custom Control Group Resources (overrides Cgroup settings) configuration validator. false role_config_suppression_rm_custom_resources true
Suppress Configuration Validator: Role Triggers Whether to suppress configuration warnings produced by the Role Triggers configuration validator. false role_config_suppression_role_triggers true
Suppress Configuration Validator: HiveServer2 WebUI TLS/SSL Server JKS Keystore File Location Whether to suppress configuration warnings produced by the HiveServer2 WebUI TLS/SSL Server JKS Keystore File Location configuration validator. false role_config_suppression_ssl_server_keystore_location true
Suppress Configuration Validator: HiveServer2 WebUI TLS/SSL Server JKS Keystore File Password Whether to suppress configuration warnings produced by the HiveServer2 WebUI TLS/SSL Server JKS Keystore File Password configuration validator. false role_config_suppression_ssl_server_keystore_password true
Suppress Configuration Validator: Stacks Collection Directory Whether to suppress configuration warnings produced by the Stacks Collection Directory configuration validator. false role_config_suppression_stacks_collection_directory true
Suppress Configuration Validator: tez.history.logging.taskattempt-filters Whether to suppress configuration warnings produced by the tez.history.logging.taskattempt-filters configuration validator. false role_config_suppression_tez_interactive_history_logging_taskattempt_filters true
Suppress Parameter Validation: Audit Log Directory Whether to suppress configuration warnings produced by the built-in parameter validation for the Audit Log Directory parameter. false service_config_suppression_audit_event_log_dir true
Suppress Configuration Validator: Gateway Count Validator Whether to suppress configuration warnings produced by the Gateway Count Validator configuration validator. false service_config_suppression_gateway_count_validator true
Suppress Parameter Validation: Hive Auxiliary JARs Directory Whether to suppress configuration warnings produced by the built-in parameter validation for the Hive Auxiliary JARs Directory parameter. false service_config_suppression_hive_aux_jars_path_dir true
Suppress Configuration Validator: Client TLS/SSL In Use With LDAP Authentication Validator Whether to suppress configuration warnings produced by the Client TLS/SSL In Use With LDAP Authentication Validator configuration validator. false service_config_suppression_hive_client_ssl_recommended_with_ldap_auth_validator true
Suppress Parameter Validation: Hive Service Advanced Configuration Snippet (Safety Valve) for core-site.xml Whether to suppress configuration warnings produced by the built-in parameter validation for the Hive Service Advanced Configuration Snippet (Safety Valve) for core-site.xml parameter. false service_config_suppression_hive_core_site_safety_valve true
Suppress Parameter Validation: Default File Format for Managed Tables Whether to suppress configuration warnings produced by the built-in parameter validation for the Default File Format for Managed Tables parameter. false service_config_suppression_hive_default_fileformat_managed true
Suppress Parameter Validation: Base Directory for Hive Proto Hook Whether to suppress configuration warnings produced by the built-in parameter validation for the Base Directory for Hive Proto Hook parameter. false service_config_suppression_hive_hook_proto_base_directory true
Suppress Parameter Validation: Hive LLAP Service Environment Advanced Configuration Snippet (Safety Valve) Whether to suppress configuration warnings produced by the built-in parameter validation for the Hive LLAP Service Environment Advanced Configuration Snippet (Safety Valve) parameter. false service_config_suppression_hive_llap_service_env_safety_valve true
Suppress Parameter Validation: Metastore Transactional Listener List Whether to suppress configuration warnings produced by the built-in parameter validation for the Metastore Transactional Listener List parameter. false service_config_suppression_hive_metastore_transactional_event_listeners true
Suppress Configuration Validator: Hive on Spark Dependency Validator Whether to suppress configuration warnings produced by the Hive on Spark Dependency Validator configuration validator. false service_config_suppression_hive_on_spark_missing_dependency true
Suppress Configuration Validator: Hive Proxy Groups Validator Whether to suppress configuration warnings produced by the Hive Proxy Groups Validator configuration validator. false service_config_suppression_hive_proxy_groups_validator true
Suppress Parameter Validation: Hive Metastore Access Control and Proxy User Groups Override Whether to suppress configuration warnings produced by the built-in parameter validation for the Hive Metastore Access Control and Proxy User Groups Override parameter. false service_config_suppression_hive_proxy_user_groups_list true
Suppress Configuration Validator: Hive Ranger Validator Whether to suppress configuration warnings produced by the Hive Ranger Validator configuration validator. false service_config_suppression_hive_ranger_validator true
Suppress Parameter Validation: Replica functions root directory Whether to suppress configuration warnings produced by the built-in parameter validation for the Replica functions root directory parameter. false service_config_suppression_hive_repl_replica_functions_root_dir true
Suppress Parameter Validation: Hive Replication Environment Advanced Configuration Snippet (Safety Valve) Whether to suppress configuration warnings produced by the built-in parameter validation for the Hive Replication Environment Advanced Configuration Snippet (Safety Valve) parameter. false service_config_suppression_hive_replication_env_safety_valve true
Suppress Configuration Validator: Hive Sentry Validator Whether to suppress configuration warnings produced by the Hive Sentry Validator configuration validator. false service_config_suppression_hive_sentry_validator true
Suppress Parameter Validation: Hive Service Advanced Configuration Snippet (Safety Valve) for hive-site.xml Whether to suppress configuration warnings produced by the built-in parameter validation for the Hive Service Advanced Configuration Snippet (Safety Valve) for hive-site.xml parameter. false service_config_suppression_hive_service_config_safety_valve true
Suppress Parameter Validation: Hive Replication Advanced Configuration Snippet (Safety Valve) for hive-site.xml Whether to suppress configuration warnings produced by the built-in parameter validation for the Hive Replication Advanced Configuration Snippet (Safety Valve) for hive-site.xml parameter. false service_config_suppression_hive_service_replication_config_safety_valve true
Suppress Parameter Validation: LLAP app name Whether to suppress configuration warnings produced by the built-in parameter validation for the LLAP app name parameter. false service_config_suppression_hivellap_app_name true
Suppress Parameter Validation: hive.llap.daemon.service.hosts Whether to suppress configuration warnings produced by the built-in parameter validation for the hive.llap.daemon.service.hosts parameter. false service_config_suppression_hivellap_daemon_service_hosts true
Suppress Configuration Validator: HiveServer2 Count Validator Whether to suppress configuration warnings produced by the HiveServer2 Count Validator configuration validator. false service_config_suppression_hiveserver2_count_validator true
Suppress Parameter Validation: HiveServer2 TLS/SSL Server JKS Keystore File Password Whether to suppress configuration warnings produced by the built-in parameter validation for the HiveServer2 TLS/SSL Server JKS Keystore File Password parameter. false service_config_suppression_hiveserver2_keystore_password true
Suppress Parameter Validation: HiveServer2 TLS/SSL Server JKS Keystore File Location Whether to suppress configuration warnings produced by the built-in parameter validation for the HiveServer2 TLS/SSL Server JKS Keystore File Location parameter. false service_config_suppression_hiveserver2_keystore_path true
Suppress Parameter Validation: LDAP BaseDN Whether to suppress configuration warnings produced by the built-in parameter validation for the LDAP BaseDN parameter. false service_config_suppression_hiveserver2_ldap_basedn true
Suppress Parameter Validation: Active Directory Domain Whether to suppress configuration warnings produced by the built-in parameter validation for the Active Directory Domain parameter. false service_config_suppression_hiveserver2_ldap_domain true
Suppress Parameter Validation: LDAP URL Whether to suppress configuration warnings produced by the built-in parameter validation for the LDAP URL parameter. false service_config_suppression_hiveserver2_ldap_uri true
Suppress Parameter Validation: HiveServer2 TLS/SSL Client Trust Store File Whether to suppress configuration warnings produced by the built-in parameter validation for the HiveServer2 TLS/SSL Client Trust Store File parameter. false service_config_suppression_hiveserver2_truststore_file true
Suppress Parameter Validation: HiveServer2 TLS/SSL Client Trust Store Password Whether to suppress configuration warnings produced by the built-in parameter validation for the HiveServer2 TLS/SSL Client Trust Store Password parameter. false service_config_suppression_hiveserver2_truststore_password true
Suppress Parameter Validation: Kerberos Principal Whether to suppress configuration warnings produced by the built-in parameter validation for the Kerberos Principal parameter. false service_config_suppression_kerberos_princ_name true
Suppress Parameter Validation: Hive Lineage Log Directory Whether to suppress configuration warnings produced by the built-in parameter validation for the Hive Lineage Log Directory parameter. false service_config_suppression_lineage_event_log_dir true
Suppress Configuration Validator: LLAP Proxy Count Validator Whether to suppress configuration warnings produced by the LLAP Proxy Count Validator configuration validator. false service_config_suppression_llapproxy_count_validator true
Suppress Parameter Validation: Audit Event Filter Whether to suppress configuration warnings produced by the built-in parameter validation for the Audit Event Filter parameter. false service_config_suppression_navigator_audit_event_filter true
Suppress Parameter Validation: Hive Client Advanced Configuration Snippet (Safety Valve) for navigator.client.properties Whether to suppress configuration warnings produced by the built-in parameter validation for the Hive Client Advanced Configuration Snippet (Safety Valve) for navigator.client.properties parameter. false service_config_suppression_navigator_client_config_safety_valve true
Suppress Parameter Validation: Audit Event Tracker Whether to suppress configuration warnings produced by the built-in parameter validation for the Audit Event Tracker parameter. false service_config_suppression_navigator_event_tracker true
Suppress Parameter Validation: Hive Client Advanced Configuration Snippet (Safety Valve) for navigator.lineage.client.properties Whether to suppress configuration warnings produced by the built-in parameter validation for the Hive Client Advanced Configuration Snippet (Safety Valve) for navigator.lineage.client.properties parameter. false service_config_suppression_navigator_lineage_client_config_safety_valve true
Suppress Parameter Validation: System Group Whether to suppress configuration warnings produced by the built-in parameter validation for the System Group parameter. false service_config_suppression_process_groupname true
Suppress Parameter Validation: System User Whether to suppress configuration warnings produced by the built-in parameter validation for the System User parameter. false service_config_suppression_process_username true
Suppress Parameter Validation: Ranger DFS Audit Path Whether to suppress configuration warnings produced by the built-in parameter validation for the Ranger DFS Audit Path parameter. false service_config_suppression_ranger_audit_hdfs_dir true
Suppress Parameter Validation: Ranger Audit DFS Spool Dir Whether to suppress configuration warnings produced by the built-in parameter validation for the Ranger Audit DFS Spool Dir parameter. false service_config_suppression_ranger_audit_hdfs_spool_dir true
Suppress Parameter Validation: Hive Service Advanced Configuration Snippet (Safety Valve) for ranger-hive-audit.xml Whether to suppress configuration warnings produced by the built-in parameter validation for the Hive Service Advanced Configuration Snippet (Safety Valve) for ranger-hive-audit.xml parameter. false service_config_suppression_ranger_audit_safety_valve true
Suppress Parameter Validation: Ranger Audit Solr Spool Dir Whether to suppress configuration warnings produced by the built-in parameter validation for the Ranger Audit Solr Spool Dir parameter. false service_config_suppression_ranger_audit_solr_spool_dir true
Suppress Parameter Validation: Ranger Plugin Trusted Proxy IP Address Whether to suppress configuration warnings produced by the built-in parameter validation for the Ranger Plugin Trusted Proxy IP Address parameter. false service_config_suppression_ranger_plugin_trusted_proxy_ipaddress true
Suppress Parameter Validation: Ranger Policy Cache Directory Whether to suppress configuration warnings produced by the built-in parameter validation for the Ranger Policy Cache Directory parameter. false service_config_suppression_ranger_policy_cache_dir true
Suppress Parameter Validation: Hive Service Advanced Configuration Snippet (Safety Valve) for ranger-hive-security.xml Whether to suppress configuration warnings produced by the built-in parameter validation for the Hive Service Advanced Configuration Snippet (Safety Valve) for ranger-hive-security.xml parameter. false service_config_suppression_ranger_security_safety_valve true
Suppress Parameter Validation: Service Triggers Whether to suppress configuration warnings produced by the built-in parameter validation for the Service Triggers parameter. false service_config_suppression_service_triggers true
Suppress Parameter Validation: Service Monitor Client Config Overrides Whether to suppress configuration warnings produced by the built-in parameter validation for the Service Monitor Client Config Overrides parameter. false service_config_suppression_smon_client_config_overrides true
Suppress Parameter Validation: Service Monitor Derived Configs Advanced Configuration Snippet (Safety Valve) Whether to suppress configuration warnings produced by the built-in parameter validation for the Service Monitor Derived Configs Advanced Configuration Snippet (Safety Valve) parameter. false service_config_suppression_smon_derived_configs_safety_valve true
Suppress Health Test: LLAP Proxy Health Whether to suppress the results of the LLAP Proxy Health heath test. The results of suppressed health tests are ignored when computing the overall health of the associated host, role or service, so suppressed health tests will not generate alerts. false service_health_suppression_hive_llap_hive_llap_llapproxy_health true
Suppress Health Test: HiveServer2 Health Whether to suppress the results of the HiveServer2 Health heath test. The results of suppressed health tests are ignored when computing the overall health of the associated host, role or service, so suppressed health tests will not generate alerts. false service_health_suppression_hive_llap_hiveserver2s_healthy true