Enable Health Alerts for This Host |
When set, Cloudera Manager will send alerts when the health of this host reaches the threshold specified by the EventServer setting
eventserver_health_events_alert_threshold |
|
false |
enable_alerts |
false |
Enable Configuration Change Alerts |
When set, Cloudera Manager will send alerts when this entity's configuration changes. |
|
false |
enable_config_alerts |
false |
Cloudera Manager Agent Log Directory Free Space Monitoring Absolute Thresholds |
The health check thresholds for monitoring of free space on the filesystem that contains the Cloudera Manager Agent's log
directory. |
|
Warning: 2 GiB, Critical: 1 GiB |
host_agent_log_directory_free_space_absolute_thresholds |
false |
Cloudera Manager Agent Log Directory Free Space Monitoring Percentage Thresholds |
The health check thresholds for monitoring of free space on the filesystem that contains the Cloudera Manager Agent's log directory.
Specified as a percentage of the capacity on that filesystem. This setting is not used if a Cloudera Manager Agent Log Directory Free Space Monitoring Absolute Thresholds setting is configured. |
|
Warning: Never, Critical: Never |
host_agent_log_directory_free_space_percentage_thresholds |
false |
Cloudera Manager Agent Parcel Directory Free Space Monitoring Absolute Thresholds |
The health check thresholds for monitoring of free space on the filesystem that contains the Cloudera Manager Agent's parcel
directory. |
|
Warning: 10 GiB, Critical: 5 GiB |
host_agent_parcel_directory_free_space_absolute_thresholds |
false |
Cloudera Manager Agent Parcel Directory Free Space Monitoring Percentage Thresholds |
The health check thresholds for monitoring of free space on the filesystem that contains the Cloudera Manager Agent's parcel
directory. Specified as a percentage of the capacity on that filesystem. This setting is not used if a Cloudera Manager Agent Parcel Directory Free Space Monitoring Absolute Thresholds setting is
configured. |
|
Warning: Never, Critical: Never |
host_agent_parcel_directory_free_space_percentage_thresholds |
false |
Cloudera Manager Agent Process Directory Free Space Monitoring Absolute Thresholds |
The health check thresholds for monitoring of free space on the filesystem that contains the Cloudera Manager Agent's process
directory. |
|
Warning: 200 MiB, Critical: 100 MiB |
host_agent_process_directory_free_space_absolute_thresholds |
false |
Cloudera Manager Agent Process Directory Free Space Monitoring Percentage Thresholds |
The health check thresholds for monitoring of free space on the filesystem that contains the Cloudera Manager Agent's process
directory. Specified as a percentage of the capacity on that filesystem. This setting is not used if a Cloudera Manager Agent Process Directory Free Space Monitoring Absolute Thresholds setting is
configured. |
|
Warning: Never, Critical: Never |
host_agent_process_directory_free_space_percentage_thresholds |
false |
Host Entropy Thresholds |
The health check thresholds for the available entropy on the host. |
|
Warning: 100.0, Critical: 50.0 |
host_available_entropy_thresholds |
false |
Host Clock Offset Thresholds |
The thresholds for the host clock offset health test. The test compares this threshold against the absolute value of the clock
offset reported by the host's NTP service from the 'ntpdc -np' or 'chronyc sources' command. Setting both the warning and critical threshold values to never turns off collection of the clock offset
by the Cloudera Manager Agent. If NTP is not in use, both threshold values should be set to never. Cloudera recommends using NTP for time synchronization of Hadoop clusters. |
|
Warning: 3 second(s), Critical: 10 second(s) |
host_clock_offset_thresholds |
false |
Default Process Swap Memory Thresholds |
The default health test thresholds on the swap memory usage of the processes on the host. This value is used if process level
threshold is equal to the default. |
|
Warning: 200.0 page(s), Critical: Never |
host_default_proc_memswap_thresholds |
false |
Disk Device Collection Exclusion Regex |
The regular expression used to filter monitored disk devices and partitions. Disk device (for example, sda) and partition (for
example, sda2) names that match this pattern will be excluded from metric collection. |
|
^$ |
host_disk_collection_filter |
false |
Host DNS Resolution Duration Thresholds |
The health check thresholds for the host DNS resolution duration. |
|
Warning: 1 second(s), Critical: Never |
host_dns_resolution_duration_thresholds |
false |
Hostname and Canonical Name Health Check |
Whether the hostname and canonical names for this host are consistent when checked from a Java process. |
|
true |
host_dns_resolution_enabled |
false |
Filesystem Collection Exclusion Regex |
The regular expression used to filter monitored filesystems. Mountpoints for filesystems (for example, /data/1) that match this
pattern will be excluded from metric collection. |
|
^$ |
host_fs_collection_filter |
false |
Host Memory Swapping Thresholds |
The health test thresholds of the number of pages swapped out on the host in the last 15 minutes |
|
Warning: 200.0 page(s), Critical: Never |
host_memswap_thresholds |
false |
Host Memory Swapping Check Window |
The amount of time over which the memory swapping test checks for pages swapped. |
|
15 minute(s) |
host_memswap_window |
false |
Host Network Frame Error Test Minimum Required Packets |
The minimum number of received packets that must be received within the test window for this test to return "Bad" health. If less
that this number of packets is received during the test window, the health check will never return "Bad" health. |
|
0 |
host_network_frame_errors_floor |
false |
Host Network Frame Error Percentage Thresholds |
The health check thresholds for the percentage of received packets that are frame errors. |
|
Warning: Any, Critical: 0.5 % |
host_network_frame_errors_thresholds |
false |
Host Network Frame Error Check Window |
The amount of time over which the host frame error checks for frame errors. |
|
15 minute(s) |
host_network_frame_errors_window |
false |
Network Interface Collection Exclusion Regex |
The regular expression used to filter monitored network interfaces. Network interfaces that match this pattern will be excluded from
metric collection. |
|
^lo$ |
host_network_interface_collection_filter |
false |
Host's Network Interfaces Slow Link Modes Thresholds |
The thresholds for the health check of the number of network interfaces that appear to be operating at less than full speed. |
|
Warning: Any, Critical: Never |
host_network_interfaces_slow_mode_thresholds |
false |
Network Interface Expected Duplex Mode |
The expected duplex mode for network interfaces. |
|
Full |
host_nic_expected_duplex_mode |
false |
Network Interface Expected Link Speed |
The expected network interface link speed. |
|
1000 |
host_nic_expected_speed |
false |
Host Process Health Test |
Enables the health test that the host's process state is consistent with the role configuration |
|
true |
host_scm_health_enabled |
false |
Host Triggers |
The configured triggers for this host. This is a JSON-formatted list of triggers. These triggers are evaluated as part as the health
system. Every trigger expression is parsed, and if the trigger condition is met, the list of actions provided in the trigger expression is executed. Each trigger has the following fields:
- triggerName (mandatory) - The name of the trigger. This value must be unique for the specific host.
- triggerExpression (mandatory) - A tsquery expression representing the trigger.
- streamThreshold (optional) - The maximum number of streams that can satisfy a condition of a trigger before the condition
fires. By default set to 0, and any stream returned causes the condition to fire.
- enabled (optional) - By default set to 'true'. If set to 'false', the trigger is not evaluated.
- expressionEditorConfig (optional) - Metadata for the trigger editor. If present, the trigger should only be edited from the
Edit Trigger page; editing the trigger here can lead to inconsistencies.
For example, the following JSON formatted trigger fires if the host wait time exceeds 500 ms:[{"triggerName": "sample-trigger", "triggerExpression": "IF (select await_time
where hostname=$HOSTNAME and last(await_time) > 500ms) DO health:concerning", "streamThreshold": 0, "enabled": "true"}]See the trigger rules documentation for more details on how to write
triggers using tsquery.The JSON format is evolving and may change and, as a result, backward compatibility is not guaranteed between releases. |
|
[] |
host_triggers |
true |