User Guide
Also available as:
PDF

Configuration Guidelines

The following sections describe configuration properties related to SmartSense components and provide tuning guidelines.

The "Default Value" of a parameter is listed as "(no value)" if by default the parameter is set to an empty value. In order to set the parameter, find it in the Ambari configuration tab listed and set it to a desired value.

The "Default Value" of a parameter is listed as "(unspecified)" if the parameter is unset by default. In order to set the parameter, you must add it as a custom configuration.

HST Server

The following configuration properties are available for HST server:

Table 3.2. HST Server Configuration Properties

Property NameDescriptionWhere to ConfigureGuidelines

customer.smartsense.id

Your SmartSense ID uniquely identifies your account. You can obtain it from Hortonworks Support. This is a mandatory field during SmartSense installation.

Type: string

Default Value: (unspecified)

Ambari Config:

Basic

Config File:

/etc/hst/conf/hst-server.ini

You can obtain your existing SmartSense ID from the Hortonworks Support portal.

customer.account.name

The name of your organization as it is registered with Hortonworks Support. This is a mandatory field during SmartSense setup and it is one of the important identifiers for clusters belonging to the same customer..

Type: string

Default Value: (unspecified)

Ambari Config:

Basic

Config File:

/etc/hst/conf/hst-server.ini

You must enter the organization name exactly as it is registered in the Hortonworks Support portal.

customer.notification.email

Email address used to send bundle upload and recommendation availability notifications. This is a mandatory field during SmartSense setup.

Type: string

Default Value: (unspecified)

Ambari Config:

Basic

Config File:

/etc/hst/conf/hst-server.ini

Check your junk mailbox in case you do not receive notifications.

customer.enable.flex.subscription

Enables Flex Subscription for the cluster.

Type: boolean

Default Value: false

Ambari Config:

Basic

Config File:

/etc/hst/conf/hst-server.ini

Enable only if you have a valid Flex Subscription ID obtained from Hortonworks Support.

customer.flex.subscription.id

Your Flex Subscription ID obtained from Hortonworks Support. Flex Subscription offers flexible support subscription.

Type: string

Default Value: (unspecified)

Ambari Config:

Basic

Config File:

/etc/hst/conf/hst-server.ini

Contact Hortonworks Support to obtain a Flex Subscription ID. When passing the ID, you must also enable flex subscription.

server.storage.dir

Directory used by HST server for storing bundles.

Type: string

Default Value: /var/lib/smartsense/hst-server/data

Ambari Config:

Basic

Config File:

/etc/hst/conf/hst-server.ini

Use a non-root partition for hosting this directory. For reliable operations, we recommend that you have at least 10GB of free space on that partition.

server.tmp.dir

Directory used by HST server for temporary operations.

Type: string

Default Value: /var/lib/smartsense/hst-server/tmp

Ambari Config:

Basic

Config File:

/etc/hst/conf/hst-server.ini

Use a non-root partition for hosting this directory. For reliable operations, we recommend that you have at least 10GB of free space on that partition.

server.port

Port to access the HST server web interface and API.

Type: int

Default Value: 9000

Ambari Config:

Operations

Config File:

/etc/hst/conf/hst-server.ini

This port is internally used for HST operations. Change only if port 9000 is already in use or cannot be unblocked, or if SSL needs a different port. This has no impact on SmartSense Ambari View.

server.max.heap

Maximum heap size (in MB) allocated for the HST server process.

Type: int

Default Value: 2048

Ambari Config:

Advanced > Advanced hst-server-conf

Config File:

/etc/hst/conf/hst-server.ini

Usually 2048 MB is sufficient for clusters up to 500 nodes. Tuning might help if cluster has more than 500 nodes or if you encounter OOM errors on the server side.

agent.request.processing.timeout

Agent request processing timeout (in seconds). This usually indicates the total time for agent capture to finish.

Type: int

Default Value: 7200

Ambari Config:

Operations

Config File:

/etc/hst/conf/hst-server.ini

Increase the capture timeout to more than 120 minutes if you are capturing more than 4-5 services or have huge logs for support bundle captures.

You may also want to increase this if captures are timing out.

agent.request.syncup.interval

Interval (in seconds) after submitting data collection request in which all the data collections requests from various agents are treated as part of same bundle. In other words, this determines the maximum time for any agent to sync back with server on capture request.

If multiple agents join data collection process within this interval, they will be treated as part of same bundle. If any agent joins data collection after this interval, it will be treated as another bundle.

Type: int

Default Value: 180

Ambari Config:

Operations

Config File:

/etc/hst/conf/hst-server.ini

Default value is suitable for most clusters.

In cases where Ambari server and agent requests are slow and SmartSense bundle collection shows unreported agents in every bundle collection, increasing this interval may help.

client.threadpool.size.max

Server thread pool size to handle bundle requests.

Type: int

Default Value: 40

Ambari Config:

Operations

Config File:

/etc/hst/conf/hst-server.ini

Default value is suitable for most clusters

Consider increasing this property if you see multiple agent upload requests timing out on a large cluster with more than 500 nodes.

gateway.host

Fully qualified domain name of the host where the SmartSense Gateway process has been deployed and is running.

Type: string

Default Value: embedded

Ambari Config:

Gateway

Config File:

/etc/hst/conf/hst-server.ini

Keep the default if your HST server has outbound internet access to reach smartsense.hortonworks.com. Otherwise, set up a separate standalone gateway which has outbound access.

gateway.port

Port on which the SmartSense Gateway is listening and through which data is transferred. It is set up with two-way SSL. This port is not applicable for embedded gateway.

Type: int

Default Value: (no value)

Ambari Config:

Gateway

Config File:

/etc/hst/conf/hst-server.ini

This port is used for internal communication between the gateway and HST server.

Change this only if this port is already in use or cannot be unblocked. Note that if you change this port, you must update a similar property in the gateway.

gateway.registration.port

Port which is used by clients to register with the gateway. Data is not transferred through this port. It is set up with one-way SSL. This port is not applicable for embedded gateway.

Type: int

Default Value: (no value)

Ambari Config:

Gateway

Config File:

/etc/hst/conf/hst-server.ini

This port is used for internal communication between the gateway and HST server.

Change only if this port is already in use or cannot be unblocked. Note that if you change this port, you must update a similar property in the Gateway.

hst_log_dir

Directory where SmartSense log files are created.

Type: string

Default Value: /var/log/hst

Ambari Config:

Advanced > Advanced hst-log4j

Config File:

/etc/hst/conf/log4j.properties

Changing this setting is usually not recommended. If you change it, you must provide read/write/create permissions for this directory to Ambari Agent user.

hst_max_file_size

Maximum size of SmartSense HST log files.

Type: int

Default Value: 30

Ambari Config:

Advanced > Advanced hst-log4j

Config File:

/etc/hst/conf/log4j.properties

Default value is suitable for most clusters.

Check available storage capacity before updating this property.

hst_max_backup_index

Maximum number of HST log files.

Type: int

Default Value: 10

Ambari Config:

Advanced > Advanced hst-log4j

Config File:

/etc/hst/conf/log4j.properties

Increase this number to keep the record of older logs. Check available storage capacity before updating this property.

java.home

Path to the JAVA home for HST server.

Type: string

Default Value: (no value)

Ambari Config:

Advanced > Custom hst-server-conf

Config File:

/etc/hst/conf/hst-server.ini

This setting is automatically configured from Ambari env settings and usually there is no reason to change it.

We recommend that you use the latest 1.7/1.8 JAVA versions with up-to-date security updates. For more security we recommend that you have unlimited JCE policy installed.

derby.system.home

Home directory path for Derby database used internally by HST server.

Type: string

Default Value: /var/lib/smartsense/hst-server/hstDB

Ambari Config:

Advanced > Custom hst-server-conf

Config File:

/etc/hst/conf/hst-server.ini

Default value is suitable for most clusters.

This property should only be changed during the HST server setup. If you change thus after HST server is already set up, remember to make a backup and move existing data to the new location.

bundle.monitor.interval

Interval (in seconds) determining how often a bundle is checked for completeness. After every interval, data uploaded from agents will be collated into a single bundle. When data from all agents is collected into a bundle, the bundle is marked as completed.

Type: int

Default Value: 20

Ambari Config:

Advanced > Custom hst-server-conf

Config File:

/etc/hst/conf/hst-server.ini

Default value is suitable for most clusters.

On very large clusters (with more than 1000 nodes) if bundle collection causes performance issues with the default configuration, this interval can be increased to one minute to minimize file compressions/decompressions.

bundle.alert.progress.timeout.percent

This percentage of bundle processing for which the bundle is failing will raise an alert.

Type: float

Default Value: 0.6

Ambari Config:

Advanced > Custom hst-server-conf

Config File:

/etc/hst/conf/hst-server.ini

Default value is suitable for most clusters.

If the bundle is failing during capture or processing, you can adjust the percentage of processing for which alert will be issued.

server.cleanup.task.interval

Time in hours to execute server cleanup tasks (clean up stale/cancelled bundle temp data).

Type: int

Default Value: 1

Ambari Config:

Advanced > Custom hst-server-conf

Config File:

/etc/hst/conf/hst-server.ini

Default value is suitable for all clusters.

security.server.two_way_ssl.port

Port for two-way SSL communication between HST server and HST agents. This port is used internally for HST operations.

Type: int

Default Value: 9441

Ambari Config:

Advanced > Custom hst-server-conf

Config File:

/etc/hst/conf/hst-server.ini

Change only if port 9441 is already in use or cannot be unblocked. This has no impact on SmartSense Ambari View.

security.server.one_way_ssl.port

Port for one-way SSL communication between HST server and HST agents. This port is usually required during two-way SSL setup. This port is used internally for HST operations.

Type: int

Default Value: 9440

Ambari Config:

Advanced > Custom hst-server-conf

Config File:

/etc/hst/conf/hst-server.ini

Change only if port 9442 is already in use or cannot be unblocked. This has no impact on SmartSense Ambari View.

security.openssl.digest.algorithms

Permitted algorithms for SSL encryption.

Type: string

Default Value: sha256,sha384,sha512,sha,sha1,md5

Ambari Config:

Advanced > Custom hst-server-conf

Config File:

/etc/hst/conf/hst-server.ini

Not required to modify as sha256,sha512 are available and provide strong encryption.

Change only if there are very specific security requirements that can not be met by sha256/sha512.

server.connection.max.idle.millis

The maximum period in milliseconds that a connection may be idle before it is closed.

Type: int

Default Value: 900000

Ambari Config:

Advanced > Custom hst-server-conf

Config File:

/etc/hst/conf/hst-server.ini

Update this if you see too many open threads in idle state on the HST server.

security.server.disabled.ciphers

A comma-separated list of disabled ciphers for SSL.

Type: string

Default Value: (no value)

Ambari Config:

Advanced > Custom hst-server-conf

Config File:

/etc/hst/conf/hst-server.ini

Weaker ciphers are already disabled. Change only if you have very specific security requirements.

security.server.disabled.protocols

A comma-separated list of disabled protocols for SSL.

Type: string

Default Value: (no value)

Ambari Config:

Advanced > Custom hst-server-conf

Config File:

/etc/hst/conf/hst-server.ini

Weaker ciphers are already disabled. Change only if you have very specific security requirements.

upload.permits

Agents capture data and upload it to the HST server which assembles it together into a single bundle. This property defines the number of concurrent uploads allowed from agent to server.

Type: int

Default Value: 10

Ambari Config:

Advanced > Custom hst-server-conf

Config File:

/etc/hst/conf/hst-server.ini

This property may need to be increased if agent upload requests are timing out on a cluster with more than 500 nodes.

upload.initiate.timeout

Agents capture data and upload it to the HST server which assembles it together into a single bundle. Upload will fail if not initiated within the timeout window (in seconds) defined in this property.

Type: int

Default Value: 20

Ambari Config:

Advanced > Custom hst-server-conf

Config File:

/etc/hst/conf/hst-server.ini

This property may need to be increased if agent upload requests are timing out on a cluster with more than 500 nodes.

bundle.keepuploaded

This tells the HST server whether to keep bundles received from agents even after merging. If set to false, the agent bundles are deleted after merging.

Type: boolean

Default Value: false

Ambari Config:

Advanced > Custom hst-server-conf

Config File:

/etc/hst/conf/hst-server.ini

Set this to TRUE if you have to inspect the agent bundles for debugging purposes. Note that this will require plenty of available disk space.

bundle.purge.enabled

Enables a daemon process to purge old bundles. By default, the daemon process cleans up old bundles to efficiently use the disk space.

Type: boolean

Default Value: true

Ambari Config:

Advanced > Custom hst-server-conf

Config File:

/etc/hst/conf/hst-server.ini

We recommend not to disable this process as it will require a large amount of additional disk space.

bundle.min.retention.days

Number of days to keep the bundle before soft purging. Bundles will be soft purged after the defined number of retention days: the bundle file will be deleted and the DB entries will be soft deleted.

Type: int

Default Value: 30

Ambari Config:

Advanced > Custom hst-server-conf

Config File:

/etc/hst/conf/hst-server.ini

Update this if you want to keep bundles for longer time (to keep records) or for shorter time (to reduce storage utilization).

bundle.min.force.purge.retention.days

Number of days to keep the bundle before hard purging. Bundles will be hard purged after the defined number of retention days: the DB entries of bundle data along with associated recommendations will be cleaned up.

Type: int

Default Value: 90

Ambari Config:

Advanced > Custom hst-server-conf

Config File:

/etc/hst/conf/hst-server.ini

Default value is suitable for all clusters.

bundle.purge.threadpool.size

Thread pool used for purging hundreds of bundles.

Type: int

Default Value: 1

Ambari Config:

Advanced > Custom hst-server-conf

Config File:

/etc/hst/conf/hst-server.ini

Default value is suitable for all clusters.

bundle.purge.interval

The frequency (in hours) with which to run the purge process.

Type: int

Default Value: 24

Ambari Config:

Advanced > Custom hst-server-conf

Config File:

/etc/hst/conf/hst-server.ini

The default setting (once per day) is sufficient unless you have tens of bundles created daily.

bundle.validity.days

Bundle validity days for retrieving recommendations. After this number of days, a bundle will no longer be considered for retrieving recommendations.

Type: int

Default Value: 15

Ambari Config:

Advanced > Custom hst-server-conf

Config File:

/etc/hst/conf/hst-server.ini

We recommend not to increase this beyond default because older bundles might not provide the latest status of the cluster.

recommendation.expiry

Recommendation actions such as "Apply" are not permitted on bundles which are older than this number of days.

Type: int

Default Value: 30

Ambari Config:

Advanced > Custom hst-server-conf

Config File:

/etc/hst/conf/hst-server.ini

We recommend that you capture a new bundle and get new recommendations instead of referring to older recommendations.

recommendation.history.expiry

Recommendation history actions are not permitted on bundles older than this number of days.

Type: int

Default Value: 90

Ambari Config:

Advanced > Custom hst-server-conf

Config File:

/etc/hst/conf/hst-server.ini

We recommend that you capture a new bundle and get new recommendations instead of referring to older recommendations. Update this value if you have to refer to earlier actions.

recommendation.auto.download.bundle.expiry

If recommendations are not received, HST server will stop trying to retrieve recommendations after this number of days.

Type: int

Default Value: 7

Ambari Config:

Advanced > Custom hst-server-conf

Config File:

/etc/hst/conf/hst-server.ini

Change this only if you have a very specific requirement and want to stop requesting for recommendations earlier than after 7 days.

recommendation.auto.download.interval

Interval (in seconds) for retrieving recommendations. By default, recommendations are retrieved every 300 seconds.

Type: int

Default Value: 300

Ambari Config:

Advanced > Custom hst-server-conf

Config File:

/etc/hst/conf/hst-server.ini

Change this only if you have a very specific requirement. Increase this if you want to reduce the frequency of retry attempts.

recommendation.feedback.push.interval

Interval (in seconds) for submitting customer feedback for recommendations. By default, HST server will submit feedback to Hortonworks every 30 minutes if new feedback is available.

Type: int

Default Value: 1800

Ambari Config:

Advanced > Custom hst-server-conf

Config File:

/etc/hst/conf/hst-server.ini

Default value is suitable for most clusters. Change this only if you have a very specific requirement.

recommendation.feedback.push.maxentries

The number of feedback entries submitted in one request. By default, HST server submits a batch of 50 feedback entries in one request.

Type: int

Default Value: 50

Ambari Config:

Advanced > Custom hst-server-conf

Config File:

/etc/hst/conf/hst-server.ini

Default value is suitable for most clusters. Change this only if you have a very specific requirement. Requires tuning only if you submit more than 100 feedback entries on a daily basis.

gateway.enabled

Enables auto upload of bundles after capture.

Type: boolean

Default Value: true

Ambari Config:

Advanced > Custom hst-server-conf

Config File:

/etc/hst/conf/hst-server.ini

Disable this if you are capturing the bundles for internal review purposes only. We recommend to keep it enabled to receive valuable insights and recommendations for your cluster.

gateway.retry.attempts

Defines how many attempts HST server makes to connect to the SmartSense Gateway.

Type: int

Default Value: 10

Ambari Config:

Advanced > Custom hst-server-conf

Config File:

/etc/hst/conf/hst-server.ini

Default value is suitable for all clusters.

gateway.retry.interval.increment

The amount of time (in milliseconds) to wait before making a subsequent SmartSense Gateway connection attempt. In other words, this is the wait time between subsequent connection attempts.

Type: int

Default Value: 5000

Ambari Config:

Advanced > Custom hst-server-conf

Config File:

/etc/hst/conf/hst-server.ini

Default value is suitable for all clusters.


HST Agent

The following configuration properties are available for HST Agent:

Table 3.3. HST Agent Configuration Properties

Property NameDescriptionWhere to ConfigureGuidelines

agent.tmp_dir

Temporary directory used by agents to keep local bundles during bundle preparation.

Type: string

Default Value: /var/lib/smartsense/hst-agent/data/tmp

Ambari Config:

Basic

Config File:

/etc/hst/conf/hst-agent.ini

You must have at least 10GB of free space in this directory.

This should be set to a different location than the server tmp directory.

security.anonymization.max.heap

The maximum heap allocated (in MB) on every agent for anonymization.

Type: int

Default Value: 2048

Ambari Config:

Advanced > Custom hst-agent-conf

Config File:

/etc/hst/conf/hst-agent.ini

If you experience out of memory exceptions during the anonymization process, increase the heap size gradually depending on availability.

agent.loglevel

Provides ability to change the hst-agent logging level. Possible values are: INFO, DEBUG, WARNING, ERROR, CRITICAL.

Type: string

Default Value: INFO

Ambari Config:

Advanced > Custom hst-agent-conf

Config File:

/etc/hst/conf/hst-agent.ini

To debug issues on the agent, set this to DEBUG.

bundle.logs_to_capture

Patterns of log files to be captured.

Type: string

Default Value: (.*).log$,(.*).out$

Ambari Config:

Data Capture

Config File:

/etc/hst/conf/hst-agent.ini

Be careful when capturing more log files as they may turn out to be large and require extra space on the HST server.

server.url_port

Port for one-way SSL communication between HST server and HST agents. This port is usually required during two-way SSL setup.

Type:

Default Value: 9440

Ambari Config:

Advanced > Custom hst-agent-conf

Config File:

/etc/hst/conf/hst-agent.ini

This should be modified in sync with similar property in HST server configurations.

server.secured_url_port

Port for two-way SSL communication between HST server and HST agents.

Type:

Default Value: 9441

Ambari Config:

Advanced > Custom hst-agent-conf

Config File:

/etc/hst/conf/hst-agent.ini

This should be modified in sync with similar property in HST server configurations.

server.two_way_ssl

Enables two-way SSL for communication between HST server and HST agents.

Type: boolean

Default Value: true

Ambari Config:

Advanced > Custom hst-agent-conf

Config File:

/etc/hst/conf/hst-agent.ini

We recommend not to change this unless you have a very specific requirement.

server.connection_retry_count

Number of times to retry to connect to server in case of connection failures and timeouts.

Type: int

Default Value: 100

Ambari Config:

Operations

Config File:

/etc/hst/conf/hst-agent.ini

In many cases, the default value (100 retry attempts) is often more than needed. Reduce it if retry connection attempts are keeping the system busy.

server.connection_retry_interval

Defines the interval (in seconds) between retries.

Type: int

Default Value: 10

Ambari Config:

Operations

Config File:

/etc/hst/conf/hst-agent.ini

Default value is suitable for all clusters.

java.home

Path to the JAVA home for HST agents.

Type: string

Default Value: (no value)

Ambari Config:

Advanced > Custom hst-agent-conf

Config File:

/etc/hst/conf/hst-agent.ini

This setting is automatically configured from Ambari env settings and usually there is no reason to change it.

We recommend that you use the latest 1.7/1.8 JAVA versions with up-to-date security updates. For more security we also recommend that you have unlimited JCE policy installed.

command.heartbeat_interval

The heartbeat interval (in seconds). During agent capture, this heartbeat helps ensure connectivity with HST server and executes certain commands such as cancel capture.

Type: int

Default Value: 30

Ambari Config:

Advanced > Custom hst-agent-conf

Config File:

/etc/hst/conf/hst-agent.ini

Do not change this unless you experience performance issues.

command.check_commands_retry_count

Number of times to retry check commands.

Type: int

Default Value: 10

Ambari Config:

Advanced > Custom hst-agent-conf

Config File:

/etc/hst/conf/hst-agent.ini

Default value is suitable for all clusters.

command.check_commands_retry_interval

Interval (in seconds) between retries for check commands.

Type: int

Default Value: 10

Ambari Config:

Advanced > Custom hst-agent-conf

Config File:

/etc/hst/conf/hst-agent.ini

Default value is suitable for all clusters.

management.updates.dir

Directory to store updates received from HST server.

Type: string

Default Value: /var/lib/smartsense/hst-agent/updates

Ambari Config:

Advanced > Custom hst-agent-conf

Config File:

/etc/hst/conf/hst-agent.ini

We recommend not to change this unless you have a very specific requirement. If changing this, verify that permissions are set accordingly.

management.patch.auto.apply.enabled

Enable automatically downloading and applying updates received from HST server.

Type: boolean

Default Value: true

Ambari Config:

Advanced > Custom hst-agent-conf

Config File:

/etc/hst/conf/hst-agent.ini

Disable only if you do not want the HST server to propagate the agent configuration changes to all agents.

bundle.compress_captured_log_locally

By default this is set to false; i.e the log files are included as they are, without applying compression.

Type: boolean

Default Value: false

Ambari Config:

Advanced > Custom hst-agent-conf

Config File:

/etc/hst/conf/hst-agent.ini

If capture requests are timing out, set this to true to activate log compression. Note that the compressed files will not be anonymized.

upload.retry_count

Number of times the agent will retry to submit its local bundle to server. Note that this is different from uploading the final bundle to Hortonworks.

Type: int

Default Value: 100

Ambari Config:

Operations

Config File:

/etc/hst/conf/hst-agent.ini

Default value is suitable for all clusters.
upload.min_retry_interval

Minimum interval (in seconds) between bundle upload retries made by agents. Random value between min_retry_interval and max_retry_interval will be used. For constant value, use retry_interval=x.

Type: int

Default Value: 15

Ambari Config:

Operations

Config File:

/etc/hst/conf/hst-agent.ini

Default value is suitable for all clusters.

upload.max_retry_interval

Maximum interval (in seconds) between bundle upload retries made by agents. Random value between min_retry_interval and max_retry_interval will be used. For constant value, use retry_interval=x.

Type: int

Default Value: 120

Ambari Config:

Operations

Config File:

/etc/hst/conf/hst-agent.ini

Default value is suitable for all clusters.

handler.largefiles.size

Minimum file size (in MB) for a file to qualify as a large file. Large files are handled based on action defined in the `handler.largefiles.action` property.

Type: int

Default Value: 200

Ambari Config:

Advanced > Custom hst-agent-conf

Config File:

/etc/hst/conf/hst-agent.ini

Update this if you have a specific requirement which includes capturing files larger than 200MB.

handler.largefiles.action

Action to handle a large file. Supported actions are 'allow', 'ignore', 'truncate', 'fail'.

Type: string

Default Value: truncate

Ambari Config:

Advanced > Custom hst-agent-conf

Config File:

/etc/hst/conf/hst-agent.ini

Configure action to handle large files based on your requirements.

SmartSense Gateway

The following configuration properties are available for the SmartSense Gateway:

Table 3.4. SmartSense Gateway Configuration Properties

Property NameDescriptionWhere to ConfigureGuidelines

security.openssl.digest.algorithms

A comma separated list of supported algorithms for SSL.

Type: string

Default Value: sha256,sha384,sha512,sha,sha1,md5

Ambari Config: N/A

Config File:

/etc/hst/conf/hst-gateway.ini

Typically it there is no need to modify this as sha256,sha512 are available and provide strong encryption. Change this only if you have a very specific requirement.
security.gateway.cert.name

Use this property if you have to use a custom root Certificate Authority for SmartSense Gateway operations. This file must exist at /var/lib/hst-gateway/keys before gateway is started.

Type: string

Default Value: ca.crt

Ambari Config: N/A

Config File:

/etc/hst/conf/hst-gateway.ini

You can set up your own CA to sign certificates for two-way SSL communication between HST server and SmartSense Gateway. Modify this property to customize the root CA.

gateway.thread.pool.size

Thread pool for the gateway server's API endpoint. Default is automatically calculated based on CPU cores.

Type: int

Default Value: (Automatically calculated)

Ambari Config: N/A

Config File:

/etc/hst/conf/hst-gateway.ini

Since the count is already dynamic, it is usually not required to change it.
gateway.start.validation.enabled

Enables the outbound connectivity check that SmartSense Gateway performs during startup. By default, the connectivity check in enabled.

Type: boolean

Default Value: true

Ambari Config: N/A

Config File:

/etc/hst/conf/hst-gateway.ini

Set to false if you use an HTTP proxy for gateway and gateway start command fails with "unable to connect" error. There is bug in SmartSense Gateway versions earlier than 1.3.2 where the socket connectivity test does not use the intended proxy.

gateway.cache.expiry.hours

The frequency with which the SmartSense Gateway cache is refreshed.

Gateway caches the outbound connectivity status to report to HST server. By default, this cache is refreshed every two hours and upon gateway startup.

Type: int

Default Value: 2

Ambari Config: N/A

Config File:

/etc/hst/conf/hst-gateway.ini

Tweak this property to refresh the outbound connectivity status.

gateway.data.transfer.buffer.size

Buffer size for data transfer via SmartSense Gateway. Gateway uses chunked buffers to transfer encrypted data between SmartSense and Hortonworks Datalake.

Type: int

Default Value: 4096

Ambari Config: N/A

Config File:

/etc/hst/conf/hst-gateway.ini

Tune this property to effectively use the network bandwidth for communication between HST server and Hortonworks.


Activity Analyzer

The following configuration properties are available for Activity Analyzer:

Table 3.5. Activity Analyzer Configuration Properties

Property NameDescriptionWhere to ConfigureGuidelines

phoenix.sink.batch.size

Activities are batched for better storage performance. A batch is persisted when either the batch size becomes equal to phoenix.sink.batch.size or activity.status.update.interval.seconds has elapsed.

Type: int

Default Value: 100

Ambari Config:

Activity Analysis

Config File:

/etc/smartsense-activity/conf/activity.ini

Increasing batch size can lower the load on storage and improve storage performance; however, it can delay the availability of data and increase memory pressure.

Reducing batch size can make data available sooner but has negative performance impact on storage layer.

global.activity.processing.parallelism

Number of parallel threads that process each activity type. Controls the threads used for Tez, YARN, MR, and HDFS activity data collection.

Type: int

Default Value: 8

Ambari Config:

Activity Analysis

Config File:

/etc/smartsense-activity/conf/activity.ini

Reduce the number of threads if you encounter out of memory exceptions.

phoenix.sink.flush.interval.seconds

Time after which data will be flushed to Phoenix. A batch is persisted when either the batch size becomes equal to phoenix.sink.batch.size or activity.status.update.interval.seconds has elapsed.

Type: int

Default Value: 30

Ambari Config:

Activity Analysis

Config File:

/etc/smartsense-activity/conf/activity.ini

Increase the time to reduce the number of persist operations to Phoenix only if number of records to be batched together is much less than 100.

mr_job.activity.watcher.enabled

Enables automatic activity analysis for MapReduce jobs.

Type: boolean

Default Value: true

Ambari Config:

Activity Analysis

Config File:

/etc/smartsense-activity/conf/activity.ini

Disable only if you do not want to analyze MapReduce jobs.

mr_job.max.job.size.mb.for.parallel.exec

ution

Maximum size (in bytes) that a MapReduce job can have in order to be executed in parallel.

Some large MapReduce jobs may contain thousands of tasks. Such jobs require a lot of memory and they put memory pressure on JVM, especially in multi-threaded execution.

Any job with history size larger than specified in this parameter will be executed in synchronized fashion. This may slow the performance down, but will avoid OOM errors.

Any job with history file size smaller than specified in this parameter will be executed in parallel.

Type: int

Default Value: 500

Ambari Config:

Activity Analysis

Config File:

/etc/smartsense-activity/conf/activity.ini

Reduce the parallel execution job size if you encounter OOM errors.

tez_job.activity.watcher.enabled

Enables automatic activity analysis for Tez jobs.

Type: boolean

Default Value: true

Ambari Config:

Activity Analysis

Config File:

/etc/smartsense-activity/conf/activity.ini

Disable only if you do not want to analyze Tez jobs.

tez_job.tmp.dir

Temporary location where Tez job information is downloaded.

Type: string

Default Value:

/var/lib/smartsense/activity-analyzer/tez/tmp/

Ambari Config:

Activity Analysis

Config File:

/etc/smartsense-activity/conf/activity.ini

You can symlink it to a non-root partition or change it to use a directory in a non-root partition.

yarn_app.activity.watcher.enabled

Enables automatic activity analysis for YARN apps.

Type: boolean

Default Value: true

Ambari Config:

Activity Analysis

Config File:

/etc/smartsense-activity/conf/activity.ini

Disable only if you do not want to analyze YARN jobs.

hdfs.activity.watcher.enabled

Enables automatic analysis for HDFS files.

Type: boolean

Default Value: true

Ambari Config:

Activity Analysis

Config File:

/etc/smartsense-activity/conf/activity.ini

Disable only if you do not want to analyze HDFS fsImage.

global.activity.analyzer.user

Defines the user used to read activity data from HDFS and YARN. This user must have read access to all activity data from HDFS/YARN/ATS, and so on.

Type: string

Default Value: activity_explorer

Ambari Config:

Advanced > Advanced activity-conf

Config File:

/etc/smartsense-activity/conf/activity.ini

Default value is suitable for all clusters.

activity.explorer.user

Defines the user used to read pre-analyzed data. This user does not need access to HDFS and YARN.

Type: string

Default Value: activity_explorer

Ambari Config:

Advanced > Advanced activity-conf

Config File:

/etc/smartsense-activity/conf/activity.ini

Default value is suitable for all clusters.

analyzer_jvm_opts

Allows you to specify multiple jvm options separated by space.

Type: string

Default Value: -Xms128m

Ambari Config:

Advanced > Advanced activity-env

Config File:

/etc/smartsense-activity/conf/activity-env.sh

This parameter allows you to add any additional jvm options for executing activity analyzers, for example for GC tuning.

analyzer_jvm_heap

Maximum heap space (in MB) allocated for Activity Analyzer process.

Type: int

Default Value: 8192

Ambari Config:

Advanced > Advanced activity-env

Config File:

/etc/smartsense-activity/conf/activity-env.sh

Usually 8192 MB is sufficient, but it can be increased if you encounter OOM errors.

activity_log_dir

Directory where activity log files are created.

Type: string

Default Value:

var/log/smartsense-activity

Ambari Config:

Advanced > Advanced activity-log4j

Config File:

/etc/smartsense-activity/conf/log4j.properties

Default value is suitable for most clusters.

If you change this directory, you must provide read/write/create permissions on the new directory to activity_analyzer user.

activity_max_file_size

Maximum size (in MB) for SmartSense activity log files.

Type: int

Default Value: 30

Ambari Config:

Advanced > Advanced activity-log4j

Config File:

/etc/smartsense-activity/conf/log4j.properties

Default value is suitable for most clusters.

Check available storage capacity before updating this property.

activity_max_backup_index

Maximum number of SmartSense activity log files.

Type: int

Default Value: 10

Ambari Config:

Advanced > Advanced activity-log4j

Config File:

/etc/smartsense-activity/conf/log4j.properties

You can increase this number to keep the record of older logs. Check available storage capacity before updating this property.

global.date.format

Format in which dates are converted to strings and sometimes persisted.

Type: string

Default Value: "YYYY-mm-DD"

Ambari Config:

Advanced > Custom activity-analyzer-conf

Config File:

/etc/smartsense-activity/conf/activity.ini

Default value is suitable for all clusters.

global.activity.status.update.interval.sec

onds

Interval (in seconds) after which status of processed/failed/in process activities is updated in DB.

Type: int

Default Value: 30

Ambari Config:

Advanced > Custom activity-analyzer-conf

Config File:

/etc/smartsense-activity/conf/activity.ini

Default value is suitable for all clusters.
activity.batch.interval.seconds

Interval for batching activities.

Activities are batched for better storage performance. A batch is persisted when either the batch size becomes equal to phoenix.sink.batch.size or activity.status.update.interval.seconds is elapsed.

Type: int

Default Value: 60

Ambari Config:

Advanced > Custom activity-analyzer-conf

Config File:

/etc/smartsense-activity/conf/activity.ini

Increasing the batch interval can lower the load on storage and improve storage performance; however, it can also delay the availability of data and increase memory pressure.

Reducing the interval size can make data available sooner, but has negative performance impact on storage layer.

activity.watcher.enabled

Enables regular collection of job data for analysis.

Type: boolean

Default Value: true

Ambari Config:

Advanced > Custom activity-analyzer-conf

Config File:

/etc/smartsense-activity/conf/activity.ini

Disable this only if you want to temporarily turn off data collection.

activity.history.max.back.track.days

The number of days of history to retrieve job information.

Type: int

Default Value: 7

Ambari Config:

Advanced > Custom activity-analyzer-conf

Config File:

/etc/smartsense-activity/conf/activity.ini

Increase this number if you have to refer to older jobs. Note that older jobs should have data available in AMS. This is used only during first run after installation.

phoenix.setup.continue.on.error

During initial setup, errors in DB setup may occur. This parameter indicates whether to continue if any error occurs.

Type: boolean

Default Value: false

Ambari Config:

Advanced > Custom activity-analyzer-conf

Config File:

/etc/smartsense-activity/conf/activity.ini

Default value is suitable for all clusters.

phoenix.setup.drop.existing.tables

During initial setup matching tables may be found in the DB (typically from previous install attempts). This parameter determines whether they should be dropped and recreated. By default, the existing entries are kept.

Type: boolean

Default Value: false

Ambari Config:

Advanced > Custom activity-analyzer-conf

Config File:

/etc/smartsense-activity/conf/activity.ini

Default value is suitable for all clusters.

phoenix.activity.analyzer.jdbc.url

JDBC URL used by Activity Analyzer to store its data.

Type: string

Default Value: (no value)

Ambari Config:

Advanced > Custom activity-analyzer-conf

Config File:

/etc/smartsense-activity/conf/activity.ini

Do not change it. It is auto configured based on the cluster setup.

ams.jdbc.url

JDBC URL used by Activity Analyzer to fetch data from AMS.

Type: string

Default Value: (no value)

Ambari Config:

Advanced > Custom activity-analyzer-conf

Config File:

/etc/smartsense-activity/conf/activity.ini

Do not change it. It is auto configured based on the cluster setup.

global.store.job.configs

Enables storing job-specific configs in AMS after analysis.

Type: boolean

Default Value: true

Ambari Config:

Advanced > Custom activity-analyzer-conf

Config File:

/etc/smartsense-activity/conf/activity.ini

Do not disable it. Keeping it on helps in debugging.

global.store.tasks

Enables persisting task-level data in AMS after analysis.

Type: boolean

Default Value: false

Ambari Config:

Advanced > Custom activity-analyzer-conf

Config File:

/etc/smartsense-activity/conf/activity.ini

Task-level data can be huge and may overwhelm AMS, so keep it disabled unless absolutely needed. If enabling, disable again later.
global.store.task.counters

Enables storing task counter data in the AMS after analysis.

Type: boolean

Default Value: false

Ambari Config:

Advanced > Custom activity-analyzer-conf

Config File:

/etc/smartsense-activity/conf/activity.ini

All task counters can be huge and may overwhelm AMS, so keep it disabled unless absolutely needed. If enabling, disable again later.

global.activity.fetch.retry.interval.seconds

Interval (in seconds) between retry attempts to fetch the activity details.

Type: int

Default Value: 5

Ambari Config:

Advanced > Custom activity-analyzer-conf

Config File:

/etc/smartsense-activity/conf/activity.ini

Default value is suitable for all clusters.

global.activity.fetch.retry.attempts

Number of tries to fetch activities before giving up.

Type: int

Default Value: 5

Ambari Config:

Advanced > Custom activity-analyzer-conf

Config File:

/etc/smartsense-activity/conf/activity.ini

Default value is suitable for all clusters.

global.tmp.dir

Temporary directory used by activity-analyzer for internal purposes.

Type: string

Default Value:

/var/lib/smartsense/activity-analyzer/tmp/

Ambari Config:

Advanced > Custom activity-analyzer-conf

Config File:

/etc/smartsense-activity/conf/activity.ini

We do not recommended to change this unless you have a very specific requirement. If using a different directory than the default, verify that permissions are set accordingly.


Activity Explorer

The following configuration properties are available for Activity Explorer:

Table 3.6. Activity Explorer Configuration Properties

Property NameDescriptionWhere to ConfigureGuidelines

users.admin

Password for Activity Explorer's admin user when using local authentication.

Type: string

Default Value: (no value)

Ambari Config:

Activity Analysis

Config File:

/etc/zeppelin/conf/shiro.ini

This should be updated only during installation. It requires uninstall and re-install if you have to update the password for admin access.

main.sessionManager

The SessionManager, as its name might imply, manages sessions for all subjects in an application: session creation, deletion, inactivity, validation, and so on. Like other core architectural components in Apache Shiro, the SessionManager is a top-level component maintained by the SecurityManager.

The default SecurityManager implementation uses a DefaultSessionManager out of the box. The DefaultSessionManager implementation provides enterprise-grade session management features (such as session validation and orphan cleanup) needed for an application.

Type: string

Default Value:

org.apache.shiro.web.session.mgt.Default

WebSessionManager

Ambari Config:

Advanced > Advanced activity-zeppelin-shiro

Config File:

/etc/zeppelin/conf/shiro.ini

Refer to Apache Shiro documentation.

main.securityManager.sessionManager

The default value applies the value set in main.sessionManager to this property. If needed, you can set this to a value specific for security manager.

Type: string

Default Value: $sessionManager

Ambari Config:

Advanced > Advanced activity-zeppelin-shiro

Config File:

/etc/zeppelin/conf/shiro.ini

Refer to Apache Shiro documentation.

securityManager.sessionManager.globalSes

sionTimeout

Sets the default timeout value (in milliseconds) for all newly created sessions. Changing this property will automatically apply the new value to all sessions.

Type: long

Default Value: 86400000

Ambari Config:

Advanced > Advanced activity-zeppelin-shiro

Config File:

/etc/zeppelin/conf/shiro.ini

Refer to Apache Shiro documentation.

zeppelin.server.addr

Binding address for Zeppelin Activity Explorer.

Type: string

Default Value: 0.0.0.0

Ambari Config:

Advanced > Advanced activity-zeppelin-shiro

Config File:

/etc/zeppelin/conf/shiro.ini

Refer to Apache Zeppelin 0.6.2 documentation.

zeppelin.server.port

Port on which Zeppelin UI is available.

Type: int

Default Value: 9060

Ambari Config:

Advanced > Advanced activity-zeppelin-shiro

Config File:

/etc/zeppelin/conf/shiro.ini

Refer to Apache Zeppelin 0.6.2 documentation.

zeppelin.server.context.path

Context path of the web application.

Type: string

Default Value: /

Ambari Config:

Advanced > Advanced activity-zeppelin-shiro

Config File:

/etc/zeppelin/conf/shiro.ini

Refer to Apache Zeppelin 0.6.2 documentation.

zeppelin.war.tempdir

Location of Jetty temporary directory.

Type: string

Default Value:

/var/lib/smartsense/activity-explorer/webapp

Ambari Config:

Advanced > Advanced activity-zeppelin-shiro

Config File:

/etc/zeppelin/conf/shiro.ini

Refer to Apache Zeppelin 0.6.2 documentation.

zeppelin.notebook.dir

Path or URI for notebook persist.

Type: string

Default Value:

/var/lib/smartsense/activity-explorer/notebook

Ambari Config:

Advanced > Advanced activity-zeppelin-shiro

Config File:

/etc/zeppelin/conf/shiro.ini

Refer to Apache Zeppelin 0.6.2 documentation.

zeppelin.notebook.homescreen.hide

When set to true, hides home screen notebook from list.

Type: boolean

Default Value: false

Ambari Config:

Advanced > Advanced activity-zeppelin-shiro

Config File:

/etc/zeppelin/conf/shiro.ini

Refer to Apache Zeppelin 0.6.2 documentation.

zeppelin.notebook.storage

Notebook persistence layer implementation.

Type: string

Default Value:

org.apache.zeppelin.notebook.repo.VFS

NotebookRepo

Ambari Config:

Advanced > Advanced activity-zeppelin-shiro

Config File:

/etc/zeppelin/conf/shiro.ini

Refer to Apache Zeppelin 0.6.2 documentation.

zeppelin.interpreter.dir

Interpreter implementation base directory.

Type: string

Default Value:

/usr/hdp/share/hst/activity-explorer/interpreter

Ambari Config:

Advanced > Advanced activity-zeppelin-shiro

Config File:

/etc/zeppelin/conf/shiro.ini

Refer to Apache Zeppelin 0.6.2 documentation.

zeppelin.interpreters

A comma separated list of interpreter configurations. First interpreter becomes default.

Type: string

Default Value: org.apache.zeppelin.phoenix.Phoenix

Interpreter

Ambari Config:

Advanced > Advanced activity-zeppelin-shiro

Config File:

/etc/zeppelin/conf/shiro.ini

Refer to Apache Zeppelin 0.6.2 documentation.

zeppelin.interpreter.connect.timeout

Interpreter process connect timeout in milliseconds.

Type: int

Default Value: 30000

Ambari Config:

Advanced > Advanced activity-zeppelin-shiro

Config File:

/etc/zeppelin/conf/shiro.ini

Refer to Apache Zeppelin 0.6.2 documentation.

zeppelin.ssl

Enables using SSL for the servers.

Type: boolean

Default Value: false

Ambari Config:

Advanced > Advanced activity-zeppelin-shiro

Config File:

/etc/zeppelin/conf/shiro.ini

Refer to Apache Zeppelin 0.6.2 documentation.

zeppelin.ssl.client.auth

Enables client authentication for SSL connections.

Type: boolean

Default Value: false

Ambari Config:

Advanced > Advanced activity-zeppelin-shiro

Config File:

/etc/zeppelin/conf/shiro.ini

Refer to Apache Zeppelin 0.6.2 documentation.

zeppelin.ssl.keystore.path

Path to keystore relative to the Activity Explorer configuration directory.

Type: stringzeppelin.ssl.truststore.type

Default Value:

/var/lib/smartsense/activity-explorer/keystore

Ambari Config:

Advanced > Advanced activity-zeppelin-shiro

Config File:

/etc/zeppelin/conf/shiro.ini

Refer to Apache Zeppelin 0.6.2 documentation.

zeppelin.ssl.keystore.type

The format of the given keystore (for example JKS or PKCS12).

Type: string

Default Value: JKS

Ambari Config:

Advanced > Advanced activity-zeppelin-shiro

Config File:

/etc/zeppelin/conf/shiro.ini

Refer to Apache Zeppelin 0.6.2 documentation.

zeppelin.ssl.keystore.password

Keystore password. It can be obfuscated using the Jetty password tool.

Type: string

Default Value: admin

Ambari Config:

Advanced > Advanced activity-zeppelin-shiro

Config File:

/etc/zeppelin/conf/shiro.ini

Refer to Apache Zeppelin 0.6.2 documentation.

zeppelin.ssl.key.manager.password

Key Manager password. Defaults to keystore password. It can be obfuscated.

Type: string

Default Value: admin

Ambari Config:

Advanced > Advanced activity-zeppelin-shiro

Config File:

/etc/zeppelin/conf/shiro.ini

Refer to Apache Zeppelin 0.6.2 documentation.

zeppelin.ssl.truststore.path

Path to truststore relative to Activity Explorer configuration directory. Defaults to the keystore path.

Type: string

Default Value:

/var/lib/smartsense/activity-explorer/truststore

Ambari Config:

Advanced > Advanced activity-zeppelin-shiro

Config File:

/etc/zeppelin/conf/shiro.ini

Refer to Apache Zeppelin 0.6.2 documentation.

zeppelin.ssl.truststore.type

The format of the given truststore (for example JKS or PKCS12). Defaults to the same type as the keystore type.

Type: string

Default Value: JKS

Ambari Config:

Advanced > Advanced activity-zeppelin-shiro

Config File:

/etc/zeppelin/conf/shiro.ini

Refer to Apache Zeppelin 0.6.2 documentation.

zeppelin.ssl.truststore.password

Truststore password. Can be obfuscated using the Jetty password tool. Defaults to the keystore password.

Type: string

Default Value: admin

Ambari Config:

Advanced > Advanced activity-zeppelin-shiro

Config File:

/etc/zeppelin/conf/shiro.ini

Refer to Apache Zeppelin 0.6.2 documentation.

zeppelin.server.allowed.origins

Allowed sources for REST and WebSocket requests (i.e. http://onehost:8080,http://otherhost.com).

If you change from * you are vulnerable to the issue described in ZEPPELIN-173.

Type: string

Default Value: *

Ambari Config:

Advanced > Advanced activity-zeppelin-shiro

Config File:

/etc/zeppelin/conf/shiro.ini

Refer to Apache Zeppelin 0.6.2 documentation.

zeppelin.anonymous.allowed

Enables access by anonymous user.

Type: boolean

Default Value: false

Ambari Config:

Advanced > Advanced activity-zeppelin-shiro

Config File:

/etc/zeppelin/conf/shiro.ini

Refer to Apache Zeppelin 0.6.2 documentation.

zeppelin.websocket.max.text.message.size

Size in characters of the maximum text message to be received by WebSocket.

Type: long

Default Value: 1024000

Ambari Config:

Advanced > Advanced activity-zeppelin-shiro

Config File:

/etc/zeppelin/conf/shiro.ini

Refer to Apache Zeppelin 0.6.2 documentation.