The Impala Service
You can install Cloudera Impala through the Cloudera Manager installation wizard, using either parcels or packages, and have the service created and started as part of the first run installation wizard. See Installing Impala.
If you elect not to include the Impala service using the installation wizard, you can you the Add Service wizard to perform the installation. The wizard will automatically configure and start the dependent services and the Impala service. See Adding a Service for instructions.
Configuring the Impala Service
There are several types of configuration settings you may need to apply, depending on your situation.
Running Impala with CDH 4.1
If you are running CDH 4.1, and the Bypass Hive Metastore Server option is enabled, do the following:- Go to the Impala service.
- Select .
- Select .
- Add the following to the Impala Advanced Configuration Snippet for hive-site.xml property, replacing <hive_metastore_server_host> with the name of your Hive Metastore Server host:
<property> <name>hive.metastore.local</name> <value>false</value> </property> <property> <name>hive.metastore.uris</name> <value>thrift://<hive_metastore_server_host>:9083</value> </property>
- Click Save Changes.
- Restart the Impala service.
Enabling Sentry Authorization for Impala
- Enable Sentry authorization for Hive.
- Go to the Impala service.
- Select .
- Under the Service-Wide category, go to the Sentry section.
- Check Enable Sentry Authorization, then click Save Changes.
- Restart the Impala service.
Configuring Table Statistics
Configuring table statistics is highly recommended when using Impala. It allows Impala to make optimizations that can result in significant (over 10x) performance improvement for some joins. If these are not available, Impala will still function, but at lower performance.
The Impala implementation to compute table statistics is available in CDH 5.0.0 or higher and in Impala version 1.2.2 or higher. The Impala implementation of COMPUTE STATS requires no setup steps and is preferred over the Hive implementation. See Table Statistics. If you are running an older version of Impala, follow the procedure in Hive Table Statistics.
Adding the Impala Llama ApplicationMaster Role
CDH 5 introduces the Impala Llama ApplicationMaster (Llama) role, which reserves and releases YARN-managed resources for Impala, thus reducing resource management overhead when performing Impala queries. For further information, see Managing Resources.
- Manually enable cgroup-based resource management:
- In the top navigation bar, click Hosts.
- Select .
- Expand Resource Management.
- Check the Enable Cgroup-based Resource Management checkbox.
- Click Save Changes.
- Optionally configure one or more dynamic resource pools for YARN. If you do not configure pools, queries use the default pool or a pool named for the users who submit the queries.
- Configure YARN resource management properties:
- Go to the YARN service.
- Select .
- Select .
- Check the Use CGroups for Resource Management and Always use Linux Container Executor properties.
- Click Save Changes.
- Select .
- Set the Container Memory Minimum and Container Virtual CPU Cores Minimum properties to 0.
- Click Save Changes.
- Select .
- Record the value of the Container Memory property.
- Configure Impala resource management properties:
- Go to the Impala service.
- Select .
- Click Resource Management.
- Select .
- Set it to the YARN service.
- Select .
- Set Impala Daemon Memory Limit property to be equal to the value you recorded in step 3j.
- Click Save Changes.
- Add and configure the Llama role:
- Click the Instances tab.
- Click the Add button.
- Select a host in the column under Impala Llama ApplicationMaster (beta), then click OK.
- Click Continue.
- Select .
- Click Impala Llama ApplicationMaster (beta) Default Group.
- In the Core Queues property, enter the pools you created in step 2, if any.
- Click Save Changes.
- Restart services and redeploy client configurations:
- Click in the top right.
- Click the Restart button.
- Click Restart Now.
- Click Finish.
Impala Web Servers
Enabling and Disabling Access to Impala Web Servers
By default access to the Impala Daemon and StateStore web servers is enabled.-
Impala StateStore
- Go to the Impala service.
- Select .
- Select Impala StateStore Default Group.
- Check or uncheck Enable StateStore Web Server.
- Click Save Changes.
- Restart the Impala service.
-
Impala Daemon
- Go to the Impala service.
- Select .
- Select .
- Check or uncheck Enable Impala Daemon Web Server.
- Click Save Changes.
- Restart the Impala service.
Opening Impala Web Server UIs
-
Impala StateStore
- Go to the Impala service.
- Select .
- Impala Daemon
- Go the to Impala service.
- Click the Instances tab.
- Click an impalad instance.
- Click Impala Daemon Web UI.
- Impala Catalog Server
- Go to the Impala service.
- Select .
- Impala Llama ApplicationMaster (beta)
- Go to the Impala service.
- Click the Instances tab.
- Click a llama instance.
- Click Llama Web UI.
Configuring Secure Access for Impala Web Servers
Cloudera Manager supports two methods of authentication for secure access to the Impala Catalog Server, Daemon, and StateStore web servers: password-based authentication and SSL certificate authentication. Both of these can be configured through properties of the Impala Catalog Server, Daemon, and StateStore. Authentication for the three types of daemons can be configured independently.
Configuring Password Authentication
- Go to the Impala service.
- Select .
- Search for "password" using the Search box within the Configuration page. This should display the password-related properties (Username and Password properties) for the Impala Catalog Server, Daemon, and StateStore. If there are multiple role groups configured for Impala Daemon instances, the search should display all of them.
- Enter a username and password into these fields.
- Click Save Changes.
- Restart the Impala service.
Now when you access the Web UI for the Impala Catalog Server, Daemon, and StateStore, you are asked to log in before access is granted.
Configuring SSL Certificate Authentication
- Create or obtain an SSL certificate.
- Place the certificate, in .pem format, on the hosts where the Impala Catalog Server and StateStore are running, and on each host where an Impala Daemon is running. It can be placed in any location (path) you choose. If all the Impala Daemons are members of the same role group, then the .pem file must have the same path on every host.
- Go to the Impala service page.
- Select .
- Search for "certificate" using the Search box within the Configuration page. This should display the certificate file location properties for the Impala Catalog Server, Daemon, and StateStore. If there are multiple role groups configured for Impala Daemon instances, the search should display all of them.
- In the property fields, enter the full path name to the certificate file.
- Click Save Changes.
- Restart the Impala service.
When you access the Web UI for the Impala Catalog Server, Daemon, and StateStore, https will be used.
Deploying Impala with Hue
- Go to the Hue service page.
- Select .
- Select .
- Add the following to Hue Server Configuration Advanced Configuration Snippet for hue_safety_valve_server.ini:
[impala] server_host=<impalad_hostname> server_port=21000
Substitute your actual hostname for <impalad_hostname>. You can chose any one of your Impala Daemon hosts, assuming you have more than one. - Click Save Changes.
- Restart the Hue service.
<< The Hue Service | The Lily HBase Indexer Service >> | |