Try the recommended solution for each of the following problems.
If you reboot your cluster, you must restart the Ambari Server and all the Ambari Agents manually.
Charts appear incorrectly or not at all despite Host health status is displayed incorrectly.
SUSE 11 ships with Python version 2.6.0-8.12.2 which contains a known defect that causes this crash.
As an option you can start the HBase REST server manually after the install process is complete. It can be started on any host that has the HBase Master or the Region Server installed. If you install the REST server on the same host as the Ambari server, the http ports will conflict.
On a cluster host ps aux | grep ambari-agent
shows more than one agent process running. This causes Ambari Server to get incorrect ids from the host and forces Agent to restart and re-register.
The Hive Service uses MySQL Server by default. If you choose MySQL server as the database on the Ambari Server host as the managed server for Hive, Ambari stops this database during deployment and crashes.
The cluster fails to install with an error related to running groupmod
. This can occur in environments where groups are managed in LDAP, and not on local Linux machines. You may see an error message similar to the following one:
Fail: Execution of 'groupmod hadoop' returned 10. groupmod: group 'hadoop' does not exist in /etc/group
When using SLES and performing host registration using SSH, the Agent bootstrap may fail due to timeout when running the setupAgent.py
script. The host on which the timeout occurs will show the following process hanging:
c6401.ambari.apache.org:/etc/
# ps -ef | grep zypper
root 18318 18317 5 03:15 pts/1 00:00:00 zypper -q search -s --match-exact
ambari-agent
If you have a repository registered that is prompting to accept keys, via user interaction, you may see the hang and timeout. In this case, run
zypper refresh
and confirm all repository keys are accepted for the zypper command to work without user interaction.Another alternative is to perform manual Agent setup and not use SSH for host registration. This option does not require that Ambari call zypper without user interaction.
When installing Ambari on RHEL/CentOS 6 using the Cluster Installer Wizard at the Host Checks step, one or more host checks may fail if you have not disabled Transparent Huge Pages on all hosts.
Host Checks will warn you when a failure occurs.
Disable THP. On all hosts,
Add the following command to your
/etc/rc.local
file:if test -f /sys/kernel/mm/transparent_hugepage/enabled; then echo never > /sys/kernel/mm/redhat_transparent_hugepage/enabled fi if test -f /sys/kernel/mm/transparent_hugepage/defrag; then echo never > /sys/kernel/mm/redhat_transparent_hugepage/defrag fi
To confirm, reboot the host then run the following command:
$ cat /sys/kernel/mm/transparent_hugepage/enabled always madvise [never]
During cluster install, DataNode fails to install with the following error:
resource_management.core.exceptions.
Fail: Execution of '/usr/bin/yum -d 0 -e 0 -y install snappy-devel' returned 1.
Error: Package: snappy-devel-1.0.5-1.el6.x86_64 (HDP-UTILS-1.1.0.20)
Requires: snappy(x86-64) = 1.0.5-1.el6
Installed: snappy-1.1.0-3.el7.x86_64 (@anaconda/7.1)
snappy(x86-64) = 1.1.0-3.el7
Available: snappy-1.0.5-1.el6.x86_64 (HDP-UTILS-1.1.0.20)
snappy(x86-64) = 1.0.5-1.el6
When running Ambari Server as non-root, when enabling Kerberos, if kadmin fails to authenticate, you will see the following error in ambari-server.log if Ambari cannot access the kadmind.log.
STDERR: Couldn't open log file /var/log/kadmind.log: Permission denied
kadmin: GSS-API (or Kerberos) error while initializing kadmin interface
When adding client-only services to a cluster (using Add Service), Ambari does not automatically install dependent client components with the newly added clients.
When using an Agent non-root configuration, if you attempt to register hosts automatically using SSH, the Agent registration will fail.