Try the recommended solution for each of the following problems:.
If you reboot your cluster, you must restart the Ambari Server and all the Ambari Agents manually.
Charts appear incorrectly or not at all despite being available in the native Ganglia interface or Host health status is displayed incorrectly.
SUSE 11 ships with Python version 2.6.0-8.12.2 which contains a known defect that causes this crash.
As an option you can start the HBase REST server manually after the install process is complete. It can be started on any host that has the HBase Master or the Region Server installed. If you install the REST server on the same host as the Ambari server, the http ports will conflict.
On a cluster host ps aux | grep ambari-agent
shows more than
one agent process running. This causes Ambari Server to get incorrect ids from the
host and forces Agent to restart and re-register.
When a cluster is first started, some graphs, like Services View -> HDFS and Services View -> MapReduce, do not plot a complete hour of data, instead showing data only for the length of time the service has been running. Other graphs display the run of a complete hour.
The Hive Service uses MySQL Server by default. If you choose MySQL server as the database on the Ambari Server host as the managed server for Hive, Ambari stops this database during deployment and crashes.
JVM networkaddress.cache negative.ttl default setting of 10 (never cache) may result in DNS failure. Long, or multiple queries running on the JVM may fail. Occurs in Java 6,7, and 8.
Appropriate values for networkaddress.cache negative ttl depend on various system factors, including network traffic, cluster size, and resource availability. You can set Java VM options in an Ambari-installed cluster using the following procedure:
Edit the template for hadoop-env.sh file. Ambari deploys the template file on your cluster in the following location:
/var/lib/ambari-server/resources/stacks/{stackName}/{stackVersion}/hooks/before-START/templates/hadoop-env.sh.j2
where {stackName} and {stackVersion} refer to your specific stack name and version.
Change the following line in the template to add options to all Hadoop processes, then save the file.
export HADOOP_OPTS="-Djava.net.preferIPv4Stack=true ${HADOOP_OPTS}"
Restart Ambari server.
ambari-server restart
Restart affected services, using the Ambari Web UI.
The cluster fails to install with an error related to running “groupmod”. This can occur in environments where groups are managed in LDAP, and not on local Linux machines.
Fail: Execution of 'groupmod hadoop' returned 10. groupmod: group 'hadoop' does not exist in /etc/group
When using SLES and performing host registration using SSH, there are scenarios where Agent bootstrap fails due to timeout when running the setupAgent.py script. On the host with the timeout, you will see the following process hanging:
c6401.ambari.apache.org:/etc/ # ps -ef | grep zypper root 18318 18317 5 03:15 pts/1 00:00:00 zypper -q search -s --match-exact ambari-agent
If you have a repository registered that is prompting to accept keys, via user interaction, you may see the hang and timeout. In this case, run "zypper refresh" and confirm all repository keys are accepted for the zypper command to work without user interaction.
Another alternative is to perform manual Agent setup and not use SSH for host registration. This option does not require that Ambari call zypper without user interaction.