Configure an HDFS federation
All the nodes in a federation share a common set of configuration files. To support the common configuration, you must configure a NameService ID for all the NameNodes that you want to include in the federation.
- Ensure that you are configuring the federation during a cluster maintenance window.
- Verify that you have configured HA for all the NameNodes that you want to include in the
federation. In addition, ensure that you have configured the value of
dfs.nameservices
for the NameNodes inhdfs-site.xml
.
A federation allows you to add new NameService IDs to a cluster. Each NameService denotes a new filesystem namespace. You can configure a maximum of four namespaces in a federated environment.
An active NameNode and its standby belong to the NameService ID. To support the common configuration, you must suffix the parameters for the various NameNodes in a federation with the NameService ID.
For example, if you define
ns2
as the NameService ID for a federation,
then you must add ns2
as a suffix to parameters such as
dfs.namenode.rpc-address
, dfs.namenode.http-address
, and
dfs.namenode.secondaryhttp-address
.Note | |
---|---|
This task explains how you can
configure an HDFS federation using the command line interface. For information about using
Ambari to configure a federation, see the topic Configure HDFS Federation in
the Ambari documentation. |
The following example shows the configuration for two NameNodes in a federation:
<configuration>
<property>
<name>dfs.nameservices</name>
<value>ns1,ns2</value>
</property>
<property>
<name>dfs.namenode.rpc-address.ns1</name>
<value>nn-host1:rpc-port</value>
</property>
<property>
<name>dfs.namenode.http-address.ns1</name>
<value>nn-host1:http-port</value>
</property>
<property>
<name>dfs.namenode.rpc-address.ns2</name>
<value>nn-host2:rpc-port</value>
</property>
<property>
<name>dfs.namenode.http-address.ns2</name>
<value>nn-host2:http-port</value>
</property>
.... Other common configuration ...
</configuration>