Ozone architecture
Ozone can be co-located with HDFS with single security and governance policies for easy data exchange or migration and also offers seamless application portability. Ozone has a scale-out architecture with minimal operational overheads. Ozone separates management of namespaces and storage, helping it to scale effectively. The Ozone Manager (OM) manages the namespaces while the Storage Container Manager (SCM) handles the containers.
- Hadoop Distributed Data Store
- Ozone is built on a highly available, replicated block storage layer called Hadoop Distributed Data Store (HDDS).
- Blocks
- Blocks are the basic unit of storage. In Ozone, each block is of 256 MB in size. A collection of blocks forms a storage container. The SCM allocates blocks inside storage containers for the client to store data.
- Storage Containers
- A storage container is a group of unrelated blocks managed together as a single entity. A container exists in a DataNode and is the basic unit of replication, with a capacity of 2 GB to 16 GB.
- DataNodes
- DataNodes contain storage containers comprising of data blocks. The SCM monitors DataNodes through heartbeats.
- Ozone Manager
- The Ozone Manager (OM) is the metadata manager for Ozone. The OM manages the following
storage elements:
- The list of volumes for each user
- The list of buckets for each volume
- The list of keys for each bucket
- Ozone Manager
- The Ozone Manager (OM) is a highly available namespace manager for Ozone.
OM manages the metadata for volumes, buckets, and keys. OM maintains the mappings between keys and their corresponding Block IDs. When a client application requests for keys to perform read and write operations, OM interacts with SCM for information about blocks relevant to the read and write operations, and provides this information to the client. In addition, OM also handles metadata operations from the clients.
- Storage Container Manager
- The Storage Container Manager performs multiple critical functions for an Ozone cluster.
- Pipelines
- Pipelines determine the replication strategy for the blocks associated with a write operation.
- Recon Server
- Recon is the management interface for Ozone. Recon provides a unified management API for Ozone.