Flow Management cluster layout

The Data Hub service includes two default Flow Management cluster definitions. These are the Flow Management: Light Duty and Flow Management: Heavy Duty definitions. Learn about the layout, capacity, and components of these definitions.

Flow Management: Light Duty cluster layout

You can use a Flow Management: Light Duty cluster definition in development, testing, or proof of concept scenarios. Light duty clusters include the following nodes and components (services):

Each of the cluster nodes includes:

  • NiFi and ZooKeeper co-located on all instances

  • For each node hosting NiFi and ZooKeeper:
    • AWS: m5.2xlarge
    • Azure: D8_v3
    • GCP: e2-standard-8
  • Storage requirements per NiFi node:
    • AWS: 4 x 500 GB EBS ST1
    • Azure 4 x 500 GB Standard SSD
    • GCP: 4 x 500 GB PD-Standard
  • Each NiFi node hosts the following repositories
    • FlowFile repository
    • Content repository
    • Provenance repository
    • Log and Database repository

For more information, see the Instance types and Storage information specific to your cloud provider.

Flow Management: Heavy Duty cluster layout

You can use the Flow Management: Heavy Duty cluster definition in production scenarios. Heavy duty clusters include the following nodes and components (services):

The cluster definition includes:

  • NiFi and ZooKeeper on separate nodes

  • NiFi nodes scale independently of ZooKeeper

  • For each ZooKeeper node:
    • AWS – m5.2xlarge
    • Azure – D8_v3
    • GCP: e2-standard-8
  • For each NiFi node:
    • AWS – m5.2xlarge
    • Azure – F16sv2
    • GCP: e2-standard-8
  • Storage requirements per NiFi node:
    • AWS – 4 x 1 TB EBS GP2
    • Azure – 4 x 1 TB Premium SSD
    • GCP: 4 x 1 TB PD-SSD
  • Each NiFi node hosts the following repositories
    • FlowFile repository
    • Content repository
    • Provenance repository
    • Log and Database repository

For more information, see the Instance types and Storage information specific to your cloud provider.