Profiler data testing

You must note the important information about profiler services.

Test data for VM-based environments

The following dataset has been validated and works as expected:

  • DataHub Master: m5.4xlarge
  • Hive tables: 3000 Hive assets
  • Total Number of assets (including Hive columns, tables, databases): 1,000,000
  • Total Data Size = 1 GB
  • Partitions on Hive tables: Around 5000 partitions spread across five tables