Cloudera Lakehouse Optimizer for Iceberg table maintenance
The Cloudera Lakehouse Optimizer service in Cloudera Base on premises 7.3.2 or higher versions cluster provides
automated Iceberg table maintenance, using Spark jobs, for Iceberg tables in Cloudera Open Data
Lakehouse. The service simplifies table management, improves query performance, and reduces
operational costs.
Cloudera Lakehouse Optimizer for Iceberg table maintenance The Cloudera Lakehouse Optimizer service in Cloudera Base on premises 7.3.2 or higher versions cluster provides automated Iceberg table maintenance, using Spark jobs, for Iceberg tables in Cloudera Open Data Lakehouse. The service simplifies table management, improves query performance, and reduces operational costs.Use cases for Cloudera Lakehouse Optimizer Cloudera Lakehouse Optimizer is a service in Cloudera on cloud Management Console that automates Iceberg table maintenance for Open Data Lakehouse users, leveraging on all of the optimization actions available with Iceberg. You can use Cloudera Lakehouse Optimizer for various use cases.Understand and prepare to use Cloudera Lakehouse Optimizer Before you use Cloudera Lakehouse Optimizer , you must understand how a policy works, the policy resources, and the prerequisites you must complete before you create a policy.Using Cloudera Lakehouse Optimizer REST APIs You can use Cloudera Lakehouse Optimizer REST APIs to create, maintain, and monitor Cloudera Lakehouse Optimizer policies. You can perform several maintenance operations on the Iceberg tables using REST APIs.Manage and monitor Cloudera Lakehouse Optimizer table maintenance tasks You can perform several policy managing activities, such as pausing and resuming table maintenance, associating policies, and fetching maintenance task details. You can also perform several monitoring tasks, such as verifying whether the tasks completed successfully, monitoring the Spark jobs on the Cloudera Observability dashboard, and viewing logs to troubleshoot issues.