Table maintenance Support in Polaris #538

@flyrain

Description

Describe the solution you'd like

Iceberg table maintenance allows users to focus on their business needs without worrying about table management. It ensures tables stay optimized, perform efficiently, and comply with governance policies like data retention. Common maintenance tasks include data compaction, metadata compaction, snapshot expiration, partition stats collection, and orphan file removal.
A Table Maintenance System (TMS) can operate independently of a catalog like Polaris, providing maintenance services even without direct integration. However, it works more effectively when paired with a catalog. Polaris, in turn, should allow pluggable TMS options so users can choose the one that fits their needs. This proposal outlines how Polaris supports Iceberg table maintenance, defining its role and clarifying what it should handle and what it should leave to the TMS.
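
To make the "pluggable TMS" idea concrete, here is a minimal sketch of what the catalog/TMS boundary could look like. Everything in it is hypothetical and for illustration only: `TableMaintenanceService`, `InMemoryTMS`, `Catalog`, and their methods are not Polaris or Iceberg APIs, and the split assumed here (the catalog records which tasks apply to a table, the TMS executes them) is one possible reading of the proposal, not its definition. Only the task names come from the list above.

```python
from dataclasses import dataclass, field
from enum import Enum
from typing import Protocol


class MaintenanceTask(Enum):
    # Task names taken from the list in this issue.
    DATA_COMPACTION = "data_compaction"
    METADATA_COMPACTION = "metadata_compaction"
    SNAPSHOT_EXPIRATION = "snapshot_expiration"
    PARTITION_STATS_COLLECTION = "partition_stats_collection"
    ORPHAN_FILE_REMOVAL = "orphan_file_removal"


class TableMaintenanceService(Protocol):
    """Hypothetical plug-in contract; NOT an actual Polaris interface."""

    def submit(self, table: str, task: MaintenanceTask) -> str:
        """Accept a maintenance request and return a job id."""
        ...


@dataclass
class InMemoryTMS:
    """Toy TMS that only records requests, standing in for a real engine."""

    submitted: list = field(default_factory=list)

    def submit(self, table: str, task: MaintenanceTask) -> str:
        self.submitted.append((table, task))
        return f"job-{len(self.submitted)}"


@dataclass
class Catalog:
    """Stand-in for the catalog side (assumption): stores which tasks
    apply to each table and delegates execution to the plugged-in TMS."""

    tms: TableMaintenanceService
    policies: dict = field(default_factory=dict)

    def set_policy(self, table: str, tasks: list) -> None:
        self.policies[table] = list(tasks)

    def run_maintenance(self, table: str) -> list:
        # Tables without a policy produce no jobs.
        return [self.tms.submit(table, t) for t in self.policies.get(table, [])]


if __name__ == "__main__":
    catalog = Catalog(tms=InMemoryTMS())
    catalog.set_policy(
        "db.events",
        [MaintenanceTask.SNAPSHOT_EXPIRATION, MaintenanceTask.ORPHAN_FILE_REMOVAL],
    )
    print(catalog.run_maintenance("db.events"))  # ['job-1', 'job-2']
```

Because `Catalog` depends only on the `submit` contract, any other TMS implementation could be swapped in without changing the catalog side, which is the property the pluggability requirement asks for.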

Here is the proposal: https://docs.google.com/document/d/1Pd_mzZcfvnUvcH98IbwsIYf4eryet1lQDfclKYx-t-M/edit?usp=sharing

Describe alternatives you've considered

No response

Additional context

No response

Status

In Progress
