MergeTree Parts Not Merging Automatically with Frequent Inserts, Seeking Optimization Methods

Hello chdb team,

I am using chdb for a use case that involves periodically inserting small batches of data into a MergeTree table. I have noticed that over time, each insert creates a new, small data part (a separate file/directory on disk).

The Problem:
Unlike a standard ClickHouse server, these small parts do not seem to be automatically merged in the background. This results in a large number of small data parts for a single table. Consequently, the query performance on this table degrades significantly, as the query engine has to read from many different files.

My Questions
Does chdb have an automatic background merge process for MergeTree tables like a standard ClickHouse server? If so, is there any configuration required to enable or tune it?

Or is there a way to manually trigger a merge for all the data parts in a table? Are there any recommended best practices or MergeTree settings for write-intensive scenarios with frequent, small inserts in chdb? 

The follows are some screenshots

<img width="766" height="550" alt="Image" src="https://github.com/user-attachments/assets/19d2c94f-325d-4794-9d1b-ad091c2dc67a" />

<img width="1162" height="504" alt="Image" src="https://github.com/user-attachments/assets/35c8ea0a-f483-422a-82b9-8485c796e29a" />

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

MergeTree Parts Not Merging Automatically with Frequent Inserts, Seeking Optimization Methods #365

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Uh oh!

MergeTree Parts Not Merging Automatically with Frequent Inserts, Seeking Optimization Methods #365

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions