[Bug]: L0 compaction cannot keep up with upsert; dataNode memory usage suddenly increases for no apparent reason #34258
Labels
kind/bug
Issues or changes related a bug
severity/critical
Critical, lead to crash, data missing, wrong result, function totally doesn't work.
triage/accepted
Indicates an issue or PR is ready to be actively worked on.
Milestone
Is there an existing issue for this?
Environment
Current Behavior
deploy milvus with dataNode and config
test steps
result
L0 compaction cannot keep up with upsert
After about 5 hours, L0 compaction cannot keep up with upsert, and the number of flushed-L0 segments increases significantly.
The reason may be that the available memory of L0-compaction is relatively small. Because the available memory increases after the upsert is completed, the L0-compaction latency is reduced to 13 minutes
grafana link
L0 compaction after upsert completed
grafana link
dataNode memory usage during L0-compaction
After the upsert is completed, the available memory for L0 compaction increases and the L0-compaction latency is significantly reduced. However, the dn memory usage rises to 80% for no apparent reason.
grafana link
Expected Behavior
No response
Steps To Reproduce
Milvus Log
grafana link: metrics of compact-no-flush-1b1
pods:
Anything else?
No response
The text was updated successfully, but these errors were encountered: