[BUG] Enabling replica-auto-balance tries to replicate to disabled nodes causing lots of errors in the logs and in the UI #6508
Comments
@c3y1huang any progress on this?
@jsalatiel I've had a few other tasks on my plate, but I will have the fix out soon.
Pre Ready-For-Testing Checklist
Verified pass on longhorn-master (longhorn-manager In master-head, enable |
Describe the bug (🐛 if you encounter this issue)
I have a 3-zone cluster with create-default-disk-labeled-nodes set to true.
Zone 1 and Zone 2 each have 2 untainted nodes labeled node.longhorn.io/create-default-disk=true, and each of those nodes stores replicas.
Zone 3 has a single node that does not store any replicas (so the label is not set), but it can still mount volumes from the Longhorn storage class.
It looks like this in the UI.
All the volumes are in a healthy state:
The moment I set replica-auto-balance to best-effort, all volumes start reporting that the volume cannot be scheduled.
I suppose that since node0 is in another zone, best-effort tries to schedule a replica there even though that node is disabled, which it should not do.
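To illustrate the behavior I would expect, here is a minimal Go sketch. It is not Longhorn's actual scheduler code: the `Node` type, the `eligibleZones` function, and all node names except node0 are hypothetical, made up for this example. The idea is only that zones whose nodes are all disabled should be dropped before any zone balancing is attempted.

```go
package main

import (
	"fmt"
	"sort"
)

// Node is a hypothetical, simplified node record; it is not a Longhorn type.
type Node struct {
	Name        string
	Zone        string
	Schedulable bool // false when the node has no disk or scheduling is disabled
}

// eligibleZones returns, in sorted order, the zones that contain at least one
// schedulable node. Zones made up only of disabled nodes are skipped, so a
// balancer built on top of this would never pick them as a replica target.
func eligibleZones(nodes []Node) []string {
	seen := map[string]bool{}
	for _, n := range nodes {
		if n.Schedulable {
			seen[n.Zone] = true
		}
	}
	zones := make([]string, 0, len(seen))
	for z := range seen {
		zones = append(zones, z)
	}
	sort.Strings(zones)
	return zones
}

func main() {
	// Mirrors the cluster described above: two storage nodes in each of
	// zone1 and zone2, and a single disabled node (node0) in zone3.
	nodes := []Node{
		{Name: "node1", Zone: "zone1", Schedulable: true},
		{Name: "node2", Zone: "zone1", Schedulable: true},
		{Name: "node3", Zone: "zone2", Schedulable: true},
		{Name: "node4", Zone: "zone2", Schedulable: true},
		{Name: "node0", Zone: "zone3", Schedulable: false},
	}
	fmt.Println(eligibleZones(nodes)) // prints [zone1 zone2]
}
```

With this kind of filter in place, zone3 would not count as a balancing target at all, so the volumes would stay healthy with replicas spread across zone1 and zone2 only.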
Support bundle for troubleshooting
Environment
Additional context