-
Notifications
You must be signed in to change notification settings - Fork 609
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[BUG] Longhorn manager crashed during backing image 100gb volume export #5209
Comments
cc @longhorn/qa for coverage |
The error log in the backing-image-ds pod:
This is a HTTP error, which is probably caused by the data-transfer connection between the backing image ds pod and the replica process. Waiting for the user to figure out the cause or provide more info. |
@shuo-wu i think the error at the end is from the --follow command while streaming the logs. The actual last line of the log is: support bundle attached. |
This means the pod is blindly trying to reuse the existing file if possible. Here there is no existing file, hence the pod will give up the reusage then follow the regular flow. As I mentioned, this is not an error... |
last night i shut down all non-essential vms, had not very much load on my harvester cluster, and tried again. same issue. Today i put one of the nodes (Harveter-01 into maintenance mode and rebooted it. when it came back up and tried to sync the degraded volumes, i started seeing more errors like this: additional support bundle attached. |
still having the same issue. i've added two additional nodes to my cluster as well, and now it fails at 24% instead of 28%. every time, same deal. i cant export volumes at all it seems. |
Maybe this is also related to get checksum timeout issue #5443 Based on the log
And the error seems to be http error in data source pod in the end
@albertkohl-monotek can you successfully create backingimage by exporting smaller voulme (say smaller than 5G) cc @shuo-wu |
Hi I revisited the issue and found this is actually a duplicated issue of #4865 The root cause was because
That's why it kept restarting and the controller kept crashing in the case. This issue was already fixed. |
I have verified that this issue no longer happens after version v1.4.1
It failed many times without panic and successfully created the backing image |
Pre Ready-For-Testing Checklist
|
Verified Create backing image with export-volume type( source volume size greater than 50 Gi) success |
Describe the bug (🐛 if you encounter this issue)
To Reproduce
Steps to reproduce the behavior:
Expected behavior
Log or Support bundle
manager.txt
backing.txt
Environment
Additional context
https://rancher-users.slack.com/archives/CC2UQM49Y/p1672880888519919
The text was updated successfully, but these errors were encountered: