[BUG] Duplicated default instance manager leads to engine/replica cannot be started #3000
Closed
Description
Describe the bug
All existing rwx are not attaching anymore after upgrading to 1.2.
To Reproduce
Existing volumes are not attaching to redeployed pods. Not even after setting the workload to zero. Restarted longhorn components and longhorn nodes.
Expected behavior
Volumes should attach.
Log
If applicable, add the Longhorn managers' log when the issue happens.
sent longhorn bundle
AttachVolume.Attach failed for volume "pvc-93aad038-6dda-482f-a8f6-d237a0414561" : rpc error: code = DeadlineExceeded desc = volume pvc-93aad038-6dda-482f-a8f6-d237a0414561 failed to attach to node pax-p-95
Environment:
- Longhorn version: 1.2
- Installation method (e.g. Rancher Catalog App/Helm/Kubectl): Rancher Catalog App
- Kubernetes distro (e.g. RKE/K3s/EKS/OpenShift) and version: rancher kubernets v1.20.10
- Number of management node in the cluster: 3
- Number of worker node in the cluster: 3
- Node config
- OS type and version: Ubuntu 20.04.3 LTS
- CPU per node: 12
- Memory per node: 32G
- Disk type(e.g. SSD/NVMe):
- Network bandwidth between the nodes:
- Underlying Infrastructure (e.g. on AWS/GCE, EKS/GKE, VMWare/KVM, Baremetal): Xen
- Number of Longhorn volumes in the cluster: 80
Additional context
Add any other context about the problem here.
Metadata
Assignees
Labels
Type
Projects
Status
Resolved
Status
Closed