[BUG] Unable to create snapshot: cannot get engine client because it isn't deployed #7438
Comments
Hi @yangchiu How did you capture the state of the engine image at the time the test failed?
The test case will check the status before taking a snapshot:
Which ensure
Analysis
When deleting the engineimage DaemonSet (similar to the test step in test_engine_image_daemonset_restart), watching the status of the engineimage CR shows the following sequence of updates:
---
apiVersion: longhorn.io/v1beta2
kind: EngineImage
metadata:
  creationTimestamp: "2023-12-27T00:20:43Z"
  finalizers:
  - longhorn.io
  generation: 1
  labels:
    longhorn.io/component: engine-image
    longhorn.io/engine-image: ei-b907910b
    longhorn.io/managed-by: longhorn-manager
  name: ei-b907910b
  namespace: longhorn-system
  resourceVersion: "609096"
  uid: 0bfb1dca-b8d9-4d3a-88b8-206369986dee
spec:
  image: longhornio/longhorn-engine:master-head
status:
  buildDate: "2023-12-05T06:21:33+00:00"
  cliAPIMinVersion: 3
  cliAPIVersion: 9
  conditions:
  - lastProbeTime: ""
    lastTransitionTime: "2023-12-27T21:26:55Z"
    message: engine binary check failed
    reason: daemonSet
    status: "False"
    type: ready
  controllerAPIMinVersion: 3
  controllerAPIVersion: 5
  dataFormatMinVersion: 1
  dataFormatVersion: 1
  gitCommit: 014125a20fd26e78be607365b6d090fe93d1f00f
  noRefSince: "2023-12-27T21:10:00Z"
  nodeDeploymentMap:
    phan-v607-engine-image-pool2-e1b8ff09-cmchx: false
    phan-v607-engine-image-pool2-e1b8ff09-ktjfk: true
    phan-v607-engine-image-pool2-e1b8ff09-nk75k: true
  ownerID: phan-v607-engine-image-pool2-e1b8ff09-nk75k
  refCount: 0
  state: deploying
  version: 014125a2
---
apiVersion: longhorn.io/v1beta2
kind: EngineImage
metadata:
  creationTimestamp: "2023-12-27T00:20:43Z"
  finalizers:
  - longhorn.io
  generation: 1
  labels:
    longhorn.io/component: engine-image
    longhorn.io/engine-image: ei-b907910b
    longhorn.io/managed-by: longhorn-manager
  name: ei-b907910b
  namespace: longhorn-system
  resourceVersion: "609150"
  uid: 0bfb1dca-b8d9-4d3a-88b8-206369986dee
spec:
  image: longhornio/longhorn-engine:master-head
status:
  buildDate: "2023-12-05T06:21:33+00:00"
  cliAPIMinVersion: 3
  cliAPIVersion: 9
  conditions:
  - lastProbeTime: ""
    lastTransitionTime: "2023-12-27T21:26:59Z"
    message: Engine image ei-b907910b (longhornio/longhorn-engine:master-head) is
      fully deployed on all ready nodes
    reason: ""
    status: "True"
    type: ready
  controllerAPIMinVersion: 3
  controllerAPIVersion: 5
  dataFormatMinVersion: 1
  dataFormatVersion: 1
  gitCommit: 014125a20fd26e78be607365b6d090fe93d1f00f
  noRefSince: "2023-12-27T21:10:00Z"
  nodeDeploymentMap:
    phan-v607-engine-image-pool2-e1b8ff09-cmchx: true
    phan-v607-engine-image-pool2-e1b8ff09-ktjfk: true
    phan-v607-engine-image-pool2-e1b8ff09-nk75k: true
  ownerID: phan-v607-engine-image-pool2-e1b8ff09-nk75k
  refCount: 0
  state: deployed
  version: 014125a2
---
apiVersion: longhorn.io/v1beta2
kind: EngineImage
metadata:
  creationTimestamp: "2023-12-27T00:20:43Z"
  finalizers:
  - longhorn.io
  generation: 1
  labels:
    longhorn.io/component: engine-image
    longhorn.io/engine-image: ei-b907910b
    longhorn.io/managed-by: longhorn-manager
  name: ei-b907910b
  namespace: longhorn-system
  resourceVersion: "609161"
  uid: 0bfb1dca-b8d9-4d3a-88b8-206369986dee
spec:
  image: longhornio/longhorn-engine:master-head
status:
  buildDate: "2023-12-05T06:21:33+00:00"
  cliAPIMinVersion: 3
  cliAPIVersion: 9
  conditions:
  - lastProbeTime: ""
    lastTransitionTime: "2023-12-27T21:27:01Z"
    message: 'Engine image is not fully deployed on all nodes: 2 of 3'
    reason: daemonSet
    status: "False"
    type: ready
  controllerAPIMinVersion: 3
  controllerAPIVersion: 5
  dataFormatMinVersion: 1
  dataFormatVersion: 1
  gitCommit: 014125a20fd26e78be607365b6d090fe93d1f00f
  noRefSince: "2023-12-27T21:10:00Z"
  nodeDeploymentMap:
    phan-v607-engine-image-pool2-e1b8ff09-cmchx: true
    phan-v607-engine-image-pool2-e1b8ff09-ktjfk: true
    phan-v607-engine-image-pool2-e1b8ff09-nk75k: false
  ownerID: phan-v607-engine-image-pool2-e1b8ff09-nk75k
  refCount: 0
  state: deploying
  version: 014125a2
---
apiVersion: longhorn.io/v1beta2
kind: EngineImage
metadata:
  creationTimestamp: "2023-12-27T00:20:43Z"
  finalizers:
  - longhorn.io
  generation: 1
  labels:
    longhorn.io/component: engine-image
    longhorn.io/engine-image: ei-b907910b
    longhorn.io/managed-by: longhorn-manager
  name: ei-b907910b
  namespace: longhorn-system
  resourceVersion: "609162"
  uid: 0bfb1dca-b8d9-4d3a-88b8-206369986dee
spec:
  image: longhornio/longhorn-engine:master-head
status:
  buildDate: "2023-12-05T06:21:33+00:00"
  cliAPIMinVersion: 3
  cliAPIVersion: 9
  conditions:
  - lastProbeTime: ""
    lastTransitionTime: "2023-12-27T21:27:01Z"
    message: 'Engine image is not fully deployed on all nodes: 2 of 3'
    reason: daemonSet
    status: "False"
    type: ready
  controllerAPIMinVersion: 3
  controllerAPIVersion: 5
  dataFormatMinVersion: 1
  dataFormatVersion: 1
  gitCommit: 014125a20fd26e78be607365b6d090fe93d1f00f
  noRefSince: "2023-12-27T21:10:00Z"
  nodeDeploymentMap:
    phan-v607-engine-image-pool2-e1b8ff09-cmchx: true
    phan-v607-engine-image-pool2-e1b8ff09-ktjfk: true
    phan-v607-engine-image-pool2-e1b8ff09-nk75k: false
  ownerID: phan-v607-engine-image-pool2-e1b8ff09-ktjfk
  refCount: 0
  state: deploying
  version: 014125a2
---
apiVersion: longhorn.io/v1beta2
kind: EngineImage
metadata:
  creationTimestamp: "2023-12-27T00:20:43Z"
  finalizers:
  - longhorn.io
  generation: 1
  labels:
    longhorn.io/component: engine-image
    longhorn.io/engine-image: ei-b907910b
    longhorn.io/managed-by: longhorn-manager
  name: ei-b907910b
  namespace: longhorn-system
  resourceVersion: "609165"
  uid: 0bfb1dca-b8d9-4d3a-88b8-206369986dee
spec:
  image: longhornio/longhorn-engine:master-head
status:
  buildDate: "2023-12-05T06:21:33+00:00"
  cliAPIMinVersion: 3
  cliAPIVersion: 9
  conditions:
  - lastProbeTime: ""
    lastTransitionTime: "2023-12-27T21:27:01Z"
    message: Engine image ei-b907910b (longhornio/longhorn-engine:master-head) is
      fully deployed on all ready nodes
    reason: ""
    status: "True"
    type: ready
  controllerAPIMinVersion: 3
  controllerAPIVersion: 5
  dataFormatMinVersion: 1
  dataFormatVersion: 1
  gitCommit: 014125a20fd26e78be607365b6d090fe93d1f00f
  noRefSince: "2023-12-27T21:10:00Z"
  nodeDeploymentMap:
    phan-v607-engine-image-pool2-e1b8ff09-cmchx: true
    phan-v607-engine-image-pool2-e1b8ff09-ktjfk: true
    phan-v607-engine-image-pool2-e1b8ff09-nk75k: true
  ownerID: phan-v607-engine-image-pool2-e1b8ff09-ktjfk
  refCount: 0
  state: deployed
  version: 014125a2
---
The flapping flow
When engineimage flaps between I think this behavior is not a bug in Longhorn. We just need to adjust the e2e test so that it doesn't expect the
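One way the test-side adjustment could look is sketched below in Python. It is only an illustration under assumptions, not the actual longhorn-tests change: `is_fully_deployed`, `wait_for_stable_deployment`, and the `fetch` callable are hypothetical names, and `fetch` stands in for however the test reads the EngineImage CR as a dict. The field names (`state`, `conditions`, `nodeDeploymentMap`) are taken from the status dumps above. Instead of trusting the first observation that looks ready, the helper requires several consecutive fully deployed observations before the test proceeds to take a snapshot.

```python
import time
from typing import Any, Callable, Mapping


def is_fully_deployed(engine_image: Mapping[str, Any]) -> bool:
    """Check one EngineImage observation (hypothetical helper).

    True only if state is "deployed", the "ready" condition is "True",
    and every node in nodeDeploymentMap is marked true.
    """
    status = engine_image.get("status", {})
    ready = any(
        cond.get("type") == "ready" and cond.get("status") == "True"
        for cond in status.get("conditions", [])
    )
    node_map = status.get("nodeDeploymentMap", {})
    return (
        status.get("state") == "deployed"
        and ready
        and bool(node_map)
        and all(node_map.values())
    )


def wait_for_stable_deployment(
    fetch: Callable[[], Mapping[str, Any]],
    consecutive: int = 5,
    interval: float = 1.0,
    max_polls: int = 300,
    sleep: Callable[[float], None] = time.sleep,
) -> int:
    """Poll until `consecutive` observations in a row are fully deployed.

    `fetch` is an assumed callable returning the current EngineImage as a
    dict. Returns the number of polls used, or raises TimeoutError.
    """
    streak = 0
    for poll in range(1, max_polls + 1):
        # Any non-deployed observation resets the streak, so a single
        # flap back to "deploying" restarts the wait.
        streak = streak + 1 if is_fully_deployed(fetch()) else 0
        if streak >= consecutive:
            return poll
        sleep(interval)
    raise TimeoutError(
        f"engine image not stable after {max_polls} polls"
    )
```

The `consecutive` and `interval` defaults are arbitrary placeholders; suitable values depend on how long the DaemonSet pods take to settle in a given environment.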
Pre Ready-For-Testing Checklist
Verified passed on master-head (longhorn-tests 9b36237) by running test case
Test results: https://ci.longhorn.io/job/private/job/longhorn-tests-regression/5733/ ~ https://ci.longhorn.io/job/private/job/longhorn-tests-regression/5742/
Describe the bug (🐛 if you encounter this issue)
Run test case test_engine_image_daemonset_restart repeatedly. After the engine image DaemonSet restarted, even though the engine image status.conditions[0].status had become True, it failed at the create_snapshot step with an error message, with reproducibility of about 20%:
https://ci.longhorn.io/job/private/job/longhorn-tests-regression/5560/
Not sure if it's a bug or the test case needs to be refined.
To Reproduce
Run test case test_engine_image_daemonset_restart repeatedly.
Expected behavior
Support bundle for troubleshooting
Environment
Additional context