-
Notifications
You must be signed in to change notification settings - Fork 40k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[1.3.2] Pod stuck in the ContainerCreating state while mounting secret timeout #29617
Comments
Still getting random Pod stuck while mounting secret on the GKE 1.3.3. |
@trane9991 Based on what you provided, you hit an issue where if a secret fails to unmount it will be unable to mount to any other pod. It was fixed by #28939. The fix is in v1.3.4, which should be in GKE soon. |
Closing this issue. Please re-open if you run into it in v1.3.4+ |
I'm experiencing this exact issue on 1.3.4 (albeit on AWS running jessie not GKE). Has anyone successfully found a workaround that doesn't involve patching k8s itself? We get random pods getting stuck in ContainerCreating (5-10%ish), which quite badly breaks our use case (spinning up containers on user request). We were previously on ~1.2.3 with no problems. I did downgrade to ~1.2.6 but had the same stalled container creation hang. (Sorry for lack of precise version numbers, on a phone) |
@boydgreenfield Based on what you've said you're experiencing this issue both with v1.2 and v1.3 which means it is likely not the issue described above. If you can provide more information I can help you debug: What version of master/node are you running. How are you deploying your pods? For the pods stuck in ContainerCreating, could you grab the |
@saad-ali – I was on a mobile device earlier but will respond with logs for 1.3 in the morning Pacific. Apologies about the less-than-complete error message this evening. |
@boydgreenfield No problem! And I'd love to see the 1.2 logs as well if you're experiencing the issue on both and still have access to them. |
Hello, I have GKE 1.3.2 version and my cluster stopped being able to mount Secret files as Volumes.
I'm keep getting:
after checking instances logs on the StackDriver i can see next message:
secrets with other names working normal, but the secret with name text-secret make pod hangs.
and not trying to restart it either.
UPDATE:
Looks like issue related to the #29059
Default token keeps being remounted and my secret never gets mounted and then pod failing with the timeout.
I just downgraded GKE nodes to the 1.2.5 version and everything seems ok for now.
Please take a look at this bug since it's pretty critical. Pods creating hangs randomly due inability to mount volume.
The text was updated successfully, but these errors were encountered: