-
Notifications
You must be signed in to change notification settings - Fork 40k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Not enough ips in kubernetes-jenkins-pull #25629
Comments
Now we have a different problem. Someone or something deleted all of the Jenkins GCE VMs in that project. |
Sorry, I'm retarded. I did what i always do when there's a leak, free resources. Thought i was deleting disposable vms but i clearly messed that up. |
well, I will take this opportunity to fully clean up the project. The node e2e tests occasionally leak VMs. We need to improve our infrastructure to garbage-collect these. |
Things should be running now, closing. More than 60% of the ips are unused. |
So it looks like we go from 0-100, real quick. We're back at 90 something % usage of ips. |
Cleaned up leaked node e2e VMs:
|
Hmm, looks like we're maxing out 150 ips now (#25171 (comment))
Once bitten twice shy, @ixdy kick off your cleanup script? (or gc in a cron job till we fix the leak?) |
Yeah, cleaning up now. I'll make it part of our daily cleanup, but we need to figure out why so many of the node e2e tests are getting stuck, which is the root cause. @pwittrock |
/cc @kubernetes/sig-testing |
Root cause (leaking VMs) fixed. Should be in the clear now. |
@ixdy it occurred to me that an alert on quota > 75% indicates imminent flake. I'd subscribe to that if we had it. |
I'm working on setting up monitoring and alerting of our GCP quotas. It's unnecessarily difficult to integrate, unfortunately. |
Master didn't come up because there were no ips:
https://pantheon.corp.google.com/storage/browser/kubernetes-jenkins/pr-logs/pull/23567/kubernetes-pull-build-test-e2e-gce/40061/artifacts/
we're at:
so we're either running too much on that project or leaking ips.
@kubernetes/goog-testing
The text was updated successfully, but these errors were encountered: