-
Notifications
You must be signed in to change notification settings - Fork 40k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add a rate limiter to the GCE cloudprovider #23019
Conversation
It will poll for operation completion with at most 10 qps to avoid triggering GCE's rate limits.
Presumably too late for 1.2? Would be really nice for 1.2.1 though. |
Labelling this PR as size/M |
GCE e2e build/test failed for commit 6dc63f8. Please reference the list of currently known flakes when examining this failure. If you request a re-test, you must reference the issue describing the flake. |
@alex-mohr Is there an issue for this? |
Tests failed |
This is maybe debatable, but only P0s should be cherrypicked into the 1.2 branch beyond this point |
And TJ told me the issue is #21563 (please xref) |
Of course, it's possible there's an unanticipated interaction -- I'll check the retest results. Re: issue, apologies for not including the initial xref. Re: impact, it makes creating 1000 node clusters on GCE and GKE worse enough that I support it as an important enough reliability fix that it should go into 1.2, given the relatively contained nature of the PR. Willing to discuss further if you think it worth the time to do so. |
@alex-mohr |
Removing label |
GCE e2e build/test passed for commit 6dc63f8. |
GCE e2e build/test passed for commit 6dc63f8. |
Moving back to v1.2 and adding cherrypick label, as this is a candidate for 1.2.1. |
@davidopp builds are passing and polling rates dropped from > 100 qps to 10 qps as expected via manual testing. I couldn't think of a reasonable way to reliably unittest that the rate limiter is working as expected, but I'm not too concerned given code complexity -- let me know if you have ideas there or also think it's fine w/o. And if someone else can/should review, please do delegate to them! |
LGTM Very sorry for not seeing this sooner. |
And thanks for the fix. |
@k8s-bot test this Tests are more than 48 hours old. Re-running tests. |
GCE e2e build/test passed for commit 6dc63f8. |
@k8s-bot test this [submit-queue is verifying that this PR is safe to merge] |
GCE e2e build/test passed for commit 6dc63f8. |
Automatic merge from submit-queue |
Auto commit by PR queue bot
Auto commit by PR queue bot (cherry picked from commit 0fe049f)
Commit bfd84d4 found in the "release-1.2" branch appears to be this PR. Removing the "cherrypick-candidate" label. If this s an error find help to get your PR picked. |
Auto commit by PR queue bot (cherry picked from commit 0fe049f)
Auto commit by PR queue bot (cherry picked from commit 0fe049f)
Auto commit by PR queue bot (cherry picked from commit 0fe049f)
Auto commit by PR queue bot (cherry picked from commit 0fe049f)
It will poll for operation completion with at most 10 qps to avoid
triggering GCE's rate limits.
Without this PR, creating large 1000 node clusters can result in the routecontroller being too spammy.