additional backoff in azure cloudprovider #48967

jackfrancis · 2017-07-14T22:22:03Z

What this PR does / why we need it:

We want to be able to opt in to backoff retry logic for requests that originate from the kubelet: node IP address resolution and node load balancer pool membership enforcement.

Special notes for your reviewer:

The use case for this is Azure cloudprovider clusters with large node counts, especially during cluster installation or in other scenarios where many nodes come online at once and attempt to register all their resources with the backend API. To give clusters at scale more control over the in-cluster API request rate, the backoff config can meaningfully slow this rate down when appropriate.

Release note:

EnsureHostInPool() submits a GET to the Azure API for VM info. We’re seeing this on agent node kubelets and would like to enable configurable backoff for 4xx responses, so that the rate of reconciliation can be slowed down when appropriate.
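For readers unfamiliar with the pattern, here is a minimal, standard-library-only sketch of retrying a request with exponential backoff when it returns a 4xx status. It is not the code in this PR: `Backoff`, `RetryWithBackoff`, `ErrRetriesExhausted`, and the simulated status codes are hypothetical names and values chosen for illustration, and the real cloud provider config fields may differ.

```go
package main

import (
	"errors"
	"fmt"
	"time"
)

// Backoff holds illustrative retry settings (hypothetical names, not the
// cloud provider's actual config fields).
type Backoff struct {
	Duration time.Duration // wait before the first retry
	Factor   float64       // multiplier applied to the wait after each retry
	Steps    int           // maximum number of attempts
}

// ErrRetriesExhausted is returned when no attempt succeeded within Steps tries.
var ErrRetriesExhausted = errors.New("backoff: retries exhausted")

// RetryWithBackoff calls condition until it reports done, returns an error,
// or the attempts run out. Returning (false, nil) means "failed, try again".
func RetryWithBackoff(b Backoff, condition func() (bool, error)) error {
	wait := b.Duration
	for attempt := 0; attempt < b.Steps; attempt++ {
		if attempt > 0 {
			time.Sleep(wait)
			wait = time.Duration(float64(wait) * b.Factor)
		}
		done, err := condition()
		if err != nil {
			return err
		}
		if done {
			return nil
		}
	}
	return ErrRetriesExhausted
}

func main() {
	// Simulated API responses: two throttled calls (HTTP 429), then success.
	statuses := []int{429, 429, 200}
	attempt := 0
	err := RetryWithBackoff(Backoff{Duration: 10 * time.Millisecond, Factor: 2, Steps: 5}, func() (bool, error) {
		status := statuses[attempt]
		attempt++
		if status >= 400 && status < 500 {
			fmt.Printf("backoff: failure, will retry, status=%d\n", status)
			return false, nil
		}
		fmt.Println("backoff: success")
		return true, nil
	})
	fmt.Println("attempts:", attempt, "err:", err)
}
```

Because the wait grows geometrically, a burst of nodes that all fail at once spreads its retries over a progressively longer window instead of hitting the API again immediately.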

k8s-ci-robot · 2017-07-14T22:22:10Z

Hi @jackfrancis. Thanks for your PR.

I'm waiting for a kubernetes member to verify that this patch is reasonable to test. If it is, they should reply with /ok-to-test on its own line. Until that is done, I will not automatically test new commits in this PR, but the usual testing commands by org members will still work. Regular contributors should join the org to skip this step.

I understand the commands that are listed here.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

seanknox · 2017-07-14T22:46:29Z

/sig azure

seanknox · 2017-07-14T22:46:49Z

/assign @brendandburns @colemickens

jdumars · 2017-07-14T22:47:28Z

/cc @colemickens

k8s-ci-robot · 2017-07-14T22:47:29Z

@jdumars: GitHub didn't allow me to request PR reviews from the following users: brendanburns.

Note that only kubernetes members can review this PR, and authors cannot review their own PRs.

In response to this:

/cc @colemickens
/cc @brendanburns

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

jdumars · 2017-07-14T22:48:25Z

/cc @brendandburns

colemickens

minor nit, makes sense otherwise, lgtm

colemickens · 2017-07-14T23:17:40Z

pkg/cloudprovider/providers/azure/azure_backoff.go

+			glog.Errorf("backoff: failure, will retry,err=%v", retryErr)
+			return false, nil
+		}
+		glog.V(2).Infof("backoff: success")


nit: maybe a little lower, we don't log much at all at 2 elsewhere.

jackfrancis · 2017-07-15

Ha! You mean I have to bust out the --v tool?

colemickens · 2017-07-14T23:18:53Z

/lgtm but I strongly suggest adding this backoff to the call in Instances as well.

also rate limiting the call to az.getVirtualMachine inside az.getIPForMachine

brendandburns · 2017-07-19T19:07:04Z

/lgtm

k8s-github-robot · 2017-07-19T19:08:32Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: brendandburns, colemickens, jackfrancis

Associated issue: 48971

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these OWNERS Files:

~~pkg/cloudprovider/providers/azure/OWNERS~~ [brendandburns,colemickens]

You can indicate your approval by writing /approve in a comment
You can cancel your approval by writing /approve cancel in a comment

k8s-github-robot · 2017-07-20T03:05:33Z

Automatic merge from submit-queue (batch tested with PRs 49218, 48253, 48967, 48460, 49230)

k8s-ci-robot added the cncf-cla: yes and needs-ok-to-test labels Jul 14, 2017

k8s-github-robot assigned piosz and thockin Jul 14, 2017

k8s-github-robot added the size/S and release-note labels Jul 14, 2017

k8s-ci-robot added the sig/azure label Jul 14, 2017

k8s-ci-robot assigned brendandburns and colemickens Jul 14, 2017

k8s-ci-robot requested a review from colemickens July 14, 2017 22:47

k8s-ci-robot requested a review from brendandburns July 14, 2017 22:48

colemickens approved these changes Jul 14, 2017

backing off az.getIPForMachine in az.NodeAddresses

f76ef29

also rate limiting the call to az.getVirtualMachine inside az.getIPForMachine
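As a rough illustration of what rate limiting an API call can look like, the sketch below gates each call behind a simple token bucket built only on the standard library. It is an assumption-laden stand-in, not the limiter used in this commit: `TokenBucket`, `NewTokenBucket`, `TryAccept`, `Accept`, the stub `getVirtualMachine`, and the chosen rate and burst values are all hypothetical.

```go
package main

import (
	"fmt"
	"sync"
	"time"
)

// TokenBucket is a minimal rate limiter: it refills at qps tokens per second
// up to burst, and each accepted call spends one token.
type TokenBucket struct {
	mu     sync.Mutex
	qps    float64
	burst  float64
	tokens float64
	last   time.Time
}

// NewTokenBucket returns a bucket that starts full.
func NewTokenBucket(qps float64, burst int) *TokenBucket {
	return &TokenBucket{qps: qps, burst: float64(burst), tokens: float64(burst), last: time.Now()}
}

// TryAccept spends a token if one is available and reports whether it did.
func (tb *TokenBucket) TryAccept() bool {
	tb.mu.Lock()
	defer tb.mu.Unlock()
	now := time.Now()
	// Refill in proportion to the time elapsed since the last check.
	tb.tokens += now.Sub(tb.last).Seconds() * tb.qps
	if tb.tokens > tb.burst {
		tb.tokens = tb.burst
	}
	tb.last = now
	if tb.tokens < 1 {
		return false
	}
	tb.tokens--
	return true
}

// Accept blocks until a token is available.
func (tb *TokenBucket) Accept() {
	for !tb.TryAccept() {
		time.Sleep(time.Millisecond)
	}
}

// getVirtualMachine is a hypothetical stand-in for the Azure VM lookup.
func getVirtualMachine(name string) string { return "vm:" + name }

func main() {
	limiter := NewTokenBucket(5, 2) // assumed values: 5 requests/s, burst of 2
	start := time.Now()
	for i := 0; i < 4; i++ {
		limiter.Accept() // wait for a token before each API call
		fmt.Println(getVirtualMachine(fmt.Sprintf("node-%d", i)), time.Since(start).Round(100*time.Millisecond))
	}
}
```

Rate limiting caps the steady-state request rate, while backoff spaces out retries after failures, so the two mechanisms complement each other.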

k8s-github-robot added the size/M label and removed the size/S label Jul 15, 2017

jackfrancis changed the title ~~additional backoff in azure cloudprovider lb pool enforcement~~ additional backoff in azure cloudprovider Jul 15, 2017

k8s-github-robot added the approved label Jul 15, 2017

piosz removed their assignment Jul 17, 2017

k8s-ci-robot added the lgtm label Jul 19, 2017

k8s-github-robot merged commit ecadada into kubernetes:master Jul 20, 2017

jackfrancis mentioned this pull request Apr 12, 2021

REQUEST: New membership for jackfrancis kubernetes/org#2632

Closed

6 tasks
