Azure - ARM Read/Write rate limiting #59830

khenidak · 2018-02-13T20:21:02Z

What this PR does / why we need it:

Azure cloud provider currently runs with:

Single ARM rate limiter for both read [put/post/delete] and write operations, while ARM provide [different rates for read/write] (https://docs.microsoft.com/en-us/azure/azure-resource-manager/resource-manager-request-limits). This causes write operation to stop even if there is available write request quotas.
Cloud provider uses rate limiter's Accept() instead of TryAccept() This causes control loop to wait for prolonged tike in case of no request quota available for all requests even for those does not require ARM interaction. A case for that the Service control loop will wait for a prolonged time trying to create LoadBalancer service even though it can fail and work on the next service which is ClusterIP. This PR moves cloud provider tp TryAccept()

Which issue(s) this PR fixes:
Fixes # #58770

Special notes for your reviewer:
n/a

Release note:

- Separate current ARM rate limiter into read/write
- Improve control over how ARM rate limiter is used within Azure cloud provider

cc @jackfrancis (need your help carefully reviewing this one) @brendanburns @jdumars

…o az-ratelimit

khenidak · 2018-02-13T21:51:01Z

@jackfrancis @brendanburns This is ready for review now.

jackfrancis · 2018-02-14T00:32:39Z

pkg/cloudprovider/providers/azure/azure_client.go

-	az.rateLimiter.Accept()
+	err = createArmRateLimitErr(false, "VMGet")
+	if !az.rateLimiterReader.TryAccept() {
+		return


Educate me in golang. :)

By implicitly returning (simple return without params) both named return types, what are we returning as the value of result in the error case?

jackfrancis · 2018-02-14T00:40:09Z

pkg/cloudprovider/providers/azure/azure_client.go

@@ -181,7 +209,11 @@ func (az *azVirtualMachinesClient) Get(resourceGroupName string, VMName string,
 }

 func (az *azVirtualMachinesClient) List(resourceGroupName string) (result compute.VirtualMachineListResult, err error) {
-	az.rateLimiter.Accept()
+	err = createArmRateLimitErr(false, "VMList")


Could we optimize by putting these err constructor invocations inside the failure condition if block (as they are only used if the failure condition is met)?

jackfrancis · 2018-02-14T00:53:41Z

@khenidak Added a couple comments for thought. Overall lgtm, audited all the rate limit injections (esp. the write ones which require us to compose a zero value return value) and they look correct/sane.

khenidak · 2018-02-15T16:56:51Z

@jackfrancis updated with your code comments PTAL
@feiskyer I have had to merge the VMSS code. can you do a sanity check?

jackfrancis

lgtm

jdumars · 2018-02-15T17:42:12Z

@feiskyer if you're good with this implementation can you add the LGTM so we can get this merged?

brendandburns · 2018-02-15T18:13:08Z

pkg/cloudprovider/providers/azure/azure_client.go

@@ -33,6 +34,15 @@ import (
 	"k8s.io/client-go/util/flowcontrol"
 )

+// Creates an error for rate limiting errors
+func createArmRateLimitErr(isWrite bool, opName string) error {


Given that the purpose of this is to return this error in a channel, perhaps having this signature being:

Also nit, golang style says Arm should be ARM.

func createARMRateLimitError(isWrite bool, opName string) (err, chan error) {

ah, I actually see that you use this elsewhere, perhaps two functions:

func createARMRateLimitError(...) error {

and

func createARMRateLimitErrChannel() chan error {

khenidak · 2018-02-15T20:44:32Z

@brendanburns code comments addressed. Thanks

feiskyer

/lgtm

k8s-ci-robot · 2018-02-15T23:35:41Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: feiskyer, khenidak

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~pkg/cloudprovider/providers/azure/OWNERS~~ [feiskyer,khenidak]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

k8s-github-robot · 2018-02-16T00:43:36Z

Automatic merge from submit-queue (batch tested with PRs 59939, 59830). If you want to cherry-pick this change to another branch, please follow the instructions here.

khenidak added 2 commits February 12, 2018 21:29

Merge branch 'master' of https://github.com/kubernetes/kubernetes int…

82e1fda

…o az-ratelimit

WIP - create read/writer rate limiter

5bf6b0f

k8s-ci-robot requested review from andyzhangx, ingvagabund, lavalamp, luxas and pwittrock February 13, 2018 20:21

k8s-github-robot added needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. kind/api-change Categorizes issue or PR as related to adding, removing, or otherwise changing an API labels Feb 13, 2018

khenidak added 2 commits February 13, 2018 20:56

Configuration changes

a86062c

Merge branch 'master' of https://github.com/kubernetes/kubernetes int…

9e4f144

…o az-ratelimit

k8s-ci-robot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Feb 13, 2018

k8s-github-robot removed needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. kind/api-change Categorizes issue or PR as related to adding, removing, or otherwise changing an API labels Feb 13, 2018

khenidak changed the title ~~WIP: Azure - ARM Read/Write rate limiting~~ Azure - ARM Read/Write rate limiting Feb 13, 2018

k8s-ci-robot removed the do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress. label Feb 13, 2018

khenidak mentioned this pull request Feb 13, 2018

Relevant PRs/Issues khenidak/ultra-k8s#1

Open

fix json tag on Azure.config

f909859

jackfrancis reviewed Feb 14, 2018

View reviewed changes

k8s-github-robot added the needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. label Feb 14, 2018

Code review + resync VMSS changes

53036bf

k8s-github-robot removed the needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. label Feb 15, 2018

jackfrancis approved these changes Feb 15, 2018

View reviewed changes

brendandburns reviewed Feb 15, 2018

View reviewed changes

code review: create err chan via helper

38a9fc3

k8s-ci-robot added size/L Denotes a PR that changes 100-499 lines, ignoring generated files. and removed size/XL Denotes a PR that changes 500-999 lines, ignoring generated files. labels Feb 15, 2018

feiskyer approved these changes Feb 15, 2018

View reviewed changes

k8s-ci-robot assigned feiskyer Feb 15, 2018

k8s-ci-robot added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Feb 15, 2018

k8s-github-robot merged commit 271c267 into kubernetes:master Feb 16, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Azure - ARM Read/Write rate limiting #59830

Azure - ARM Read/Write rate limiting #59830

khenidak commented Feb 13, 2018 •

edited

Loading

khenidak commented Feb 13, 2018

jackfrancis Feb 14, 2018

jackfrancis Feb 14, 2018

jackfrancis commented Feb 14, 2018

khenidak commented Feb 15, 2018

jackfrancis left a comment

jdumars commented Feb 15, 2018

brendandburns Feb 15, 2018

brendandburns Feb 15, 2018

khenidak commented Feb 15, 2018

feiskyer left a comment

k8s-ci-robot commented Feb 15, 2018

k8s-github-robot commented Feb 16, 2018

Azure - ARM Read/Write rate limiting #59830

Azure - ARM Read/Write rate limiting #59830

Conversation

khenidak commented Feb 13, 2018 • edited Loading

khenidak commented Feb 13, 2018

jackfrancis Feb 14, 2018

Choose a reason for hiding this comment

jackfrancis Feb 14, 2018

Choose a reason for hiding this comment

jackfrancis commented Feb 14, 2018

khenidak commented Feb 15, 2018

jackfrancis left a comment

Choose a reason for hiding this comment

jdumars commented Feb 15, 2018

brendandburns Feb 15, 2018

Choose a reason for hiding this comment

brendandburns Feb 15, 2018

Choose a reason for hiding this comment

khenidak commented Feb 15, 2018

feiskyer left a comment

Choose a reason for hiding this comment

k8s-ci-robot commented Feb 15, 2018

k8s-github-robot commented Feb 16, 2018

khenidak commented Feb 13, 2018 •

edited

Loading