API server should limit the number of concurrent requests it processes #5866
Comments
+1!
The good news/bad news on this is that there is a rate limiter in the API server (https://github.com/GoogleCloudPlatform/kubernetes/blob/master/pkg/apiserver/handlers.go#L70), but its defaults seem too high: 10 qps average, 200 burst (https://github.com/GoogleCloudPlatform/kubernetes/blob/master/cmd/kube-apiserver/app/server.go). We could start by lowering the burst level to something smaller, like 30. It may also be interesting to differentiate between GET and PUT/POST requests.
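For illustration, here is a minimal token-bucket sketch of that kind of limiting, using golang.org/x/time/rate rather than the Kubernetes util package referenced above; the qps/burst values mirror the defaults and the suggested lower burst, and the handler names are made up for the example:

```go
package main

import (
	"fmt"
	"net/http"

	"golang.org/x/time/rate"
)

// withRateLimit wraps a handler with a token bucket: requests are
// admitted at qps on average with bursts of up to burst, and rejected
// with 429 once the bucket is empty.
func withRateLimit(handler http.Handler, qps float64, burst int) http.Handler {
	limiter := rate.NewLimiter(rate.Limit(qps), burst)
	return http.HandlerFunc(func(w http.ResponseWriter, r *http.Request) {
		if !limiter.Allow() {
			http.Error(w, "rate limit exceeded", http.StatusTooManyRequests)
			return
		}
		handler.ServeHTTP(w, r)
	})
}

func main() {
	ok := http.HandlerFunc(func(w http.ResponseWriter, r *http.Request) {
		fmt.Fprintln(w, "ok")
	})
	// 10 qps average with the lower burst of 30 suggested above.
	http.ListenAndServe(":8080", withRateLimit(ok, 10, 30))
}
```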
I think we need limits on the number of outstanding requests, not on QPS.
@quinton-hoole -> A max outstanding limit would be useful, but it may force us to have separate buckets for different types of requests, so that a small number of long-running requests doesn't cause the API server to reject short requests. @smarterclayton -> Heard that you might be looking into adding support for flow control (or something like it?). We may need it soon to help avoid bugs like runaway node status updates (see issue #5864).
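A hedged sketch of that "separate buckets" idea: outstanding requests are capped per class, so a few long-running requests (watches, log streaming, proxying) can't crowd out short ones. The classification below is an illustrative assumption, not the apiserver's actual logic:

```go
package middleware

import (
	"net/http"
	"strings"
)

// isLongRunning is an assumed classifier for the example: watches, log
// streaming, and proxying hold connections open for a long time.
func isLongRunning(r *http.Request) bool {
	return r.URL.Query().Get("watch") == "true" ||
		strings.Contains(r.URL.Path, "/proxy/") ||
		strings.HasSuffix(r.URL.Path, "/log")
}

// WithMaxInFlight caps outstanding requests per class using buffered
// channels as counting semaphores, rejecting (rather than queueing)
// requests once a bucket is full so the server sheds load.
func WithMaxInFlight(handler http.Handler, shortLimit, longLimit int) http.Handler {
	shortSem := make(chan struct{}, shortLimit)
	longSem := make(chan struct{}, longLimit)
	return http.HandlerFunc(func(w http.ResponseWriter, r *http.Request) {
		sem := shortSem
		if isLongRunning(r) {
			sem = longSem
		}
		select {
		case sem <- struct{}{}: // acquire a slot in this class's bucket
			defer func() { <-sem }()
			handler.ServeHTTP(w, r)
		default: // bucket full: reject immediately
			http.Error(w, "too many outstanding requests", http.StatusTooManyRequests)
		}
	})
}
```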
I had not - the things we were looking at were the bastion changes to the apiserver, which will break logs and proxying out into their own REST resources so they can be granularly controlled. Being able to timebox and classify requests differently would be good.
@smarterclayton - Ah, np, I probably misheard.
What are our plans for handling this client side? Do we want clients to handle the "you're being throttled" error themselves, or do we plan to handle it in the client library? I'm asking because currently the kubelet does not have a 'short' retry loop: it tries to report NodeStatus every X (2) seconds, whether it succeeds or not. Nor do we have a side channel for heartbeat messages. This means that if the API server is under heavy load, NodeStatus reporting will be throttled with non-negligible probability, and we may incorrectly mark a Node as unreachable. Currently it's not a huge problem, because X << Y, where Y is the time we wait before deciding a Node is unreachable, but with the effort around #5864 it will probably get worse.
The client library should have options for retry on client.Config, including the ability to disable the behavior. Automatic retry is probably an "opt-out" behavior.
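As a rough illustration of that opt-out retry idea (the RetryConfig fields and DoWithRetry helper below are hypothetical names for the example, not the real client.Config API):

```go
package client

import (
	"net/http"
	"strconv"
	"time"
)

// RetryConfig holds hypothetical knobs of the kind suggested above;
// DisableRetry lets a caller opt out of the automatic behavior.
type RetryConfig struct {
	DisableRetry bool
	MaxRetries   int
	BaseDelay    time.Duration
}

// DoWithRetry retries throttled (429) responses with exponential
// backoff, honoring a Retry-After header when the server sends one.
// It assumes a body-less request (e.g. GET), since a consumed request
// body cannot simply be re-sent.
func DoWithRetry(c *http.Client, req *http.Request, cfg RetryConfig) (*http.Response, error) {
	resp, err := c.Do(req)
	if cfg.DisableRetry {
		return resp, err
	}
	for attempt := 0; attempt < cfg.MaxRetries; attempt++ {
		if err != nil || resp.StatusCode != http.StatusTooManyRequests {
			return resp, err
		}
		delay := cfg.BaseDelay << uint(attempt) // exponential backoff
		if s := resp.Header.Get("Retry-After"); s != "" {
			if secs, perr := strconv.Atoi(s); perr == nil {
				delay = time.Duration(secs) * time.Second
			}
		}
		resp.Body.Close()
		time.Sleep(delay)
		resp, err = c.Do(req)
	}
	return resp, err
}
```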
@lavalamp says there are people working on rate limiting client connections to the API server.
I have in the past adjusted the rate limiter to support the creation of large clusters. I also added the […]
I'm closing this as fixed, given the original subject. If we want to expand client-side support for retry, please open a dedicated issue. (Also note that QPS-based throttling was added to the client recently too.)
Related to issue #5865: the API server should have a cap on the number of outstanding operations/requests it is handling, and return an error once that cap is exceeded. A simple semaphore is one way to implement this.
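The "simple semaphore" suggested here might look like the following minimal sketch, a single-bucket version of the per-class sketch earlier in the thread; WithCap and maxInFlight are illustrative names:

```go
package middleware

import "net/http"

// WithCap bounds the number of in-flight requests with a buffered
// channel used as a counting semaphore, returning an error once the
// cap is reached instead of queueing more work.
func WithCap(handler http.Handler, maxInFlight int) http.Handler {
	sem := make(chan struct{}, maxInFlight)
	return http.HandlerFunc(func(w http.ResponseWriter, r *http.Request) {
		select {
		case sem <- struct{}{}: // acquire
			defer func() { <-sem }() // release when the request completes
			handler.ServeHTTP(w, r)
		default: // cap reached
			http.Error(w, "too many outstanding requests", http.StatusTooManyRequests)
		}
	})
}
```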