Unidling proposal #3247
Conversation
> ## Lexicon
>
> - **Destination pods**: the backend pods (or other backends as necessary) that a service points to
Is this different from the "endpoints" of a "service"? If so, clarify. If not, use kubernetes-speak.
this belongs in …
> ### Modifications to kube-proxy
>
> The kube-proxy must be modified to create events when idled services are requested and make an attempt …
People will sometimes want to provide their own load balancers in place of kube-proxy, if kube-proxy does not provide enough features, or if they want to use something like https://cloud.google.com/compute/docs/load-balancing/.
So, kube-proxy should not be the only thing that can create "demand" events.
@erictune agree with your statement that kube-proxy should not be the only thing that can create demand events; I will call that out explicitly when I take the next pass through.
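As a rough illustration of what emitting such a demand event could look like from kube-proxy's side (or any replacement load balancer), here is a sketch using client-go's event recorder; the `NeedPods` reason string and the `proxy` component name are assumptions, not anything specified in this proposal:

```go
package main

import (
	v1 "k8s.io/api/core/v1"
	"k8s.io/client-go/kubernetes"
	"k8s.io/client-go/kubernetes/scheme"
	typedcorev1 "k8s.io/client-go/kubernetes/typed/core/v1"
	"k8s.io/client-go/tools/record"
)

// newDemandRecorder wires an EventRecorder that writes events through the
// API server, so any component watching events can react to them.
func newDemandRecorder(client kubernetes.Interface) record.EventRecorder {
	broadcaster := record.NewBroadcaster()
	broadcaster.StartRecordingToSink(&typedcorev1.EventSinkImpl{
		Interface: client.CoreV1().Events(""),
	})
	return broadcaster.NewRecorder(scheme.Scheme, v1.EventSource{Component: "proxy"})
}

// reportDemand emits a hypothetical "NeedPods" event against the idled
// service; an unidler component would watch for this reason.
func reportDemand(recorder record.EventRecorder, svc *v1.Service) {
	recorder.Eventf(svc, v1.EventTypeNormal, "NeedPods",
		"Traffic arrived for idled service %s/%s", svc.Namespace, svc.Name)
}
```

Anything that can write an Event through the API server could emit the same signal, which is what keeps kube-proxy from being the only possible source of demand events.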
Mostly looks good. A couple of comments inline. I'll suggest that @bgrant0607 and @thockin may optionally want to comment. Remaining comments below are topics for discussion but needn't block getting this design merged:

I don't quite understand why you want to separate autoscaling from unidling. They seem like quite closely related problems. Is that just a short-term thing to keep the problem scope manageable, or are you arguing that they should always be separate? If the latter, I didn't follow the argument.

Another thing that I wondered about is whether we will be able to unidle pods in a couple of seconds reliably. If a pod needs an image that has not been cached and takes a long time to pull and unpack, then it might not start in a few seconds. If we cannot do that reliably, then we may need to have the pods be in some intermediate state between non-existent and ready-to-serve. That could change the design of unidling, I think. Thoughts?

Yet another thing I wondered about was how this interacts, if at all, with (proposed) readiness checks: #620. Do we need to buffer the packet until the created endpoint is not just live but also ready to serve? Doesn't the thrashing logic need to distinguish the following types of conditions:

The last thing I wondered about is QoS/DoS. The kube-proxies are shared, and need to have bounded memory usage. A burst of traffic to one idled service should not hog all the buffer space so that other innocent idled services have no buffering space left.
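A hedged sketch of one way to address that bounded-memory concern: give each idled service a fixed-capacity queue and shed anything beyond it, so one bursty service cannot consume buffer space belonging to others. All names and sizes here are illustrative, not part of the proposal:

```go
package main

import (
	"errors"
	"net"
	"sync"
)

// pendingConns holds at most capPerService connections per idled service;
// anything beyond that is rejected so one hot service cannot starve the rest.
type pendingConns struct {
	mu    sync.Mutex
	queue map[string]chan net.Conn // keyed by namespace/name
	cap   int
}

var errBufferFull = errors.New("per-service buffer full")

func newPendingConns(capPerService int) *pendingConns {
	return &pendingConns{queue: make(map[string]chan net.Conn), cap: capPerService}
}

func (p *pendingConns) hold(service string, c net.Conn) error {
	p.mu.Lock()
	q, ok := p.queue[service]
	if !ok {
		q = make(chan net.Conn, p.cap)
		p.queue[service] = q
	}
	p.mu.Unlock()

	select {
	case q <- c: // buffered; replayed once endpoints are ready
		return nil
	default: // buffer full: shed load instead of growing without bound
		c.Close()
		return errBufferFull
	}
}
```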
I'm sort of confused how we can have a discussion about un-idling without also discussing idling. What does it mean to be idle? Does it imply a new kind of controller that manipulates the replication controller's count? Are idle pods dead, or are they started and frozen (a la CRIU)?
I have a super-naive question. Are events intended to be used as an RPC mechanism between Kubernetes components? I thought they were just for logging/debugging, but they seem to be used here for communication from proxy to unidler to request unidling of a pod. I'm not saying this is a bad idea, it just surprised me. Are we already using this pattern elsewhere?
Great question.
@davidopp @thockin AFAIK, no components are using events for messaging at the moment, but the idea has been proposed in several scenarios, and there is also an expectation that users could use them in this way. We could decide not to use them, but would need to define hooks to call out in addition to generating the events.
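If events were used as the trigger, the unidler side might look roughly like this sketch, which watches for the hypothetical `NeedPods` reason and scales the matching replication controller back up; the reason string, the service-to-controller naming convention, and the target replica count are all assumptions:

```go
package main

import (
	"context"
	"fmt"

	corev1 "k8s.io/api/core/v1"
	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
	"k8s.io/client-go/kubernetes"
)

// watchForDemand reacts to hypothetical "NeedPods" events by scaling the
// replication controller (assumed here to share the service's name) back up.
func watchForDemand(ctx context.Context, client kubernetes.Interface) error {
	w, err := client.CoreV1().Events("").Watch(ctx, metav1.ListOptions{
		FieldSelector: "reason=NeedPods", // assumed reason emitted by the proxy
	})
	if err != nil {
		return err
	}
	defer w.Stop()

	for e := range w.ResultChan() {
		event, ok := e.Object.(*corev1.Event)
		if !ok {
			continue
		}
		ns, name := event.InvolvedObject.Namespace, event.InvolvedObject.Name
		scale, err := client.CoreV1().ReplicationControllers(ns).GetScale(ctx, name, metav1.GetOptions{})
		if err != nil {
			continue
		}
		if scale.Spec.Replicas == 0 {
			scale.Spec.Replicas = 1 // wake one replica; autoscaling can take over from there
			if _, err := client.CoreV1().ReplicationControllers(ns).UpdateScale(ctx, name, scale, metav1.UpdateOptions{}); err != nil {
				fmt.Printf("unidle %s/%s failed: %v\n", ns, name, err)
			}
		}
	}
	return nil
}
```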
Rather than buffering requests indefinitely, is Retry-After a possibility?
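For HTTP traffic, that could be as simple as answering 503 with a `Retry-After` header while the backend wakes up, rather than holding the request; a minimal sketch, with the readiness signal left abstract:

```go
package main

import "net/http"

// unidlingHandler asks clients to come back shortly instead of buffering
// their requests while the idled backend is being woken up.
func unidlingHandler(backendReady func() bool, next http.Handler) http.Handler {
	return http.HandlerFunc(func(w http.ResponseWriter, r *http.Request) {
		if !backendReady() {
			w.Header().Set("Retry-After", "2") // seconds; tune to expected wake-up latency
			http.Error(w, "service is waking up, retry shortly", http.StatusServiceUnavailable)
			return
		}
		next.ServeHTTP(w, r)
	})
}
```

This trades buffering state in the proxy for a retry burden on well-behaved clients, which also sidesteps the per-service buffer-exhaustion concern above, at least for HTTP.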
> ## Constraints and Assumptions
>
> - Unidling should be orthogonal to autoscaling and idling
Like @erictune, I don't understand the reason for this.
@erictune @bgrant0607 We decided to keep them separate for now to keep the problem space simpler. They are definitely closely related and I can imagine them being handled in the future by the same subsystem or component.
Going to take a pass through this at some point in the next day or two to incorporate comments.
WIP: Address feedback 1
@erictune @thockin @bgrant0607 @pweil- @abhgupta Circling back after having my attention diverted for a long while on other matters. This is a bit stale now, and I no longer personally think it makes sense to address it in a vacuum. I think it's probably best to do one of the following:
Once #2863 is merged we can fully flesh out links and intersections with the auto-scaling proposal.
Thanks for your pull request. It looks like this may be your first contribution to a Google open source project, in which case you'll need to sign a Contributor License Agreement (CLA) at https://cla.developers.google.com/. If you've already signed a CLA, it's possible we don't have your GitHub username or you're using a different email address. Check the information on your CLA or see this help article on setting the email on your git commits. Once you've done that, please reply here to let us know. If you signed the CLA as a corporation, please let us know the company's name.
I predict that it may take a while longer to reach consensus on #2863.
Is this PR active? If not, please close it according to https://github.com/GoogleCloudPlatform/kubernetes/blob/master/docs/devel/pull-requests.md
Closing according to the mentioned policy.
We should give the user the flexibility to set min=0, especially now that custom-metrics scaling is coming into the picture. We have a use case where we want to scale our deployments from zero to n based on queue length. So when the target of the HPA is not related to pod existence, we should give the user the option to set min=0.
@alok87 If I understand correctly, what you're asking for is just a change in the HPA, not unidling as described by this proposal. Can you open a new issue with a feature request for sig-autoscaling?
Sure, will do that.
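For context, a hedged sketch of what the requested scale-from-zero HPA might look like with the current `autoscaling/v2` Go types; the `HPAScaleToZero` feature-gate requirement, the `queue_length` external metric, and every name and threshold below are assumptions, not anything settled in this thread:

```go
package main

import (
	autoscalingv2 "k8s.io/api/autoscaling/v2"
	"k8s.io/apimachinery/pkg/api/resource"
	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
)

// scaleFromZeroHPA sketches an HPA whose minimum is zero, driven by an
// external queue-length metric rather than per-pod resource usage.
// minReplicas=0 requires the HPAScaleToZero feature gate.
func scaleFromZeroHPA() *autoscalingv2.HorizontalPodAutoscaler {
	zero := int32(0)
	return &autoscalingv2.HorizontalPodAutoscaler{
		ObjectMeta: metav1.ObjectMeta{Name: "worker", Namespace: "default"},
		Spec: autoscalingv2.HorizontalPodAutoscalerSpec{
			ScaleTargetRef: autoscalingv2.CrossVersionObjectReference{
				APIVersion: "apps/v1", Kind: "Deployment", Name: "worker",
			},
			MinReplicas: &zero, // the flexibility requested above
			MaxReplicas: 10,
			Metrics: []autoscalingv2.MetricSpec{{
				Type: autoscalingv2.ExternalMetricSourceType,
				External: &autoscalingv2.ExternalMetricSource{
					Metric: autoscalingv2.MetricIdentifier{
						Name: "queue_length", // hypothetical external metric
					},
					Target: autoscalingv2.MetricTarget{
						Type:  autoscalingv2.ValueMetricType,
						Value: resource.NewQuantity(30, resource.DecimalSI),
					},
				},
			}},
		},
	}
}
```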