Provide a way to relabel controllers and their pods #36897
cc @mfojtik
@Kargakis Sorry, quick comments for now. We may want to think about a subresource, similar to /scale, that could do this uniformly across workload APIs. Maybe we could even extend /scale to be a generic workload-controller interface, but I haven't thought about that much yet.
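For context, a minimal sketch of how the existing /scale subresource is exercised; the Deployment name "web" and namespace "default" are hypothetical, and the apps/v1 path shown postdates this discussion (it was extensions/v1beta1 at the time). A relabel subresource could be shaped the same way:

```sh
# Scaling goes through the /scale subresource rather than a full update of the
# workload object; "web" and "default" are placeholder names.
kubectl scale deployment web --replicas=5

# The subresource can also be read directly:
kubectl get --raw /apis/apps/v1/namespaces/default/deployments/web/scale
```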
cc @erictune
@bgrant0607 agreed that this should be similar to /scale; not sure about making it generic.
Use case: easier blue-green deployments: openshift/origin#11954
See also comments here: one pattern that's safe and doesn't require complex orchestration when introducing a new attribute is to modify the selectors of existing workload controllers to select pods without the new label key, and then add the key only to new pod templates. This isn't possible with RCs, however, since they don't support the more powerful set-based selectors.
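A sketch of that pattern with hypothetical names (existing label app=web, new key track): set-based selectors can require the absence of a key, which an RC's equality-only selector map cannot express.

```sh
# List the pods that do NOT yet carry the new "track" key. The "!track"
# requirement is set-based selector syntax and has no equivalent in an RC's
# plain key/value selector map.
kubectl get pods -l 'app=web,!track'

# A Deployment/ReplicaSet selector can express the same requirement
# declaratively (shown here only as a fragment; whether the selector can still
# be changed depends on the API version, see the later comments in this thread):
#   "selector": {
#     "matchLabels": {"app": "web"},
#     "matchExpressions": [{"key": "track", "operator": "DoesNotExist"}]
#   }
```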
Would a relabel spin down old pods and spin up new ones, or would existing pods be reused? I don't know if this is by design, but at present it isn't really possible to store pod state in labels, as changing the RC causes new pods to be created. Having state in pod labels is useful for services, since it allows them to select pods based on their state.
Changing the RC does not cause new pods to be created; you are probably thinking of Deployments / DeploymentConfigs. An RC will create new pods only if it is scaled up or old pods die. Relabeling would merely update all old pods to use the new label(s).
If the RC selector is changed, the system would suddenly see 0 pods and try to start scaling up. Since existing pods cannot have their labels changed in an atomic operation when the RC selector is changed, there is possible contention.
The RC selector should change as the last part of the "transaction", i.e. once all old pods and the RC pod template labels have changed. If any of those earlier actions fails, we won't change the selector. I guess once we start using transactions in etcd3, this won't be a problem.
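A client-side sketch of that ordering, using hypothetical names (RC "web", existing selector app=web, new label track=stable); it only works because RC selectors are mutable.

```sh
# 1. Relabel the existing pods first; they keep matching the current selector
#    since we are only adding a key.
kubectl label pods -l 'app=web' track=stable

# 2. Add the label to the RC's pod template so newly created pods carry it too.
kubectl patch rc web -p '{"spec":{"template":{"metadata":{"labels":{"track":"stable"}}}}}'

# 3. Only after steps 1 and 2 succeed, add the key to the selector (a strategic
#    merge patch merges it into the existing map). If anything above failed,
#    the selector is left untouched.
kubectl patch rc web -p '{"spec":{"selector":{"track":"stable"}}}'
```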
Don't assume transactions will help us - there are limitations that may prevent some of these scenarios from being possible.
@bprashanth because we had discussed pod labels changing to track state (like leader). I think the atomicity challenge remains even with etcd3, so we should at least consider whether this is something that we might indicate another way.
I think at this point atomic multi-object operations are an explicit non-goal of the Kubernetes API system.
Was that officially decided somewhere? I think there are some reasonable use cases for multi-object transactions; for example, the way we do pod binding today is not great because it only updates the pod, not the node, so you can't actually detect conflicts. Even just being able to bump the resource version on the node and the pod atomically, while only actually mutating the NodeName in the pod, would be useful. But I suspect there are other useful scenarios (whether the one here is one of those, I don't know). We were afraid of the implications for sharding etcd, but we never actually needed to shard etcd (yet)...
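For reference, a sketch of the binding call mentioned above, with hypothetical pod/node names: the request only sets the Pod's spec.nodeName, and nothing is written to the Node object, which is why a conflicting change on the node side can't be detected through resource versions.

```sh
# The scheduler normally POSTs this object to the pods/<name>/binding
# subresource; kubectl can submit it via the core v1 bindings resource.
# "my-pod" and "node-1" are placeholder names.
kubectl create -f - <<'EOF'
apiVersion: v1
kind: Binding
metadata:
  name: my-pod
target:
  apiVersion: v1
  kind: Node
  name: node-1
EOF
```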
Doing a transaction under the covers of a particular resource is probably ok. Trying to do it across resource types via the public API is probably not.
If we're going to shard on anything, it's going to be resource types, and it's happening O(soon) for the API extension work. I doubt a third party extension will be allowed to have root access to the etcd cluster, and in any case it's easier if they have their own storage.
Right. API server federation means the set of resources over which a transaction is even possible is complex, since different groups may be stored in different etcds or even different storage systems. Even if we ultimately do decide to allow a few curated transactions, I'm quite confident it won't happen in 2017.
It's off topic for this issue, so please don't discuss it further here, but I agree with smarterclayton and lavalamp that we should not support atomic transactions across multiple resources, and we have documented that. OTOH, I could imagine some operation similar to rollback, which performs the relabeling orchestration server-side.
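To make the rollback analogy concrete: rollback is already a small client request that triggers server-side orchestration, and a relabel operation could be triggered the same way. Only the rollout command below is real; the relabel endpoint is purely hypothetical, with placeholder names throughout.

```sh
# Real today: the client asks for a rollback and the controller performs the
# orchestration server-side ("web" is a placeholder name).
kubectl rollout undo deployment/web

# Hypothetical sketch only (no such subresource exists): a relabel request
# could similarly hand the whole orchestration to the controller, e.g.
#   POST /apis/.../namespaces/default/replicasets/web/relabel
#   {"addLabels": {"track": "stable"}}
```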
See also #14894
Issues go stale after 90d of inactivity. Mark the issue as fresh with /remove-lifecycle stale to prevent it from auto-closing. If this issue is safe to close now please do so with /close. Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
Stale issues rot after 30d of inactivity. Mark the issue as fresh with /remove-lifecycle rotten. If this issue is safe to close now please do so with /close. Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/remove-lifecycle rotten |
The Deployment controller stopped relabeling adopted ReplicaSets and Pods after #61615 got merged; selectors became immutable in the apps/v1 endpoints. Can we close this issue now that it's no longer needed by the workload controllers?
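For reference, a sketch of the immutability mentioned here (names hypothetical): apps/v1 rejects any change to spec.selector after creation, so relabeling adopted objects by rewriting selectors is no longer an option there.

```sh
# Attempting to change the selector of an apps/v1 Deployment is rejected by
# validation ("field is immutable"); only the pre-apps/v1 API versions
# allowed this. "web" is a placeholder name.
kubectl patch deployment web --type=merge \
  -p '{"spec":{"selector":{"matchLabels":{"track":"stable"}}}}'
```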
I agree we are unlikely to get to this, and permitting it would increase complexity of object management.
Currently, the deployment controller relabels replica sets and pods when it needs to adopt a replica set for a deployment. It would be more convenient to have a replica set endpoint that would force the replica set controller to do all the relabeling (change pod template labels, update old pods, change selector). The implementation should be similar to how the rollback spec works for Deployments.
It would help:
kubectl set selector, from Add new command "kubectl set selector" #28949, currently enabled only for Services. Eventually we could enable it for controllers by using the new endpoint (see the sketch below).
@kubernetes/api-review-team @kubernetes/sig-api-machinery @ncdc @liggitt @bgrant0607
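A sketch of the Service-only command referenced above, with a hypothetical Service name and labels; the proposal is to make the equivalent possible for controllers via the new endpoint.

```sh
# kubectl set selector is currently wired up for Services only: this points the
# Service "frontend" at the pods labeled track=blue (e.g. a blue/green switch).
kubectl set selector service frontend 'track=blue'
```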