Consistently support graceful and immediate termination for all objects #1535

bgrant0607 · 2014-10-02T02:49:35Z

Issue discussed in #103, #1325, #1445, and other issues/PRs.

Our API isn't consistent on clean/graceful shutdown vs. immediate termination of our objects. We should support both modes, and in a consistent fashion. One mode should be the default, and the other should be available via a URL parameter on DELETE or custom verb (e.g., /stop, for graceful shutdown). Graceful shutdown should accept a timeout (as discussed in lifecycle hook PRs) and reason (#1462) as parameters.

Cleanly turning down a replication controller currently requires external orchestration: resizing it to 0 and waiting for pods to be deleted prior to deleting the replication controller itself.

On the other hand, by default, individual pods are supposed to gracefully terminate, executing their PreStop handlers, SIGTERM handlers, and, in the future, PostStop handlers.

We should define what clean shutdown would mean for a service (wait for all targeted pods to be deleted?).

However, there are definitely occasions when one would not want graceful shutdown (e.g., shutting down a test deployment), and we should support that.

The text was updated successfully, but these errors were encountered:

ghodss · 2014-10-02T07:36:35Z

Big +1 on implementing this and making it a generic principle across resources. Definition and logic for graceful vs non-graceful termination for each resource belongs in the server, not in client pieces like "kubecfg stop."

bgrant0607 · 2014-12-10T18:14:54Z

Thinking about the CLI perspective, I could imagine 3 operations:

Raw, brute-force REST operation, not necessarily graceful nor clean: delete
Graceful shutdown: stop
Clean but ~immediate shutdown: cancel

bgrant0607 · 2014-12-19T18:55:21Z

Proposed approach: Add a "stop" or "shutdown" field to each object's spec, and then support a custom verb, e.g. /op/stop or /op/shutdown, to set the field.

bgrant0607 · 2014-12-19T18:56:14Z

I'd also like to add a suspend field to each object to make it stop making changes.

derekwaynecarr · 2015-01-25T20:19:29Z

Assuming we move forward with #3613, I would like to expose a pattern where I can effectively stop a Namespace, mark all of its content for deletion by some background controller, and then ultimately purge the Namespace resource. Reading through this thread and others, it's not clear we have consensus on the right pattern for this across resources. Is there a recommended pattern to follow that I can look to prototype? I feel like I need a separate resource (NamespaceTermination) that I can post to kick off the proper workflow as I tend to agree that a DELETE should remain a true delete.

derekwaynecarr · 2015-01-25T20:31:34Z

The XxxTermination resource could be protected via a separate policy so it's not confused with traditional PUT. Acceptance of the XxxTermination would toggle a Status field on the resource. A controller would see the resource marked for termination and perform all required cleanup. Upon completion, a client with proper Policy rights would send the DELETE to the Namespace resource. A delete would be accepted iff the resource was in a Terminated status, and the final resource removal would complete. In the interim, clients could do normal Get operations on the internal resources to track the purge to completion. This seems like the general flow I would look to prototype this week.

bgrant0607 · 2015-01-28T17:52:59Z

@derekwaynecarr My proposed approach was here: #1535 (comment). We may need to bikeshed about the names/paths, though. You could think of the "custom verb" as a synthetic control resource, such as XxxTermination.

I'm happy to have "stop" and "delete" be distinct. We can provide the ability to do both in a single operation in kubectl.

bgrant0607 · 2015-11-12T18:04:55Z

We need existence dependencies for cascading deletion in the server. Deployments generate ReplicaSets which generate Pods. Doing cleanup in the client will be fragile and unfriendly to non-kubectl clients.

cc @nikhiljindal @janetkuo @ironcladlou @ncdc

soltysh · 2015-12-04T12:45:12Z

/sub

bgrant0607 · 2016-04-26T21:58:11Z

Closing in favor of #19054, which is being implemented for 1.3.

0xmichalis · 2016-09-20T15:46:27Z

We have the garbage collector in place now but is there an umbrela issue or separate issues for making it work for all of the resources that need graceful deletion?

0xmichalis · 2016-09-20T15:47:08Z

We have the garbage collector in place now but is there an umbrela issue or separate issues for making it work for all of the resources that need graceful deletion?

Maybe #26120?

bgrant0607 added the area/api Indicates an issue on api area. label Oct 2, 2014

bgrant0607 added this to the v0.9 milestone Oct 4, 2014

bgrant0607 mentioned this issue Oct 8, 2014

Proposal for new kubecfg design (kubectl) #1325

Merged

bgrant0607 added area/app-lifecycle area/usability labels Oct 10, 2014

bgrant0607 mentioned this issue Oct 11, 2014

Job Controller #1624

Closed

bgrant0607 added the workload/workflow label Oct 14, 2014

This was referenced Nov 4, 2014

Kubectl can apply config files in bulk #1905

Closed

Deprecate kubecfg #2144

Closed

Separate the pod template from replicationController #170

Closed

bgrant0607 added the priority/backlog Higher priority than priority/awaiting-more-evidence. label Dec 3, 2014

bgrant0607 mentioned this issue Dec 8, 2014

Deleting pods and other resources with graceful shutdown #2789

Closed

bgrant0607 added the area/kubectl label Dec 10, 2014

bgrant0607 mentioned this issue Dec 19, 2014

Convert ReplicationController to a plugin (was ReplicationController redesign) #3058

Closed

bgrant0607 mentioned this issue Jan 7, 2015

Proposal for a new set of "porcelain" commands for kubectl with a cleaner user interface #3233

Closed

bgrant0607 mentioned this issue Jan 21, 2015

Resolve the form of special "verbs" in the api for v1beta3 #3652

Closed

j3ffml mentioned this issue Jan 21, 2015

Add a kubectl stop command #3662

Merged

bgrant0607 mentioned this issue Jan 29, 2015

Mark node to be decommissioned and act accordingly #3885

Closed

bgrant0607 added the sig/api-machinery Categorizes an issue or PR as relevant to SIG API Machinery. label Feb 5, 2015

bgrant0607 mentioned this issue Feb 5, 2015

Properly set object status #2726

Closed

goltermann removed this from the v0.9 milestone Feb 6, 2015

bgrant0607 modified the milestones: v0.9, v1.0 Feb 6, 2015

bgrant0607 mentioned this issue Oct 8, 2015

DaemonSet: requirements for graduation to beta and then to v1 #15310

Closed

4 tasks

bgrant0607 mentioned this issue Nov 21, 2015

Job patterns documentation #17345

Merged

soltysh mentioned this issue Nov 25, 2015

Cascading delete of deployment configs makes oc replace --force destructive openshift/origin#6068

Closed

bgrant0607 mentioned this issue Dec 2, 2015

Cascading delete deployment #18077

Merged

bgrant0607 mentioned this issue Jan 4, 2016

Server-side cascading deletion #19054

Closed

bgrant0607 mentioned this issue Jan 13, 2016

initial template and parameterization proposal #18215

Merged

bgrant0607 mentioned this issue Jan 22, 2016

kubectl apply should be able to delete objects missing from supplied config (aka prune) #19805

Closed

bgrant0607 modified the milestones: next-candidate, v1.2-candidate Jan 29, 2016

bgrant0607 removed the area/workflow label Feb 8, 2016

philips mentioned this issue Feb 9, 2016

locksmith: integrate with kubelet coreos/bugs#1112

Closed

bgrant0607 mentioned this issue Feb 12, 2016

Deleting jobs via the API leaves pods #20902

Closed

This was referenced Mar 3, 2016

Move kubectl client logic into server #12143

Closed

Updating pod-template-hash race #22451

Closed

bgrant0607 added sig/api-machinery Categorizes an issue or PR as relevant to SIG API Machinery. team/api labels Mar 19, 2016

bgrant0607 mentioned this issue Mar 31, 2016

[GarbageCollector] Adding a proposal for server-side cascading deletion #23656

Merged

bgrant0607 closed this as completed Apr 26, 2016

smarterclayton mentioned this issue Oct 6, 2016

Proposal - Pod safety and termination guarantees #34160

Closed

bgrant0607 mentioned this issue Jan 25, 2017

federation: kubectl --cascade should be false by default for federation #38897

Closed

irfanurrehman mentioned this issue Oct 31, 2017

federation: kubectl --cascade should be false by default for federation kubernetes-retired/federation#89

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Consistently support graceful and immediate termination for all objects #1535

Consistently support graceful and immediate termination for all objects #1535

bgrant0607 commented Oct 2, 2014

ghodss commented Oct 2, 2014

bgrant0607 commented Dec 10, 2014

bgrant0607 commented Dec 19, 2014

bgrant0607 commented Dec 19, 2014

derekwaynecarr commented Jan 25, 2015

derekwaynecarr commented Jan 25, 2015

bgrant0607 commented Jan 28, 2015

bgrant0607 commented Nov 12, 2015

soltysh commented Dec 4, 2015

bgrant0607 commented Apr 26, 2016

0xmichalis commented Sep 20, 2016

0xmichalis commented Sep 20, 2016

Consistently support graceful and immediate termination for all objects #1535

Consistently support graceful and immediate termination for all objects #1535

Comments

bgrant0607 commented Oct 2, 2014

ghodss commented Oct 2, 2014

bgrant0607 commented Dec 10, 2014

bgrant0607 commented Dec 19, 2014

bgrant0607 commented Dec 19, 2014

derekwaynecarr commented Jan 25, 2015

derekwaynecarr commented Jan 25, 2015

bgrant0607 commented Jan 28, 2015

bgrant0607 commented Nov 12, 2015

soltysh commented Dec 4, 2015

bgrant0607 commented Apr 26, 2016

0xmichalis commented Sep 20, 2016

0xmichalis commented Sep 20, 2016