Pod or Job lifecycle lacks a mechanism to define cleanup actions once you delete a pod #35183

Closed
hectorj2f opened this issue Oct 20, 2016 · 30 comments
Labels
area/app-lifecycle kind/feature Categorizes issue or PR as related to a new feature. lifecycle/frozen Indicates that an issue or PR should not be auto-closed due to staleness. priority/backlog Higher priority than priority/awaiting-more-evidence. sig/apps Categorizes an issue or PR as relevant to SIG Apps. sig/node Categorizes an issue or PR as relevant to SIG Node.

Comments

@hectorj2f

Our user story for a FEATURE REQUEST is the following: we have been developing distributed applications with systemd for a long time. To coordinate them, we used fleet as the distributed system that managed them all across our infrastructure. However, several months ago we decided to move all our systems to Kubernetes.

Despite all the amazing features Kubernetes offers for defining applications, we are missing one that we had when creating systemd units:

  • A mechanism to trigger deletion actions (cleanup tasks) once we delete a pod, e.g. the ExecStop and ExecStopPost instructions used to trigger actions when stopping or destroying a systemd unit.

We searched through all the features and couldn't find any available mechanism to trigger delete-or-cleanup actions. As an example, I have a pod that does some setup (like adding a key to etcd or setting some iptables rules); when I delete the pod, k8s doesn't provide a way to trigger an action (like removing the key from etcd or cleaning up the iptables rules). So we have to create additional pods to do that job, which is ugly from an app-lifecycle-definition point of view.

Therefore, I'd like to know whether there is any possibility of getting this functionality into Kubernetes. Also, if you know of any third-party project trying to achieve the same goal, it would be really helpful for us.

@pires
Contributor

pires commented Oct 27, 2016

@hectorj2f did you look at Termination of Pods, particularly step 5?
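
For reference, step 5 of that flow is where the preStop hook runs. A minimal sketch of wiring cleanup into it (the pod name, image, and script path are placeholders, not from this thread):

  apiVersion: v1
  kind: Pod
  metadata:
    name: cleanup-demo                  # placeholder name
  spec:
    terminationGracePeriodSeconds: 60   # the hook must finish within the grace period
    containers:
    - name: app
      image: example/app:latest         # placeholder image
      lifecycle:
        preStop:
          exec:
            # e.g. remove the etcd key and flush the iptables rules the app created
            command: ["/bin/sh", "-c", "/bin/cleanup.sh"]

Note that the hook only runs when the kubelet stops the container gracefully; it is not a guaranteed cleanup path.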

@bgrant0607 bgrant0607 added area/app-lifecycle sig/node Categorizes an issue or PR as relevant to SIG Node. team/ux and removed area/kubectl team/cluster labels Nov 17, 2016
@bgrant0607
Member

cc @smarterclayton

@pwittrock pwittrock added the priority/backlog Higher priority than priority/awaiting-more-evidence. label Nov 17, 2016
@smarterclayton
Contributor

We've hit this when looking at how to do image builds using a container. Ideally, we'd run a container in the pod to set up the image state, then commit and push that image in a follow-on step. But that requires a more complex lifecycle than our current flow. We had concerns about adding post-containers or post-stop hooks when we discussed init containers.

@hectorj2f
Author

@smarterclayton What are those concerns?

With the current lifecycle, either you extend the container to clean up after itself when the PreStop hook of the pod lifecycle is called (which is quite ugly when those operations aren't related to the container logic :/ ), or you have to hack some pods to be scheduled once you run a delete operation so they do the cleanup. Both alternatives are a bit ugly in my opinion.

What is the blocker for adding this feature to the pod lifecycle?

@smarterclayton I'd like to help push this issue forward if needed.

@0xmichalis
Contributor

Lifecycle hooks for different objects (e.g. Deployments) have been discussed elsewhere:
#140
#3585
#14512

Custom strategies are also related to this: having the ability to customize the lifecycle of X means you can add your own hooks anywhere you want in the process. See #33545 (comment) for more information.

cc: @mfojtik @soltysh

@0xmichalis
Contributor

@kubernetes/sig-apps

@kow3ns
Member

kow3ns commented Dec 5, 2016

It sounds like what you are looking for is analogous to an init container, except that it runs after the pod is shut down. Would a feature like this, let's call it a de-init container, suit your requirements?
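
For illustration only, a rough sketch of what such a field might look like in a pod spec, mirroring initContainers (the deInitContainers field is hypothetical and does not exist in the Kubernetes API; images and scripts are placeholders):

  apiVersion: v1
  kind: Pod
  metadata:
    name: de-init-demo
  spec:
    initContainers:
    - name: setup
      image: example/setup:latest     # placeholder image
      command: ["/bin/register.sh"]   # e.g. add a key to etcd
    containers:
    - name: app
      image: example/app:latest       # placeholder image
    deInitContainers:                 # hypothetical field, not implemented
    - name: teardown
      image: example/setup:latest
      command: ["/bin/unregister.sh"] # undo whatever the init container set up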

@hectorj2f
Author

I think having something similar to init containers that is launched when the pod is terminated could make sense.

@smarterclayton
Contributor

smarterclayton commented Dec 16, 2016 via email

@hectorj2f
Author

I understand that adding destroy-containers for a deployment might imply adding a new state to the workflow. If that is necessary, I don't see a problem with it, but I understand your concerns.

Pod
  Spec
    Containers
    - OnShutdown:
        Exec /bin/cleanup.sh
        (LifecycleHook)

On the other hand, if we add a hook when deleting a deployment, my main concern is being able to use containers that are different from the containers that run in the deployment's pods. If that is possible, I'd be happy with your approach; but I believe that with your solution we aren't able to run any logic that lives outside the container itself, just as with the current termination hooks.

Doing so would require us to:

  • Add additional logic to the containers that run in the pods so they can clean up after themselves and after whatever their init container did.
  • Add a new postStop hook, because preStop might limit the cleanup operations that can be triggered in several scenarios.

@pigmej
Contributor

pigmej commented Dec 22, 2016

I can imagine some complications around that area.

How are we going to solve this problem:

  • the user defines a cleanup action (whatever we end up calling it)
  • the node hosting the pod goes down, so the pod is not able to run the cleanup action
  • the user then wants to delete the pod from the k8s system
    Obviously the pod hooks won't have been executed, so we end up with two different behaviors for the same operation.

Second problem:

  • the user defines a cleanup action
  • it crashes
    • should we proceed with the pod delete?
    • or retry the cleanup action instead? (how many times?)
  • what if the cleanup can't succeed because of some condition (delete -f ?)

@dhilipkumars

dhilipkumars commented Apr 14, 2017

cc kubernetes/community#483

@bboreham
Contributor

bboreham commented Apr 25, 2017

I have basically the same requirement, but for DaemonSet: I work on Weave Net which installs itself on every node via a DaemonSet. The install creates some side-effects on each node - network virtual devices, etc. - that a user would like to clear down if they decide to uninstall.

Users variously expect this to happen via kubectl delete or kubeadm reset, but currently there is no hook to run some uninstall code on each node.

We can't do an uninstall on a simple pod stop or delete - this will happen when the software is being upgraded, and destroying the pod network is very bad UX for an upgrade. We really need to know that the entire DaemonSet is being deleted.

@pigmej makes some good points, but it seems to me that users would accept the code making a best effort to clean up, rather than no effort at all as at present.

@cheburakshu

@bboreham kubeadm reset can be run individually on nodes. It is a common cleanup command for both as per this, but it does not delete the network interfaces as one would expect. One thing I observed is that even after running kubeadm reset on the node, the master still listed it as running. I don't know whether that was because of the un-deleted network.

@0xmichalis 0xmichalis added sig/apps Categorizes an issue or PR as relevant to SIG Apps. and removed team/ux (deprecated - do not use) labels Apr 26, 2017
@bboreham
Contributor

master listed it as running. I don't know if it was because of the un-deleted network.

No. It is because the code to remove a node was removed in #42713, @cheburakshu

@cheburakshu

@bboreham Will the master still schedule onto the node since it sees it? In my experience it did, and the pod was in a Pending state. Is that the correct behaviour?

@bboreham
Contributor

@cheburakshu I dare say it will; however that is off-topic for this issue. Per #42713 the system administrator has to remove the node. Suggest you open an issue against kubeadm if this doesn't suit.

@tgraf
Contributor

tgraf commented Apr 28, 2017

I have basically the same requirement, but for DaemonSet: I work on Weave Net which installs itself on every node via a DaemonSet. The install creates some side-effects on each node - network virtual devices, etc. - that a user would like to clear down if they decide to uninstall.

I have the same requirement. CNI plugins typically install files (binaries & a config file in /etc/cni/net.d/) in the host filesystem when the managing pod is first started as a DaemonSet.

I'm doing the install part using a PostStart hook. For the cleanup, I'm deleting the files in a PreStop hook. I'm not 100% positive yet whether this works reliably all the time, as the volume unmounting seems to happen in parallel with the PreStop.

In case this is not guaranteed, a naive thought is to consider a PreUnmount hook which would run before volume unmounting takes place.
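
A sketch of that pattern, assuming a hostPath mount of /etc/cni/net.d and placeholder names, images, and file paths: the container copies its CNI config in on PostStart and removes it on PreStop.

  apiVersion: apps/v1
  kind: DaemonSet
  metadata:
    name: cni-plugin                    # placeholder name
  spec:
    selector:
      matchLabels: {app: cni-plugin}
    template:
      metadata:
        labels: {app: cni-plugin}
      spec:
        containers:
        - name: install
          image: example/cni:latest     # placeholder image
          lifecycle:
            postStart:
              exec:
                command: ["/bin/sh", "-c", "cp /opt/cni/10-net.conf /host/etc/cni/net.d/"]
            preStop:
              exec:
                command: ["/bin/sh", "-c", "rm -f /host/etc/cni/net.d/10-net.conf"]
          volumeMounts:
          - name: cni-cfg
            mountPath: /host/etc/cni/net.d
        volumes:
        - name: cni-cfg
          hostPath:
            path: /etc/cni/net.d

As the next comment confirms, the preStop half of this is not guaranteed: volume teardown can proceed in parallel with the hook.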

@tgraf
Contributor

tgraf commented Apr 29, 2017

In case this is not guaranteed, a naive thought is to consider a PreUnmount hook which would run before volume unmounting takes place.

Confirmed that this is not guaranteed. If I sleep in the PreStop hook for 60 seconds and then attempt to delete files on mounted volumes, they are not deleted because the volume has already been unmounted.

@dhilipkumars

Hi @bboreham, @tgraf, @cheburakshu, could you please take a look at this proposal and provide more feedback? It appears to me that we could achieve this using deferContainers, which would not only allow us to isolate termination scripts from the main image but also provide more guarantees and control than preStop hooks.

@bboreham
Contributor

@dhilipkumars I couldn't see how that would allow me to know that the entire DaemonSet is being deleted. Conversely, I don't seem to need the additional isolation.

@dhilipkumars

@bboreham hmmm... you would probably also need this one, to know precisely why the pod is being brought down.

@fejta-bot

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

Prevent issues from auto-closing with an /lifecycle frozen comment.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or @fejta.
/lifecycle stale

@k8s-ci-robot k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Dec 23, 2017
@bgrant0607
Member

/remove-lifecycle stale
/lifecycle frozen

@k8s-ci-robot k8s-ci-robot added lifecycle/frozen Indicates that an issue or PR should not be auto-closed due to staleness. and removed lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. labels Jan 22, 2018
@bboreham
Contributor

How about using a finalizer?

In the case of a Pod, it would need permission to write to its own object, which is unorthodox.

In the case of a DaemonSet, the pods could all add themselves as finalizers to the DaemonSet, but that gets out of hand at scale. Add another pod to coordinate shutdown (an operator, if you like), and it feels like it could work.
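
A minimal sketch of the idea, assuming a custom finalizer name and an external controller (the coordinating operator mentioned above) that performs the per-node cleanup and then removes the finalizer so deletion can complete:

  apiVersion: apps/v1
  kind: DaemonSet
  metadata:
    name: node-agent                # placeholder name
    finalizers:
    - example.com/node-cleanup      # hypothetical finalizer; deletion blocks until it is removed
  spec:
    selector:
      matchLabels: {app: node-agent}
    template:
      metadata:
        labels: {app: node-agent}
      spec:
        containers:
        - name: agent
          image: example/agent:latest   # placeholder image

  # On delete, the DaemonSet gets a deletionTimestamp but is not removed until the
  # coordinating controller finishes its cleanup and patches the finalizer away.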

@reedchan7

Any update? I have a similar requirement now: I want a hook that does something when a job is deleted (or a pod is terminated). I tried to use the preStop hook, but it didn't work when I tried to delete my jobs in a CronJob.

@krmayankk

This proposal, kubernetes/enhancements#1995, seems to handle this use case. Please review it and see whether it is headed in the right direction for this issue.

@bboreham
Contributor

bboreham commented Mar 8, 2021

@krmayankk can you point more precisely to how kubernetes/enhancements#1995 would notify when a pod, deployment, daemonset, etc., is being deleted?

I think it could be extended to cover the case described here, by defining system-triggered notifications, but all I can see in the proposal as it stands is user-triggered.

@wgahnagl
Contributor

/kind feature

@k8s-ci-robot k8s-ci-robot added the kind/feature Categorizes issue or PR as related to a new feature. label Jun 24, 2021
@thockin
Member

thockin commented Aug 19, 2022

I don't think we're going to implement new "do this when a pod is deleted" hooks in the core system any time soon. kubernetes/enhancements#1995 is not what you want here; it's what bboreham described: user-triggered. If someone wants to think up a general event integration, it should at least start as an out-of-core implementation.
