
RFC: Deployment hooks #14512

Open
ironcladlou opened this issue Sep 24, 2015 · 31 comments
Labels
area/app-lifecycle area/workload-api/deployment kind/feature Categorizes issue or PR as related to a new feature. lifecycle/frozen Indicates that an issue or PR should not be auto-closed due to staleness. priority/awaiting-more-evidence Lowest priority. Possibly useful, but not yet enough support to actually get it done. sig/apps Categorizes an issue or PR as relevant to SIG Apps.

Comments

@ironcladlou
Contributor

Users often need a means to inject custom behavior into the lifecycle of a deployment process. The deployment API (#1743) could be expanded to support the execution of user-specified Docker images, which would be given an opportunity to run to completion at various points during the reconciliation process for a deployment.

Use cases and various design approaches were discussed previously in an OpenShift deployment hooks proposal.

This RFC is to capture initial thoughts on the topic and to link any existing related issues.


@bgrant0607 bgrant0607 added priority/important-soon Must be staffed and worked on either currently, or very soon, ideally in time for the next release. area/app-lifecycle team/ux labels Sep 24, 2015
@pmorie
Member

pmorie commented Sep 25, 2015

Hooks be like

↪️
↩️

@bgrant0607 bgrant0607 added the kind/feature Categorizes issue or PR as related to a new feature. label Oct 1, 2015
@smarterclayton
Contributor

A hook is a materialization of the "process" a deployment requires. A hook is a way of reducing the cost of implementing a full custom deployment process - instead, at logical points in the flow, control of the process is handed off to user code. Practically, hooks often involve one-way transitions in state, such as a forward database migration, removal of old values from a persistent store, or the clearing of a state cache. A hook is effectively a synchronous event listener with veto power - it may return success or failure, or in some cases need re-execution.

Because deployment processes typically involve coupling between state (assumed elsewhere) and code (frequently the target of the deployment), it should be easy for a hook to be coupled to a particular version of code, and easy for a deployer to use the code under deployment from the hook.

Not all deployment processes are equal - most code deployments are small, with infrequent larger deployments that require schema or state changes. It should be easy to reason about when hooks will get run, as well as to temporarily disable them. Important deployments are usually manual, so there is less motivation to build hook enable/disable mechanisms - it is usually better to allow automatic deployment and hooks to be disabled, and an imperative series of actions to be taken instead.
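
For a concrete shape, OpenShift's DeploymentConfig expresses this model as pre/mid/post hooks attached to a deployment strategy. A rough sketch follows; note the fields below are OpenShift's, not part of any Kubernetes Deployment API, and the command is illustrative:

```yaml
# Sketch of OpenShift-style lifecycle hooks (not a Kubernetes Deployment API).
strategy:
  type: Recreate
  recreateParams:
    pre:                       # runs before old pods are scaled down
      failurePolicy: Abort     # Abort | Retry | Ignore - the "veto power" above
      execNewPod:
        containerName: app     # reuse the image under deployment
        command: ["./migrate", "--forward"]
    # mid and post hooks take the same shape, running between
    # scale-down and scale-up, and after the new pods are up.
```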

@bgrant0607
Member

Related: #3585

@gravis

gravis commented Nov 19, 2015

Let me sum up our recent experience with OpenShift here.
We're using a dc to deploy an nginx + rails pod (2 containers). The 2 containers share an EmptyDir holding assets generated by sprockets in rails. Rails was previously deployed using capistrano, and a recipe was in charge of pre-compiling these assets (http://guides.rubyonrails.org/asset_pipeline.html#precompiling-assets).
This is a good example of where a pre-hook would fit. In dev, we don't precompile assets, so the container command is just "unicorn [...]" (a ruby webserver).
In production, we need to exec a pre-hook to compile these assets before unicorn starts.
This is where we ran into this issue: openshift/origin#4711 (comment)

@bgrant0607 bgrant0607 added this to the v1.2-candidate milestone Nov 19, 2015
@bgrant0607
Member

Example of a pre-rollout hook: #1899 (comment)

@smarterclayton
Contributor

This could also be done with the future init container proposal, where the init container on each pod gets a chance to run arbitrary code to fill out volumes.
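
A sketch of that approach applied to the asset-precompilation case above, assuming an emptyDir shared between an init container and the web container (image names, commands, and paths are illustrative). Note this runs once per pod, not once per rollout:

```yaml
# Per-pod init container filling a shared volume (illustrative sketch).
spec:
  volumes:
  - name: assets
    emptyDir: {}
  initContainers:
  - name: precompile-assets
    image: myapp:latest              # the same image being deployed
    command: ["bundle", "exec", "rake", "assets:precompile"]
    volumeMounts:
    - name: assets
      mountPath: /app/public/assets  # init container writes here
  containers:
  - name: unicorn
    image: myapp:latest
    volumeMounts:
    - name: assets
      mountPath: /app/public/assets  # web container serves from here
```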


@bgrant0607
Member

A hook use case is described here:
https://groups.google.com/forum/#!topic/google-containers/zblnAzLeSJA

@edevil
Contributor

edevil commented Jun 8, 2016

I'm also having difficulty deciding when/how to do database migrations in the context of a rolling-upgrade. It seems these hooks would be the perfect place for running the migration script.

@wombat

wombat commented Jul 28, 2016

@edevil well, as you can't have breaking schema changes in a rolling-upgrade scenario, you can also use K8s Jobs to do this. You can either wait for the Job to complete before doing the upgrade or trigger both at the same time.

@0xmichalis
Contributor

I'm also having difficulty deciding when/how to do database migrations in the context of a rolling-upgrade. It seems these hooks would be the perfect place for running the migration script.

This is the place where a mid-hook combined with the Recreate deployment strategy makes sense.

cc: @kubernetes/deployment

@smarterclayton
Contributor

smarterclayton commented Aug 3, 2016 via email

@F21

F21 commented Jan 24, 2017

Any updates on this one? I am interested in being able to run a deployment and start a chain of init containers for the deployment as a whole (not on a per-pod basis).

@edevil
Contributor

edevil commented Jan 30, 2017

Yeah, init containers per-deployment would be a good option as well.

@0xmichalis
Contributor

It's better if those hooks are re-entrant and frequently applied (since that makes their behavior fit a control loop style, rather than a workflow style)

Agreed. FWIW, the way we ensure rollbacks are re-entrant for Deployments is by having an API field that, when populated, causes the controller to remove it and update the pod template to the specified version in one atomic call.

@0xmichalis 0xmichalis added sig/apps Categorizes an issue or PR as relevant to SIG Apps. and removed team/ux (deprecated - do not use) labels Apr 22, 2017
@0xmichalis
Contributor

Proposed deferContainers would give better support for Cleanup actions like db-migration etc.,

@dhilipkumars deferContainers are executed at the pod level and are equivalent to post-hooks. This issue is about hooks at the deployment level, and it includes other types of hooks too (pre, mid).

@0xmichalis 0xmichalis added priority/awaiting-more-evidence Lowest priority. Possibly useful, but not yet enough support to actually get it done. and removed priority/important-soon Must be staffed and worked on either currently, or very soon, ideally in time for the next release. labels Apr 23, 2017
@dhilipkumars

@Kargakis deferContainers are proposed to be PreStop-like, so db-migration can be programmed easily using them. If the reason for these hooks is "better support for stateful apps", shouldn't we think of doing this in StatefulSets instead?

@0xmichalis
Contributor

There are more reasons for hooks than db migrations, e.g. image promotion. Although, if we have auto-pausing in the workload controllers (#11505), we can satisfy the use cases of this issue by programming hooks on top of the existing APIs.

@montanaflynn

Not sure if this is helpful or even related but here's our use case:

We want to get an HTTP webhook any time a deployment changes; this will allow us to correlate errors and changes in metrics with deployments.

@gkop

gkop commented Aug 24, 2017

We're using a very vanilla setup on GKE with the default rolling deployments. It's surprising that there's apparently no simple, conventional way to get a hook when the rolling deploy completes. Would anybody be so kind as to share a workaround for this? <3

@2rs2ts
Contributor

2rs2ts commented Sep 19, 2017

We have a very similar use case as @montanaflynn and deferContainers would not solve it.

@gkop

gkop commented Sep 19, 2017

Our workaround is to spawn `kubectl rollout status deploy/$DEPLOYMENT_NAME --watch=true | tail -n 1` and wait for the process to exit. We use the process exit status to determine whether or not the deploy was successful, and include the last line of output in our deploy notifications (usually "deployment $DEPLOYMENT_NAME successfully rolled out") #hacktastic

@486

486 commented Dec 13, 2017

Hi,

Any updates on this? We are using OpenShift but also want to be compatible with vanilla Kubernetes. Losing deployment hooks is the single biggest headache in that transition.

It is just so common to have older software that expects migrations or other setup code to be executed by exactly one process, which can be easily done in the pre-deployment hook.

From a user perspective, I find it very helpful to have deployment hooks as a first-class feature. "Your deployment failed because your migrations, which happen in your pre-hook, didn't run through, you should check that."

As a workaround, Helm can orchestrate Jobs that have to run before your Deployment, but we don't want to adopt more moving parts for this single feature.

@edevil
Contributor

edevil commented Jan 18, 2018

Just to add more fuel to the discussion, I was recently bitten by a problem using initContainers for running Django migrations.

During one of the rolling upgrades, several pods ran the "./manage.py migrate" script at the same time through the "initContainers" feature. Since Django does not do every migration operation inside a transaction, some of the operations were interleaved, which did not end well.

As I really need to run this script once per rolling deployment (Recreate is not an option), I am left with no option but to do it manually: launch a Job, check on it periodically, make sure it has run successfully, and only then perform the deployment.
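
That manual sequence can at least be captured declaratively: run a one-shot Job per rollout and gate the Deployment on its completion (for example with `kubectl wait --for=condition=complete job/django-migrate`). A sketch, with illustrative names:

```yaml
# One-shot migration Job, run once per rollout before applying the Deployment.
apiVersion: batch/v1
kind: Job
metadata:
  name: django-migrate
spec:
  backoffLimit: 0            # fail fast rather than retrying a partial migration
  template:
    spec:
      restartPolicy: Never
      containers:
      - name: migrate
        image: myapp:latest  # the image being rolled out
        command: ["./manage.py", "migrate"]
```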

@ironcladlou
Contributor Author

@486 @edevil the discussion continues over here: kubernetes/community#1171

@bgrant0607
Member

/lifecycle frozen

@k8s-ci-robot k8s-ci-robot added the lifecycle/frozen Indicates that an issue or PR should not be auto-closed due to staleness. label Jan 22, 2018
@shenshouer

Any update on this?

@galindro

@ironcladlou do you know where in https://github.com/kubernetes/enhancements the proposal kubernetes/community#1171 was moved to?

@alper

alper commented Jun 12, 2024

Is this still open or has this been completed somewhere?

@sftim
Contributor

sftim commented Dec 13, 2024

The deployment API (#1743) could be expanded to support the execution of user-specified Docker images, which would be given an opportunity to run to completion at various points during the reconciliation process for a deployment.

How about CEL as an alternative? Maybe MutatingAdmissionPolicy lets us build this now?
