[WIP] Initial deployment lifecycle hooks proposal #33545
Conversation
@kubernetes/deployment ptal
* **Retry** - using this policy means that if a deployment lifecycle hook fails, the lifecycle Pod will be restarted until the deployment reaches `TimeoutSeconds` or the hook succeeds.
`TimeoutSeconds` is `ProgressDeadlineSeconds`, added in the perma-failed PR.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
ok, I will remove it and use progressdeadline
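For context, a minimal sketch of what the failure-policy type could look like, modeled on OpenShift's `LifecycleHookFailurePolicy`; the names are assumptions, not the final API for this proposal:

```go
// LifecycleHookFailurePolicy describes the possible actions taken when a
// deployment lifecycle hook fails. Modeled on OpenShift's API; the names
// are assumptions, not part of this proposal's final shape.
type LifecycleHookFailurePolicy string

const (
	// Restart the hook pod until the deployment's deadline is reached or
	// the hook succeeds.
	LifecycleHookFailurePolicyRetry LifecycleHookFailurePolicy = "Retry"
	// Abort the rollout when the hook fails.
	LifecycleHookFailurePolicyAbort LifecycleHookFailurePolicy = "Abort"
	// Ignore the failure and continue the rollout.
	LifecycleHookFailurePolicyIgnore LifecycleHookFailurePolicy = "Ignore"
)
```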
### Default values

Initially the default `FailurePolicy` should be set to `Retry` until deployments can safely be rolled back automatically.
Link to the issue regarding automatic rollback.
ok
type RollingUpdateDeployment struct {
	// TimeoutSeconds is the time to wait for updates before giving up. If the
	// value is nil, a default will be used.
	TimeoutSeconds *int64
}
No need for this; `ProgressDeadlineSeconds` will be in the spec. You should mention that this proposal depends on the perma-failed PR.
updated.
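A minimal sketch of where the deadline would live instead, assuming the `ProgressDeadlineSeconds` field lands in `DeploymentSpec` as described in the perma-failed proposal (field placement here is an assumption):

```go
type DeploymentSpec struct {
	// ... other fields elided ...

	// ProgressDeadlineSeconds is the maximum time in seconds for a deployment
	// to make progress before it is considered to be failed. Lifecycle hooks
	// would share this deadline rather than carrying a separate TimeoutSeconds.
	ProgressDeadlineSeconds *int32
}
```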
Jenkins GCI GCE e2e failed for commit 7d193a9. Full PR test history.
Force-pushed: 7d193a9 to 6cf98c0
Jenkins verification failed for commit 6cf98c0. Full PR test history.
Because deployment processes typically involve coupling between state (assumed elsewhere) and code (frequently the target of the deployment), it should be easy for a hook to be coupled to a particular version of code, and easy for a deployer to use the code under deployment from the hook.
Add a background section here including:
- Why we did this on OpenShift and lessons learned
- How this might be applicable to pet sets
- Why this may or may not be applicable to daemon sets.
Kubernetes provides container lifecycle hooks for containers within a Pod. Currently, post-start and pre-stop hooks are supported. For deployments, post-start is the most relevant. Because these hooks are implemented by the Kubelet, the post-start hook provides some unique guarantees:
Can't I use an init container?
+1 to this question
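For comparison, a minimal sketch of the init-container alternative being asked about, assuming current `k8s.io/api` packages; the pod name, image, and command are placeholders:

```go
import (
	corev1 "k8s.io/api/core/v1"
	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
)

// migratePod runs migration logic in an init container: the kubelet runs the
// init container to completion (retrying on failure) before starting the app
// container, so the pod never becomes ready until the migration succeeds.
var migratePod = corev1.Pod{
	ObjectMeta: metav1.ObjectMeta{Name: "app-with-migration"},
	Spec: corev1.PodSpec{
		InitContainers: []corev1.Container{{
			Name:    "migrate",
			Image:   "example/app:v2",               // placeholder image
			Command: []string{"rake", "db:migrate"}, // placeholder command
		}},
		Containers: []corev1.Container{{
			Name:  "app",
			Image: "example/app:v2", // placeholder image
		}},
	},
}
```

The trade-off relative to a deployment-level hook: an init container runs on every start of every replica, not once per rollout.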
As mentioned on IRC, it may be useful to do custom deployment strategies (deployment executed by a non-core controller) first, which would allow this to be built as an extension controller (using observe or higher-level tools) that applied hooks before / after deployment. I think process-driven deployments are something it should be possible to build easily, and this is really a variant of a custom deployment.
If you can spawn a side proposal only covering how someone could define a custom strategy, we could have a multistage approach to hooks which makes Kube more extensible and reduces risk in the design process.
Ref #31571 re. custom controller. I like that approach, esp. in the short term (1.5/Q4). Review bandwidth is extremely limited right now. Also, I think hooks are more likely to be relevant to PetSet than to Deployment. Deployment is really aimed at stateless applications with lots of rollout flexibility.
* The hook is executed synchronously during the pod lifecycle.
PostStart is async, unless I'm missing something.
@bgrant0607 PostStart is definitely synchronous, according to the docs and my testing.
To clarify, it can start the entry point and the hook at the same time, but the hook has to finish running before the container is considered to be started.
* The status of the pod is linked to the status and outcome of the hook execution.
* The pod will not enter a ready status until the hook has completed successfully.
* Service endpoints will not be created for the pod until the pod has a ready status.
* If the hook fails, the pod's creation is considered a failure, and the retry behavior is
I think pod and container are confused here. Post-start is at the container level.
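For reference, a minimal sketch of a container-level post-start hook, assuming current `k8s.io/api` types (where the handler type is `LifecycleHandler`); the image and command are placeholders:

```go
import corev1 "k8s.io/api/core/v1"

// appContainer attaches a post-start exec hook. The kubelet may start the
// entrypoint and the hook concurrently, but the container is not considered
// started until the hook exits successfully, which in turn gates pod
// readiness and service endpoint creation.
var appContainer = corev1.Container{
	Name:  "app",
	Image: "example/app:v2", // placeholder image
	Lifecycle: &corev1.Lifecycle{
		PostStart: &corev1.LifecycleHandler{
			Exec: &corev1.ExecAction{
				// placeholder command
				Command: []string{"/bin/sh", "-c", "./warm-cache.sh"},
			},
		},
	},
}
```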
```go
type ExecNewPodHook struct {
	// (remaining fields elided in the diff context)
}
```
Why not a distinct pod or job spec?
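A sketch of the alternative being suggested, where the hook carries a full pod template instead of a command overlaid on the deployment's image; the type and field names are hypothetical (reusing the `LifecycleHookFailurePolicy` sketch above):

```go
import corev1 "k8s.io/api/core/v1"

// LifecycleHook is a hypothetical shape for a hook that runs an arbitrary
// pod rather than re-executing a command inside the deployment's own image.
type LifecycleHook struct {
	// FailurePolicy specifies the action to take if the hook pod fails.
	FailurePolicy LifecycleHookFailurePolicy
	// PodTemplate fully describes the hook pod, decoupling its image,
	// command, and volumes from the deployment's pod template.
	PodTemplate *corev1.PodTemplateSpec
}
```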
Re: need for hooks
Right now 50-60% of our ootb web app examples that use a DB use the hook to
initialize schema, so this is very relevant for "I want to build something
self-contained that dtrt". Somewhere upwards of 70% of the ruby apps
running on openshift online use the hook as a rails migration step.
In enterprise environments, we've seen hooks used extensively with JEE apps
to run mid-migration updates with recreate strategies (clear cache, etc.).
I would say those two examples are the most critical for ease of use in the use
cases we target, but both of those examples benefit because of openshift's
triggered deployments (via docker builds or tagged images resulting in
images being changed automatically). If you don't have that capability,
you are likely using custom scripting or Jenkins to drive your deployment
updates, and you probably have a place to insert the hook logic.
So a real philosophical question is - if you don't have a development flow
that is innate / integrated with Kube, you'll be able to do hooks if you
want. The hooks being on deployments gives a declarative tool for running
something like pre-start but at a larger logical level for anyone
performing deployment spec mutations - is that innately valuable to
everyone, or just to those with integrated flows?
I'd like to demonstrate a generic trigger controller (annotation that can
result in a mutation of a deployment / daemonset / petset based on the
latest level state of some external resource like a tagged image or git
repo or URL) but I'm trying to close the partial object retrieval story
first so I can do it more efficiently. But even that sort of tool is not
fundamental, since many other workflows exist. The reason we focus on
triggers is because they emphasize periodic refresh of cluster state (new
db image) without user intervention, but without failed deployments those
triggers could push resources into failing states. But as we talk about
this a lot it really does highlight how OpenShift's dev workflow is
opinionated and reinforcing (image resources -> triggers -> hooks -> failed
deployments). If each of those pieces hangs on the other, it makes each
individual piece in isolation less valuable. The fact we run the
deployment in a pod also reinforces that - it's easy for someone to tweak
the default deployment logic because we already have the concept of custom
code running on the users' behalf in a pod.
Perhaps we need to assess how someone like OpenShift can correctly
implement the opinionated, reinforcing, easy deployment workflow on top of
deployments first (which goes to the custom deployment question). That's
more "custom workflow on top of Kube". While that complicates the movement
of openshift users to deployments, it empowers others to build their own
workflows.
@smarterclayton Thanks much for the detailed background. Do the web-db cases put the DB in the same container/pod as the web app?
They typically do not - schema and "execution" logic come from web app, db
ref #14512 |
Adding label `do-not-merge` because the PR changes docs that are prohibited to auto-merge.
[APPROVALNOTIFIER] Needs approval from an approver in each of these OWNERS files. We suggest the following people:
This PR hasn't been active in 90 days. Closing this PR. Please reopen if you would like to work towards merging this change, if/when the PR is ready for the next round of review. cc @brendandburns @erictune @mfojtik @thockin. You can add the 'keep-open' label to prevent this from happening again, or add a comment to keep it open another 90 days.
@smarterclayton @Kargakis
This proposal is a summary collected from various sources.
A PoC branch (heavily WIP): https://github.com/mfojtik/kubernetes/tree/deployment-hooks