Add daemon design doc #13368

Closed · wants to merge 2 commits

Conversation

AnanyaKumar
Contributor

No description provided.

@AnanyaKumar
Contributor Author

@bgrant0607 @davidopp @vmarmol @rjnagal @vishh @erictune @timothysc @smarterclayton @eparis (cc'ing people who commented on the original proposal, feel free to add others if I missed anyone!)

@k8s-bot

k8s-bot commented Aug 31, 2015

GCE e2e build/test failed for commit da64887.


## Motivation

In Kubernetes, a Replication Controller ensures that the specified number of copies of a specified pod are running in the cluster at all times (pods are restarted if they are killed). With the Replication Controller, users cannot control which nodes their pods run on - Kubernetes decides how to schedule the pods onto nodes. However, many users want control over how certain pods are scheduled. In particular, many users have asked for a way to run a daemon on every node in the cluster, or on a certain set of nodes in the cluster. This is essential for use cases such as building a sharded datastore, or running a logger on every node. In comes the daemon controller, a way to conveniently create and manage daemon-like workloads in Kubernetes.
Member

It's not really true that "users cannot control which nodes their pods run on" -- you can use node selector in your Pod.Spec. The reason why ReplicationController is sub-optimal for this use case is that you can't know what value to set for replicas in the RC spec, since the size of the cluster (or size of the subset of the cluster defined by the node selector) may vary over time.
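For reference, the node selector mentioned here is just a field on the pod spec. A minimal sketch, with placeholder names, labels, and image (none of these values come from the proposal):

```yaml
apiVersion: v1
kind: Pod
metadata:
  name: node-logger            # placeholder name
  labels:
    app: node-logger           # placeholder label
spec:
  nodeSelector:
    role: logging              # pod only schedules onto nodes labeled role=logging (assumed label)
  containers:
  - name: logger
    image: example/logger      # placeholder image
```

The limitation is exactly the one noted above: a ReplicationController wrapping such a pod still needs a fixed `replicas` count, which cannot track how many nodes currently match the selector.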

Contributor Author

Will change.

@davidopp
Member

Made a few comments. One stylistic note: in parts of the doc you write in the future tense (e.g. "The daemon controller will support...") while in others you write in the past tense (e.g. "Each component was unit tested."). Probably best to stick to one tense.

@AnanyaKumar
Contributor Author

@davidopp Thanks for the review, I'll make the changes tomorrow! I'll change all the tenses to the past tense.

k8s-github-robot added the size/L label (Denotes a PR that changes 100-499 lines, ignoring generated files) on Sep 2, 2015
@k8s-github-robot

Labelling this PR as size/L

@AnanyaKumar
Contributor Author

@davidopp Sorry for the delay, made the requested changes. PTAL when you have time!

@k8s-bot

k8s-bot commented Sep 2, 2015

GCE e2e build/test passed for commit f2297d8.

- update
- Daemon controllers have labels, so you could, for example, list all daemon controllers with a certain label (the same way you would for a Replication Controller).
- In general, for all the supported features like get, describe, update, etc, the Daemon Controller works in a similar way to the Replication Controller. However, note that the Daemon Controller and the Replication Controller are different constructs.
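
To make the parallel concrete, the metadata of a labeled daemon controller might look roughly like this, reusing the names from the example elsewhere in this doc (the kind, version, and layout follow the draft proposal, not a finalized API):

```yaml
apiVersion: v1                 # assumed; the excerpt does not show the version
kind: DaemonController         # the proposed kind (the feature ultimately shipped as DaemonSet)
metadata:
  name: datastore-shard
  labels:
    name: datastore-shard      # list or filter daemon controllers by this label, as with an RC
```

Listing by label would then mirror the RC workflow, presumably via `kubectl get dc` with a label selector.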

Contributor (Kamal Marhubi)

I think that the update needs to be explained a bit more carefully. Will there be a rolling update to limit the number of instances that are changing? The description of the daemon controller's operation doesn't seem to allow for that.

Contributor (Clayton Coleman)

We'd probably describe this in deployment terminology as MaxAvailability: 80% and MaxSurge: 0. The deployment process would need a way to make the daemon controller "inert" or "inactive", create a new daemon controller, and then selectively delete old pods.
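
Purely as an illustration of the constraint being described, such a policy might be spelled out like this; the field names below are borrowed from the deployment-style terminology in the comment and are not part of the daemon controller proposal:

```yaml
# Hypothetical update policy block; none of these fields exist in the proposal.
updateStrategy:
  maxAvailability: 80%   # keep at least 80% of the daemon pods running during an update
  maxSurge: 0            # never run the old and new copy of the daemon on the same node at once
```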

(Reply sent by email, quoting Kamal Marhubi's comment above on docs/design/daemon.md. The quoted excerpt from the doc:)

```yaml
        image: kubernetes/sharded
        ports:
          - containerPort: 9042
            name: main
```
- commands that get info
  - get (e.g. kubectl get dc)
  - describe
- Modifiers
  - delete
  - stop: first we turn down all the pods controlled by the daemon (by setting the nodeName to a non-existent name), then we turn down the daemon controller.
  - label
  - update

Member

I agree you should provide some description of a plausible update process. Even if it is managed by the deployment controller, I assume we need some way for the deployment controller to tweak the daemon controller to accomplish the update? (Or would the deployment controller manage the update directly?)

Also, since one of the main purposes of the daemon controller is to guarantee one replica per node, you probably want to kill the old pod before starting the new pod (i.e. the application may break if there are two instances running on the same node).

Member

I don't want to rathole on updates in the initial implementation, but I envision Daemon orchestrating the updates, analogous to (but not the same as) Deployment.

Member

For now I just specified which fields can be updated.

@timothysc
Member

@AnanyaKumar one thing I think is missing is QoS, and possibly priority of some kind.

e.g.

- What prevents nodes from getting filled with other tasks?
- What tier of service are daemons, and will they ever get bumped?
  ...


**Author**: Ananya Kumar (@AnanyaKumar)

**Status**: Draft proposal; prototype in progress.
Member

Implementation is in progress.

  labels:
    name: datastore-shard
spec:
  node-selector:
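
Filling in the fragment above from the other excerpts quoted in this thread, the draft's full example presumably looks roughly like the following; the indentation, the `template` wrapper, and the empty node-selector value are reconstructions, not copied from the doc:

```yaml
apiVersion: v1                    # assumed; not visible in the excerpt
kind: DaemonController            # the draft's proposed kind (later shipped as DaemonSet)
metadata:
  name: datastore-shard
  labels:
    name: datastore-shard
spec:
  node-selector:                  # value elided in the excerpt above
  template:
    metadata:
      labels:
        name: datastore-shard
    spec:
      containers:
        - name: main
          image: kubernetes/sharded
          ports:
            - containerPort: 9042
              name: main
```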

@mikedanese
Member

Can we merge this before v1.1 considering the DaemonSet will have been entirely merged? @AnanyaKumar do you have time to address comments? Thanks for all your work. I would love to have some accompanying doc included in v1.1.

@AnanyaKumar
Contributor Author

@mikedanese Sorry for the delay, I had a few big deadlines. Is @erictune taking over this, based on the PR he sent out? Or is that for something else? Thanks for finishing up the DaemonSet PRs! :)

@davidopp
Member

I don't mind taking over this PR and making the necessary changes given how soon the code freeze is, if @AnanyaKumar doesn't have time. But I've pinged #14079 to see if @erictune is trying to replace this doc or just augment it.

@davidopp
Member

I'm taking over this PR. I am going to send out a new PR that incorporates the review comments made since Ananya's last commit. In the case of comments that merit a reply, I will reply in this PR (not the new one) since there's no way I'm aware of to move the comments over.

@AnanyaKumar
Contributor Author

@davidopp Thanks for taking over, and I'm so sorry for not revising the design doc - I've been pretty occupied at school, and spent the weekend at a hackathon. If you have any questions feel free to ping me.

davidopp pushed a commit to davidopp/kubernetes that referenced this pull request Sep 22, 2015
@davidopp
Member

Superseded by #14326. Please put any followup comments there.

davidopp closed this Sep 22, 2015
davidopp mentioned this pull request Sep 24, 2015
ananth-at-camphor-networks pushed a commit to Juniper/kubernetes that referenced this pull request Oct 3, 2015
mkulke pushed a commit to mkulke/kubernetes that referenced this pull request Oct 7, 2015
xingzhou pushed a commit to xingzhou/kubernetes that referenced this pull request Dec 15, 2016