New Job resource #7380
Conversation
@@ -747,6 +747,27 @@ type ReplicationController struct {
    Labels map[string]string `json:"labels,omitempty" description:"map of string keys and values that can be used to organize and categorize replication controllers"`
}

// JobState is the state of a job, either input (create, update) or as output (list, get).
type JobState struct {
    Completions int `json:"replicas" description:"number of replicas (desired or observed, as appropriate)"`
json and internal name have a mismatch, is that intentional?
fixed
Force-pushed from d4a794b to af960c1.
The v1beta3 API LGTM. One issue is that we'll need to figure out how to make Job kill off all its pods when it is killed (#1535). We shouldn't do that in the client the way it was done for RC, though it should also be the default behavior (--cascade=true). Any volunteers to do a more thorough review?
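For illustration only (none of this is in the PR): a server-side cascade could amount to listing the Job's pods by its label selector and deleting them when the Job itself is deleted. The interface and function names below are made up for the sketch.

```go
// Hypothetical sketch of server-side cascading deletion; not part of this PR.
type podClient interface {
    // ListPodNames returns the names of pods matching the label selector.
    ListPodNames(selector map[string]string) ([]string, error)
    // DeletePod deletes a single pod by name.
    DeletePod(name string) error
}

// cascadeDeleteJobPods removes every pod matched by the Job's selector,
// which is roughly what a controller-side --cascade=true default would do.
func cascadeDeleteJobPods(c podClient, selector map[string]string) error {
    names, err := c.ListPodNames(selector)
    if err != nil {
        return err
    }
    for _, name := range names {
        if err := c.DeletePod(name); err != nil {
            return err
        }
    }
    return nil
}
```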
// JobSpec is the specification of a job.
type JobSpec struct {
    // Completions is the number of desired completions.
    Completions int `json:"completions" description:"number of times the job has completed successfully (desired or observed, as appropriate)"`
Please make this field optional (omitempty), default this value to 1, and make it *int.
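Presumably the requested shape is something like the following (a sketch of the suggestion, not the actual diff; the description text is illustrative):

```go
type JobSpec struct {
    // Completions is the number of desired successful completions.
    // Optional: when nil it would be defaulted to 1 (the defaulting itself
    // belongs in the versioned defaulting code, not in the type definition).
    Completions *int `json:"completions,omitempty" description:"number of desired successful completions, defaults to 1"`
    // ... other fields elided ...
}
```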
Should this read: "number of times a pod has completed successfully"?
Will look tomorrow.
@smarterclayton Thanks, but not super-urgent if other things are more critical.
    Completions int `json:"completions"`

    // Selector is a label query over pods that should match the completed count.
    Selector map[string]string `json:"selector"`
Is any guarantee given if half the count runs, the nodes are lost, and so the count isn't tracked? Or is the job controller expected to cache the pending completions until the job passes the threshold? Can completed pods not be deleted until enough job pods have run? Maybe I missed the deeper discussion on this...
I can see it working both ways. The simpler version is that if the kubelet doesn't get a chance to report the completed status of the job, it could run multiple times. There was some discussion on #1624 about this case. The more involved fix would be to stash the expected completions locally, update it every time a pod status matching the selectors changes to completed, and have the job controller monitor something like observedCompleted + active < spec.completions.
I think I'm going for the simpler version first, thoughts?
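As a rough sketch of the "more involved" bookkeeping described above (illustrative only, not code from this PR), the controller's reconcile step would boil down to something like:

```go
// podsToCreate returns how many new pods the job controller should start,
// given the completions it has already observed, the pods currently active,
// and the desired completion count from spec.completions. New pods are only
// created while observedCompleted + active < desired.
func podsToCreate(observedCompleted, active, desired int) int {
    missing := desired - (observedCompleted + active)
    if missing < 0 {
        return 0
    }
    return missing
}
```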
It was discussed here:
#1624 (comment)
There's no way to absolutely prevent multiple execution, since containers could always fail just before exiting with 0. I think the initial version could make a pretty good effort at tracking completions in status. I had suggested persisting them in an annotation as a kind of private checkpoint.
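The annotation-checkpoint idea might look roughly like this (a sketch; the annotation key is invented for the example):

```go
import "strconv"

// observedCompletionsAnnotation is a made-up key for this sketch; the real
// name, if this approach is taken, would be decided in the implementation.
const observedCompletionsAnnotation = "job.kubernetes.io/observed-completions"

// checkpointCompletions records the number of completions the controller has
// observed so far in the Job's annotations, so the count survives a controller restart.
func checkpointCompletions(annotations map[string]string, observed int) map[string]string {
    if annotations == nil {
        annotations = map[string]string{}
    }
    annotations[observedCompletionsAnnotation] = strconv.Itoa(observed)
    return annotations
}
```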
There's been some discussion around whether we really want the job controller for 1.0, and if we don't, there's no point adding just a Job resource, so as @bgrant0607 said, no hurry for this :) I'm really waiting for #1624 to get either the 1.0 or post-1.0 label; Brian or @davidopp might be able to help with prioritizing that.
I can do a more thorough review.
// JobStatus represents the current status of a job.
type JobStatus struct {
    // Completions is the number of actual completions.
    Completions int `json:"completions" description:"most recently observed number of completions"`
A UI would want to show a user a list of running, successful, and failed pods. Is this something that can be reconstructed from `get pods` and an appropriate label match, or is it something the JobStatus needs to contain?
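For what it's worth, the reconstruct-from-pods approach would amount to bucketing the pods returned by the label query by phase, along these lines (a sketch; the api import path and field names are assumed from roughly this era of the codebase):

```go
import api "github.com/GoogleCloudPlatform/kubernetes/pkg/api" // path is an assumption; it has moved over time

// countPodPhases buckets a Job's pods (already listed via the Job's label
// selector) by phase, which is the breakdown a UI would want to display.
// Assumes the Pod type with Status.Phase and the standard phase constants.
func countPodPhases(pods []api.Pod) (running, succeeded, failed int) {
    for i := range pods {
        switch pods[i].Status.Phase {
        case api.PodRunning:
            running++
        case api.PodSucceeded:
            succeeded++
        case api.PodFailed:
            failed++
        }
    }
    return running, succeeded, failed
}
```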
I've commented in #1624. Please don't merge this until we reach a conclusion on that comment.
// JobSpec is the specification of a job.
// As the internal representation of a job, it may have either a TemplateRef or a Template set.
type JobSpec struct {
    // Completions is the number of desired successful completions of this job.
"of this job" -> "of pods of this job" (the "job" only runs to completion once)
GCE e2e build/test failed for commit af960c1.
GCE e2e build/test failed for commit 640d4625ebd7b1b524e4dbb6d6fb83bd0ae88fe4.
Rebased. cc @mikedanese as (I think) he's going to be picking up the job controller. I've only lightly tested this post-rebase (job unit tests and basic kubectl resource operations using the example job.json).
GCE e2e build/test failed for commit 071d313.
I think we should hold off on further work on this PR until #11746 settles down. Afterwards I'll be happy to pick it up and rework it to match the designed API.
GCE e2e build/test passed for commit 071d313.
I've picked up the changes but had to rework this to place job in
Plumbing required to create a new job resource (storage, kubectl, rest client, validation).
@davidopp, @bgrant0607, @lavalamp (not sure if you're interested, but fyi anyway).