Kubelet to POST pod status to apiserver #4561

Closed
erictune opened this issue Feb 18, 2015 · 13 comments · Fixed by #5205 or #5555

Comments

@erictune
Member

Split off from #156 as a smaller, more specific work item.

Once every N sync loops, the kubelet should POST to /api/$VERSION/namespaces/$NS/pods/$NAME/status for each pod.

The kubelet would do this if enabled by a flag, and emit a warning if it failed to POST the update.

The kubelet would ideally handle a 429 by retrying after the delay indicated by the Retry-After header.
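
For illustration, here is a minimal sketch of that behavior in Go, assuming a plain `net/http` client and a hypothetical `postPodStatus` helper (none of this is the actual kubelet code; the retry count and default delay are arbitrary):

```go
package statusupdate

import (
	"bytes"
	"fmt"
	"net/http"
	"strconv"
	"time"
)

// postPodStatus POSTs a pod's status document to
// /api/$VERSION/namespaces/$NS/pods/$NAME/status and honors Retry-After
// when the apiserver answers 429.
func postPodStatus(client *http.Client, apiServer, version, ns, name string, statusJSON []byte) error {
	url := fmt.Sprintf("%s/api/%s/namespaces/%s/pods/%s/status", apiServer, version, ns, name)
	for attempt := 0; attempt < 3; attempt++ {
		resp, err := client.Post(url, "application/json", bytes.NewReader(statusJSON))
		if err != nil {
			return err
		}
		resp.Body.Close()
		switch {
		case resp.StatusCode == http.StatusTooManyRequests:
			// Back off for the interval the apiserver asked for, then retry.
			delay := time.Second
			if s := resp.Header.Get("Retry-After"); s != "" {
				if secs, convErr := strconv.Atoi(s); convErr == nil {
					delay = time.Duration(secs) * time.Second
				}
			}
			time.Sleep(delay)
		case resp.StatusCode >= 400:
			// The caller would log this as a warning rather than failing the sync loop.
			return fmt.Errorf("status POST for %s/%s failed: %s", ns, name, resp.Status)
		default:
			return nil
		}
	}
	return fmt.Errorf("status POST for %s/%s still rate limited after retries", ns, name)
}
```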

@dchen1107
Member

In the long run, we want to do a bulk POST to the apiserver, but that is not a v1 blocker.

@dchen1107 added the priority/backlog and sig/node labels on Feb 18, 2015
@dchen1107 added this to the v1.0 milestone on Feb 18, 2015
@derekwaynecarr
Member

api/$VERSION/namespaces/$NS/pods/$NAME/status for each pod.

Sent from my iPhone

On Feb 18, 2015, at 5:55 PM, Eric Tune notifications@github.com wrote:

> api/$VERSION/namespace/$NS/pods/$NAME/status for each pod.

@fgrzadkowski
Contributor

As suggested by @dchen1107, I'll work on this.

(For some reason I can't assign myself to this issue.)

@timothysc
Member

Out of curiosity, why can't we roll up Node and Pod status into a single status update?

@dchen1107
Member

That is the plan: a bulk status update eventually, but it's not strictly required at this moment.

@erictune
Member Author

erictune commented Mar 2, 2015

@timothysc what URL do you propose for doing a node + pod update?

@yujuhong
Contributor

yujuhong commented Mar 4, 2015

@fgrzadkowski, FYI, my PR #5019 modifies the kubelet to reject (i.e. set the pod status to failed) pods that have a port conflict. The status is stored in a map and gets reported back later via status polling.
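
A rough sketch of that pattern, with a mutex-guarded map keyed by pod (the type and method names here are illustrative assumptions, not the actual code in #5019):

```go
package statuscache

import "sync"

// PodStatus is a simplified stand-in for the real API type.
type PodStatus struct {
	Phase   string
	Message string
}

// statusManager keeps the most recent status per pod so that pods rejected up
// front (e.g. for a port conflict) can still be reported via status polling.
type statusManager struct {
	mu       sync.Mutex
	statuses map[string]PodStatus // keyed by "namespace/name"
}

func newStatusManager() *statusManager {
	return &statusManager{statuses: make(map[string]PodStatus)}
}

// SetFailed records a failed status for a pod rejected during admission.
func (s *statusManager) SetFailed(podKey, reason string) {
	s.mu.Lock()
	defer s.mu.Unlock()
	s.statuses[podKey] = PodStatus{Phase: "Failed", Message: reason}
}

// Get returns the cached status, if any, when status is polled or pushed.
func (s *statusManager) Get(podKey string) (PodStatus, bool) {
	s.mu.Lock()
	defer s.mu.Unlock()
	status, ok := s.statuses[podKey]
	return status, ok
}
```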

@smarterclayton
Contributor

#5085 is quasi-blocked on this: if we do graceful deletion with a TTL (the optimal approach), we won't be able to clear the binding at the point the pod is actually deleted. We could still delete the binding at the point the TTL starts, which is somewhat reasonable (since you can't stop or delay a deletion as I've implemented it so far) because it will trigger the kubelet to remove the pod gracefully. However, true graceful deletion would mean sending SIGTERM to Docker with the remaining TTL window as soon as the pod sees the deletion, then SIGKILL when the delete actually happens, and that's harder to do if the pod has already disappeared from the binding.
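
For readers following along, a simplified sketch of that SIGTERM-then-SIGKILL sequence, with an os.Process standing in for the Docker container (purely illustrative; the real kubelet would go through the Docker API):

```go
package graceful

import (
	"os"
	"syscall"
	"time"
)

// terminateGracefully sends SIGTERM as soon as the deletion (with TTL) is
// observed, waits out the remaining grace window, then force-kills if the
// container hasn't exited on its own.
func terminateGracefully(proc *os.Process, remainingTTL time.Duration, exited <-chan struct{}) error {
	if err := proc.Signal(syscall.SIGTERM); err != nil {
		return err
	}
	select {
	case <-exited:
		// Container shut down on its own within the grace window.
		return nil
	case <-time.After(remainingTTL):
		// Grace window elapsed; force termination.
		return proc.Kill()
	}
}
```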

@fgrzadkowski
Contributor

I have a PR for this that's almost ready (some tests are failing). I'll send it on Monday.

@fgrzadkowski
Contributor

I had to revert PR #5305 due to bugs. Reopening the issue. I'll send a fixed version soon.

@timothysc
Member

In the case of deletion, we're now seeing gobs of traffic from kubelets trying to send updated status for deleted pods, with the apiserver responding NOT FOUND.

Easy repro:

  1. Run density tests.
  2. Start traffic monitoring on the apiserver: `tcpdump -nnvvXSs 1514 'tcp port 8080 and (((ip[2:2] - ((ip[0]&0xf)<<2)) - ((tcp[12]&0xf0)>>2)) != 0)'`
  3. Wait for cleanup.

This results in what appears to be a death spiral that we cannot exit without a hard cluster reboot.

More details:
On a 23-node cluster running 1001 pods in a steady-state environment (steady state = no external load, just internal Kubernetes traffic), we see the apiserver consuming >50% CPU on a 40-core box.

Numerous `kubectl get pods` calls return no result and no error; I'm guessing they're being answered with 429s.

@vmarmol
Contributor

vmarmol commented Mar 18, 2015

Sent out #5619 to lower and spread that load. It lowers the qps from 100 to ~9.
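
Sketching the general "lower and spread" idea (not necessarily what #5619 actually implements): pace the per-pod POSTs evenly across a window instead of sending them in one burst.

```go
package spread

import "time"

// spreadUpdates paces per-pod status POSTs evenly across a window instead of
// firing them all at once, capping the effective QPS at len(podKeys)/window.
func spreadUpdates(podKeys []string, window time.Duration, post func(podKey string)) {
	if len(podKeys) == 0 {
		return
	}
	interval := window / time.Duration(len(podKeys))
	if interval <= 0 {
		interval = time.Millisecond
	}
	ticker := time.NewTicker(interval)
	defer ticker.Stop()
	for _, key := range podKeys {
		post(key)
		<-ticker.C // wait out the pacing interval before the next pod
	}
}
```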

@vmarmol
Contributor

vmarmol commented Mar 18, 2015

After discussions with @bgrant0607 and @dchen1107, the suggestion is to update the status only when it changes and on startup. The heartbeat will be handled by the node controller rather than per pod. I'll file a separate issue for that; #5619 will go in for now.
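
A minimal sketch of the update-on-change idea, comparing against a locally cached copy before POSTing (the names and the use of reflect.DeepEqual are assumptions for illustration, not the eventual implementation):

```go
package statuschange

import (
	"reflect"
	"sync"
)

// PodStatus is a simplified stand-in for the real API type.
type PodStatus struct {
	Phase   string
	Message string
}

// changeReporter remembers the last status sent per pod and only POSTs again
// when the status actually differs (plus the unconditional send at startup,
// when nothing is cached yet).
type changeReporter struct {
	mu   sync.Mutex
	last map[string]PodStatus
	post func(podKey string, status PodStatus) error // hypothetical POST helper
}

func newChangeReporter(post func(string, PodStatus) error) *changeReporter {
	return &changeReporter{last: make(map[string]PodStatus), post: post}
}

// Report sends the status only if it changed since the last successful send.
func (r *changeReporter) Report(podKey string, status PodStatus) error {
	r.mu.Lock()
	prev, seen := r.last[podKey]
	r.mu.Unlock()
	if seen && reflect.DeepEqual(prev, status) {
		return nil // nothing changed; skip the apiserver round trip
	}
	if err := r.post(podKey, status); err != nil {
		return err
	}
	r.mu.Lock()
	r.last[podKey] = status
	r.mu.Unlock()
	return nil
}
```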
