NodeController doesn't evict Pods if no Nodes are Ready #25571
Conversation
@@ -63,6 +63,11 @@ type UniqueQueue struct {
 	set sets.String
 }
+
+// Length returns a number of items currently in the queue
+func (q *UniqueQueue) Length() int {
looks unsafe re: locking. Please don't add this, esp don't make it public.
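(For context on the locking concern: a safe accessor would need to take the queue's lock before reading. The sketch below is purely illustrative; the field names lock and queue are assumptions and are not copied from the PR.)

package queuesketch

import "sync"

// uniqueQueue is an illustrative stand-in for the real type; the field
// names here are assumptions made for this sketch.
type uniqueQueue struct {
	lock  sync.Mutex
	queue []string
}

// Length takes the lock before reading, avoiding the data race an
// unguarded len() accessor would introduce.
func (q *uniqueQueue) Length() int {
	q.lock.Lock()
	defer q.lock.Unlock()
	return len(q.queue)
}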
Done.
looks OK to me.
Ping me when you get tests working. :)
@lavalamp tests are passing now
I guess @lavalamp meant the tests I mentioned in my previous comment. There are no e2e tests for this and I didn't have time to write them. I'll hand-test it today and add tests sometime next week.
Branch updated from ac2be30 to 59a83a4.
@lavalamp I ran manual tests and it seems to work now (I had a bug: now I skip the master node when looking for Ready Nodes).
Can you exercise your bugfix in the unit test? Then LGTM.
There is a unit test for this - the last case I added checks exactly this.
@k8s-bot test this issue: #IGNORE
Sorry, maybe I'm blind, but I can't see a test case with a node having a name ending in "-master"?
Oh, sorry - the master thing. I misread your comment.
@lavalamp PTAL
LGTM!
It's pretty high in the queue, but we might need to merge it by hand if it doesn't make it in by tomorrow.
@k8s-bot test this [submit-queue is verifying that this PR is safe to merge] |
GCE e2e build/test passed for commit 6d27009.
Automatic merge from submit-queue
This looks like it might also resolve the scenario described in #19124. Will it be backported to 1.2 as well?
@pesho No. But 1.3 should be out relatively soon (less than a month).
I am seeing this error in the kubelet logs:
All pods on the node were rescheduled to other nodes. Is this how it's supposed to behave? Also, did it reschedule without at least doing a few retries?
@manojlds - yes, this is as expected, and the NodeController does a number of retries before it evicts Pods (by default 30, but it depends on how often NodeStatus is reported and on the PodEvictionGracePeriod settings). From what you wrote there were other Nodes that were serving, so the behavior seems correct - or did you expect something else? This log line suggests that the cloud provider became unreachable from some of your Nodes. @yujuhong
@gmarek - thanks. This is the only instance of the error I could see in the log, so I'm not sure if it retried before actually evicting the pods. The behaviour seems fine; my only question is about the retries.
Even if the kubelet does not retry it's fine, as the NodeController requires a lot of failed NodeStatus updates before it actually starts doing something. For the kubelet side of this, @yujuhong should be able to answer or point you to someone who can.
The kubelet should retry up to 5 times by default. On top of that, the kubelet tries to update the node status every 10s (or a custom period passed via flags/config), and the node controller only reacts on the order of minutes. One failed attempt should not cause any serious reaction.
@yujuhong thanks. Will take a look at the flags, and see how best I can use them in my cluster.
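(For reference, the knobs discussed above typically map to the flags below; the values shown are the usual defaults from this era and may differ by release, so treat them as illustrative rather than authoritative.)

kubelet --node-status-update-frequency=10s ...
kube-controller-manager --node-monitor-grace-period=40s --pod-eviction-timeout=5m0s ...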
Fix #13412 #24597
When the NodeController doesn't see any Ready Node it goes into "network segmentation mode". In this mode it cancels all evictions and doesn't evict any Pods.
It leaves network segmentation mode when it sees at least one Ready Node. On leaving, it resets all timers, so each Node has the full grace period to reconnect to the cluster (a rough sketch of this logic follows below).
cc @lavalamp @davidopp @mml @wojtek-t @fgrzadkowski
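(The sketch below is a simplified illustration of the behavior described above, not the actual nodecontroller code; all type, field, and function names are made up for the example.)

package segsketch

import "strings"

// node and nodeController are illustrative stand-ins for the real types.
type node struct {
	Name  string
	Ready bool
}

type nodeController struct {
	inSegmentationMode bool
}

// handleMonitorTick sketches the described behavior: if no non-master Node
// is Ready, enter "network segmentation mode" and stop evicting; when at
// least one Ready Node is seen again, leave the mode and reset timers so
// every Node gets the full grace period to reconnect.
func (nc *nodeController) handleMonitorTick(nodes []node) {
	anyReady := false
	for _, n := range nodes {
		// The master node is skipped when looking for Ready Nodes.
		if strings.HasSuffix(n.Name, "-master") {
			continue
		}
		if n.Ready {
			anyReady = true
			break
		}
	}

	if !anyReady {
		nc.inSegmentationMode = true
		// Cancel all pending evictions here; evict nothing while segmented.
		return
	}

	if nc.inSegmentationMode {
		nc.inSegmentationMode = false
		// Reset all eviction timers so each Node has the full grace period.
	}
	// Normal eviction handling would continue from here.
}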