nodecontroller never finishes updates on 2k nodes #26211

Closed
zmerlynn opened this issue May 24, 2016 · 34 comments

Labels: area/nodecontroller, kind/bug, priority/important-soon, sig/node
Milestone: v1.3

Comments

@zmerlynn
Member

My cluster has been up for 20m or so and the nodecontroller is going crazy (this is filtering out "observed a new Node" messages, because they're very spammy):

W0524 21:47:27.424786      19 reflector.go:334] pkg/controller/node/nodecontroller.go:271: watch of *api.Node ended with: too old resource version: 574213 (575678)
E0524 21:47:28.035688      19 nodecontroller.go:856] Error updating node gke-jenkins-e2e-default-pool-4-25892338-yhg1: Operation cannot be fulfilled on nodes "gke-jenkins-e2e-default-pool-4-25892338-yhg1": the object has been modified; please apply your changes to the latest version and try again
E0524 21:47:28.103729      19 nodecontroller.go:856] Error updating node gke-jenkins-e2e-default-pool-4-25892338-ymsy: Operation cannot be fulfilled on nodes "gke-jenkins-e2e-default-pool-4-25892338-ymsy": the object has been modified; please apply your changes to the latest version and try again
E0524 21:47:28.128099      19 nodecontroller.go:856] Error updating node gke-jenkins-e2e-default-pool-4-25892338-ynx0: Operation cannot be fulfilled on nodes "gke-jenkins-e2e-default-pool-4-25892338-ynx0": the object has been modified; please apply your changes to the latest version and try again
E0524 21:47:28.158973      19 nodecontroller.go:856] Error updating node gke-jenkins-e2e-default-pool-4-25892338-yrcz: Operation cannot be fulfilled on nodes "gke-jenkins-e2e-default-pool-4-25892338-yrcz": the object has been modified; please apply your changes to the latest version and try again
E0524 21:47:28.721125      19 nodecontroller.go:856] Error updating node gke-jenkins-e2e-default-pool-4-25892338-yw2l: Operation cannot be fulfilled on nodes "gke-jenkins-e2e-default-pool-4-25892338-yw2l": the object has been modified; please apply your changes to the latest version and try again
E0524 21:47:30.945799      19 nodecontroller.go:856] Error updating node gke-jenkins-e2e-default-pool-4-25892338-yxk4: Operation cannot be fulfilled on nodes "gke-jenkins-e2e-default-pool-4-25892338-yxk4": the object has been modified; please apply your changes to the latest version and try again
E0524 21:47:31.097177      19 nodecontroller.go:856] Error updating node gke-jenkins-e2e-default-pool-4-25892338-z63d: Operation cannot be fulfilled on nodes "gke-jenkins-e2e-default-pool-4-25892338-z63d": the object has been modified; please apply your changes to the latest version and try again
E0524 21:47:32.075794      19 nodecontroller.go:856] Error updating node gke-jenkins-e2e-default-pool-4-25892338-zelk: Operation cannot be fulfilled on nodes "gke-jenkins-e2e-default-pool-4-25892338-zelk": the object has been modified; please apply your changes to the latest version and try again
E0524 21:47:33.662551      19 nodecontroller.go:856] Error updating node gke-jenkins-e2e-default-pool-4-25892338-zfcu: Operation cannot be fulfilled on nodes "gke-jenkins-e2e-default-pool-4-25892338-zfcu": the object has been modified; please apply your changes to the latest version and try again
E0524 21:47:34.095979      19 nodecontroller.go:856] Error updating node gke-jenkins-e2e-default-pool-4-25892338-znja: Operation cannot be fulfilled on nodes "gke-jenkins-e2e-default-pool-4-25892338-znja": the object has been modified; please apply your changes to the latest version and try again
E0524 21:47:35.965741      19 nodecontroller.go:856] Error updating node gke-jenkins-e2e-default-pool-4-25892338-zt9s: Operation cannot be fulfilled on nodes "gke-jenkins-e2e-default-pool-4-25892338-zt9s": the object has been modified; please apply your changes to the latest version and try again
E0524 21:47:36.900036      19 nodecontroller.go:856] Error updating node gke-jenkins-e2e-default-pool-4-25892338-ztv5: Operation cannot be fulfilled on nodes "gke-jenkins-e2e-default-pool-4-25892338-ztv5": the object has been modified; please apply your changes to the latest version and try again
E0524 21:47:36.935768      19 nodecontroller.go:856] Error updating node gke-jenkins-e2e-default-pool-4-25892338-zv49: Operation cannot be fulfilled on nodes "gke-jenkins-e2e-default-pool-4-25892338-zv49": the object has been modified; please apply your changes to the latest version and try again
W0524 21:47:43.901803      19 reflector.go:334] pkg/controller/node/nodecontroller.go:271: watch of *api.Node ended with: too old resource version: 575746 (576929)
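
(For context: the "Operation cannot be fulfilled ... the object has been modified" messages are the apiserver's optimistic-concurrency conflicts: a write that carries a stale resourceVersion is rejected with 409 Conflict. The sketch below shows how a controller can hit this; it uses the current client-go API, and the clientset, node name, and condition edit are illustrative placeholders, not the actual nodecontroller code.)

package example

import (
	"context"
	"fmt"

	apierrors "k8s.io/apimachinery/pkg/api/errors"
	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
	"k8s.io/client-go/kubernetes"
)

// markNodeNotReady reads a Node, flips its Ready condition, and writes the
// status back. If the kubelet posts a status heartbeat between the Get and
// the UpdateStatus, the resourceVersion sent back is stale and the apiserver
// answers 409 Conflict -- exactly the error spammed in the log above.
func markNodeNotReady(ctx context.Context, cs kubernetes.Interface, name string) error {
	node, err := cs.CoreV1().Nodes().Get(ctx, name, metav1.GetOptions{})
	if err != nil {
		return err
	}
	for i := range node.Status.Conditions {
		if node.Status.Conditions[i].Type == "Ready" {
			node.Status.Conditions[i].Status = "Unknown"
			node.Status.Conditions[i].Reason = "NodeStatusUnknown"
		}
	}
	if _, err := cs.CoreV1().Nodes().UpdateStatus(ctx, node, metav1.UpdateOptions{}); err != nil {
		if apierrors.IsConflict(err) {
			// Retryable: someone else updated the Node first.
			return fmt.Errorf("node %s was modified concurrently: %w", name, err)
		}
		return err
	}
	return nil
}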
@zmerlynn added the priority/important-soon, sig/node, and team/control-plane labels on May 24, 2016
@zmerlynn
Member Author

@dchen1107: Can you find someone to maybe look at this or triage further?

@dchen1107 assigned Random-Liu and unassigned dchen1107 on May 24, 2016
@dchen1107 added the kind/bug label on May 24, 2016
@dchen1107 added this to the v1.3 milestone on May 24, 2016
@dchen1107
Member

@Random-Liu Can you take a look at this to make sure the issue is not caused by NodeProblemDetector updating a new NodeCondition?

@Random-Liu
Member

Sure!

@Random-Liu
Member

Random-Liu commented May 24, 2016

@dchen1107 Normally node-problem-detector should not update the node object so frequently.
@zmerlynn Could you paste the output of kubectl describe nodes for one of the nodes here?

@Random-Liu
Member

Random-Liu commented May 24, 2016

@dchen1107 I checked the cluster. There is no NodeProblemDetector running in this cluster. :)
In that case, this looks like a race between the kubelet and the node controller. I'll dig into it more.

@dchen1107
Member

output of kubectl describe nodes?

@Random-Liu
Member

Random-Liu commented May 24, 2016

Here is the result of kubectl describe nodes gke-jenkins-e2e-default-pool-4-25892338-ztv5:

lantaol@gke-51872839970-de36245b0d6cd411088b ~ $ sudo kubectl describe nodes gke-jenkins-e2e-default-pool-4-25892338-ztv5
Name:           gke-jenkins-e2e-default-pool-4-25892338-ztv5
Labels:         beta.kubernetes.io/arch=amd64
            beta.kubernetes.io/instance-type=n1-standard-1
            beta.kubernetes.io/os=linux
            cloud.google.com/gke-nodepool=default-pool-4
            failure-domain.beta.kubernetes.io/region=us-east1
            failure-domain.beta.kubernetes.io/zone=us-east1-a
            kubernetes.io/hostname=gke-jenkins-e2e-default-pool-4-25892338-ztv5
Taints:         <none>
CreationTimestamp:  Tue, 24 May 2016 20:12:08 +0000
Phase:          
Conditions:
  Type          Status  LastHeartbeatTime           LastTransitionTime          Reason              Message
  ----          ------  -----------------           ------------------          ------              -------
  OutOfDisk         False   Tue, 24 May 2016 23:15:06 +0000     Tue, 24 May 2016 20:57:58 +0000     KubeletHasSufficientDisk    kubelet has sufficient disk space available
  MemoryPressure    False   Tue, 24 May 2016 23:15:06 +0000     Tue, 24 May 2016 20:12:08 +0000     KubeletHasSufficientMemory  kubelet has sufficient memory available
  Ready         True    Tue, 24 May 2016 23:15:06 +0000     Tue, 24 May 2016 20:57:58 +0000     KubeletReady            kubelet is posting ready status. WARNING: CPU hardcapping unsupported
Addresses:      10.240.3.38,104.196.50.75
Capacity:
 alpha.kubernetes.io/nvidia-gpu:    0
 cpu:                   1
 memory:                3801020Ki
 pods:                  110
System Info:
 Machine ID:            
 System UUID:           A4B3A092-ECD9-2651-4200-2B7F8832FB12
 Boot ID:           306f3786-f6a7-4452-a187-a6d38650cbe1
 Kernel Version:        3.16.0-4-amd64
 OS Image:          Debian GNU/Linux 7 (wheezy)
 Operating System:      linux
 Architecture:          amd64
 Container Runtime Version: docker://1.9.1
 Kubelet Version:       v1.3.0-alpha.4.438+6e4f494ad0d749
 Kube-Proxy Version:        v1.3.0-alpha.4.438+6e4f494ad0d749
PodCIDR:            10.10.53.0/24
ExternalID:         2804474502546262545
Non-terminated Pods:        (13 in total)
  Namespace         Name                                        CPU Requests    CPU Limits  Memory Requests Memory Limits
  ---------         ----                                        ------------    ----------  --------------- -------------
  e2e-tests-kubelet-1ofyw   cleanup20000-09ec9032-21ef-11e6-b5fa-0242ac11000e-2mw9g             0 (0%)      0 (0%)      0 (0%)      0 (0%)
  e2e-tests-kubelet-1ofyw   cleanup20000-09ec9032-21ef-11e6-b5fa-0242ac11000e-5th8k             0 (0%)      0 (0%)      0 (0%)      0 (0%)
  e2e-tests-kubelet-1ofyw   cleanup20000-09ec9032-21ef-11e6-b5fa-0242ac11000e-axdew             0 (0%)      0 (0%)      0 (0%)      0 (0%)
  e2e-tests-kubelet-1ofyw   cleanup20000-09ec9032-21ef-11e6-b5fa-0242ac11000e-eemie             0 (0%)      0 (0%)      0 (0%)      0 (0%)
  e2e-tests-kubelet-1ofyw   cleanup20000-09ec9032-21ef-11e6-b5fa-0242ac11000e-ezc1o             0 (0%)      0 (0%)      0 (0%)      0 (0%)
  e2e-tests-kubelet-1ofyw   cleanup20000-09ec9032-21ef-11e6-b5fa-0242ac11000e-f1eex             0 (0%)      0 (0%)      0 (0%)      0 (0%)
  e2e-tests-kubelet-1ofyw   cleanup20000-09ec9032-21ef-11e6-b5fa-0242ac11000e-i9a5b             0 (0%)      0 (0%)      0 (0%)      0 (0%)
  e2e-tests-kubelet-1ofyw   cleanup20000-09ec9032-21ef-11e6-b5fa-0242ac11000e-oc497             0 (0%)      0 (0%)      0 (0%)      0 (0%)
  e2e-tests-kubelet-1ofyw   cleanup20000-09ec9032-21ef-11e6-b5fa-0242ac11000e-w3ebg             0 (0%)      0 (0%)      0 (0%)      0 (0%)
  e2e-tests-kubelet-1ofyw   cleanup20000-09ec9032-21ef-11e6-b5fa-0242ac11000e-ye5vy             0 (0%)      0 (0%)      0 (0%)      0 (0%)
  e2e-tests-kubelet-1ofyw   cleanup20000-09ec9032-21ef-11e6-b5fa-0242ac11000e-z0pr8             0 (0%)      0 (0%)      0 (0%)      0 (0%)
  kube-system           fluentd-cloud-logging-gke-jenkins-e2e-default-pool-4-25892338-ztv5      80m (8%)    0 (0%)      200Mi (5%)  200Mi (5%)
  kube-system           kube-proxy-gke-jenkins-e2e-default-pool-4-25892338-ztv5             20m (2%)    0 (0%)      0 (0%)      0 (0%)
Allocated resources:
  (Total limits may be over 100 percent, i.e., overcommitted. More info: http://releases.k8s.io/HEAD/docs/user-guide/compute-resources.md)
  CPU Requests  CPU Limits  Memory Requests Memory Limits
  ------------  ----------  --------------- -------------
  100m (10%)    0 (0%)      200Mi (5%)  200Mi (5%)
No events.

We can see that the transition time of Ready and OutOfDisk is around 20:57:58, and the transition time of MemoryPressure is around 20:12:08.

Then in the controller log, the first race happens at 21:29:15.

E0524 21:29:15.400731      19 nodecontroller.go:856] Error updating node gke-jenkins-e2e-default-pool-0-44232fbd-2hnk: Operation cannot be fulfilled on nodes "gke-jenkins-e2e-default-pool-0-44232fbd-2hnk": the object has been modified; please apply your changes to the latest version and try again

The last one happens at 21:51:11:

E0524 21:51:11.363353      19 nodecontroller.go:856] Error updating node gke-jenkins-e2e-default-pool-4-25892338-zztn: Operation cannot be fulfilled on nodes "gke-jenkins-e2e-default-pool-4-25892338-zztn": the object has been modified; please apply your changes to the latest version and try again

@dchen1107
Member

@zmerlynn Can we have related Kubelet logs?

@zmerlynn
Member Author

I gave @Random-Liu access.

@dchen1107
Member

Looks like we can rule out two potential causes based on the above output: 1) NodeProblemDetector 2) Out-of-Resource condition updates. Looks like this is a specific failure caused by those hollow nodes. cc/ @wojtek-t

@dchen1107
Member

cc/ @lavalamp, who just mentioned that the Kubemark test was disabled this morning due to some failure. @zmerlynn, you might be running into the same issue?

@zmerlynn
Member Author

What do you mean by hollow nodes?

@zmerlynn
Member Author

(This is a real 2k node cluster.)

@Random-Liu
Member

Random-Liu commented May 25, 2016

@zmerlynn Sorry, I misunderstood what you said. I thought you meant the "kubernetes e2e scalability test".

@lavalamp
Member

@zmerlynn This doesn't look like a node team problem; it's a nodecontroller problem, and nodecontroller is owned by the control plane.

node controller clearly needs to do something in a retry loop.

@mikedanese @gmarek <--- people who know something about node controller.
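
A common way to do that (a sketch only, using today's client-go retry helper, not necessarily the fix that eventually landed here) is to wrap the read-modify-write in retry.RetryOnConflict, which re-reads the object and reapplies the change whenever the apiserver returns a conflict:

package example

import (
	"context"

	corev1 "k8s.io/api/core/v1"
	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
	"k8s.io/client-go/kubernetes"
	"k8s.io/client-go/util/retry"
)

// updateNodeStatusWithRetry re-reads the Node and reapplies mutate() whenever
// the apiserver rejects the write with 409 Conflict, instead of logging an
// error and moving on the way the log above shows.
func updateNodeStatusWithRetry(ctx context.Context, cs kubernetes.Interface, name string, mutate func(*corev1.Node)) error {
	return retry.RetryOnConflict(retry.DefaultRetry, func() error {
		node, err := cs.CoreV1().Nodes().Get(ctx, name, metav1.GetOptions{})
		if err != nil {
			return err
		}
		mutate(node)
		_, err = cs.CoreV1().Nodes().UpdateStatus(ctx, node, metav1.UpdateOptions{})
		return err
	})
}

retry.DefaultRetry retries a handful of times with short pauses, which is usually enough to absorb this kind of transient contention.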

@lavalamp
Member

@Random-Liu the only way this is your bug is if your component does many many writes to the node status on startup.

@zmerlynn
Member Author

@lavalamp - I took a fast stab and asked for triage, sorry. To be fair, there's about 2k nodes posting status, but yes, that's probably the controller falling over, not the node team's fault.

@wojtek-t
Member

But IIUC, this should be fixed by #26207. Or am I missing something?

@Random-Liu
Member

Random-Liu commented May 25, 2016

the only way this is your bug is if your component does many many writes to the node status on startup.

@lavalamp There is no node-problem-detector running in this cluster, but as you said, node-problem-detector should also avoid updating node status at the same time when the cluster starts up.

@zmerlynn I checked the cluster yesterday. It's weird that all the nodes started at 20:12:00, but the apiserver started at 21:27:00. After the apiserver started, all the nodes were trying to update their node statuses at the same time, and the nodecontroller also started complaining about the update error at 21:29:15. After 21:51:11, there is no such error in the nodecontroller any more.
A wild guess: this is just a temporary error that occurs when the kubelet and the node controller try to update the same node status at the same time. There are tons of log entries in a short period of time just because there are so many nodes updating statuses when the control plane starts up.

But IIUC, this should be fixed by #26207. Or am I missing something?

It looks like #26207 doesn't solve this one. This one is a real update failure, not an incorrect log. :)
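
Another way to make these startup collisions mostly harmless (again only a sketch, not what the controllers did at the time) is to PATCH just the fields being changed instead of PUTting the whole object; a patch does not carry a resourceVersion, so concurrent kubelet heartbeats no longer produce 409s. Node status conditions are strategically merged by their "type" field:

package example

import (
	"context"
	"fmt"

	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
	"k8s.io/apimachinery/pkg/types"
	"k8s.io/client-go/kubernetes"
)

// patchReadyCondition merges a single condition into node.status. Because
// NodeStatus.Conditions uses "type" as its merge key, only the Ready entry is
// touched and the kubelet's concurrent heartbeat writes are left alone.
func patchReadyCondition(ctx context.Context, cs kubernetes.Interface, name, status, reason string) error {
	patch := []byte(fmt.Sprintf(
		`{"status":{"conditions":[{"type":"Ready","status":%q,"reason":%q}]}}`,
		status, reason))
	_, err := cs.CoreV1().Nodes().Patch(ctx, name,
		types.StrategicMergePatchType, patch, metav1.PatchOptions{}, "status")
	return err
}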

@zmerlynn
Member Author

@Random-Liu: That time delta is really odd. The master is sometimes slow to create compared to the individual IGMs on GKE, but not the nodes themselves, which shouldn't even be that uniform.

@lavalamp
Member

I am suspicious of the time delta-- is one of those from a source that doesn't include daylight savings or something?

@Random-Liu
Member

Random-Liu commented May 25, 2016

I am suspicious of the time delta-- is one of those from a source that doesn't include daylight savings or something?

The time on the node:

lantaol@gke-jenkins-e2e-default-pool-4-25892338-zztn:~$ date
Wed May 25 19:51:00 UTC 2016

The time on the master:

lantaol@gke-51872839970-de36245b0d6cd411088b /var/log/containers $ date
Wed May 25 19:51:22 UTC 2016

The creation timestamp of node gke-jenkins-e2e-default-pool-4-25892338-zztn:

Tue, 24 May 2016 20:14:37 +0000

The first log in kube-apiserver.log on the master gke-51872839970-de36245b0d6cd411088b:

I0524 21:27:05.295795      20 genericapiserver.go:604] Will report 104.196.50.83 as public IP address.

@davidopp
Member

Is this kubernetes/node-problem-detector#9?

@Random-Liu
Member

@davidopp Nope, node-problem-detector is not running in the cluster.

@lavalamp assigned davidopp and unassigned Random-Liu on May 26, 2016
@lavalamp
Member

I'm giving this to @davidopp to delegate-- I can't see any reason why it should be @Random-Liu's problem. :)

@davidopp
Member

davidopp commented Jun 3, 2016

@gmarek do you think you could take a look at this?

@gmarek
Contributor

gmarek commented Jun 3, 2016

I'll look into it on Monday.

@davidopp
Member

davidopp commented Jun 3, 2016

Thanks!

@gmarek
Contributor

gmarek commented Jun 6, 2016

I have logs from the last enormous-cluster run. I expect that we're throttling something on the client side, but I need to confirm.

@gmarek
Contributor

gmarek commented Jun 6, 2016

Though I see the problem in the route-controller, not the node-controller itself.

@wojtek-t
Member

wojtek-t commented Jun 6, 2016

This was observed 2 weeks ago. We've made a bunch of changes since then, especially in the route-controller. Maybe this is no longer an issue. Did we check it?

@gmarek
Contributor

gmarek commented Jun 6, 2016

I'm looking at the last enormous cluster run from June 3rd.

@gmarek
Contributor

gmarek commented Jun 6, 2016

I looked at the apiserver logs and it looks like it's working as intended (more or less) - the conflicts I was able to find were caused by StatusUpdates sent by Kubelets.

This shouldn't be too surprising in 2k Node clusters.

@zmerlynn - what was the result of this behavior? NodeStatuses weren't up to date?

@zmerlynn
Member Author

zmerlynn commented Jun 6, 2016

At the time, it looked like the nodecontroller was flapping and just not getting around to updating node state.

I'm happy to close this for now, though. We've certainly changed enough and I also haven't seen it.

@zmerlynn closed this as completed on Jun 6, 2016