node not exist failure during node status update flush controller's log #30898

jingxu97 · 2016-08-18T16:39:36Z

@saad-ali @matchstick
From issue #29903, I noticed that under attachdetach controller, reconciler keeps updating node status every 100ms. In some cases, the node no longer exists but reconciler still has a stale cache information of it. The error message quickly flush the kube-controller-manager's log.

Quick fix: Lower the level of this log so by default it won't show up
Right fix: have a good comprehensive understanding of all scenarios especially when certain components in the system fail/exit, how the volume manager should react/recover from them.

Related code and PR: https://github.com/kubernetes/kubernetes/blob/master/pkg/controller/volume/attachdetach/reconciler/reconciler.go#L104
#29358
#30737

saad-ali · 2016-08-18T18:49:01Z

Lowering the priority is non-ideal but should be fine for now

dims · 2016-11-15T19:41:52Z

switching this to 1.6 as it's too late for 1.5. ok? (please switch it right back if you disagree)

ethernetdan · 2017-03-13T22:23:04Z

Moving to 1.7 as late to happen in 1.6. Feel free to switch back if this is incorrect.

saad-ali · 2017-06-07T15:44:10Z

Fixed by PRs for k8s 1.7:

jingxu97 added the sig/storage Categorizes an issue or PR as relevant to SIG Storage. label Aug 18, 2016

jingxu97 added this to the v1.4 milestone Aug 18, 2016

jingxu97 self-assigned this Aug 18, 2016

k8s-github-robot added area/kubelet team/cluster labels Aug 18, 2016

jingxu97 changed the title ~~node status failed log flush kube-controller-manager.log if node no longer exist~~ node not exist failure during node status update should not block other nodes update Aug 18, 2016

jingxu97 changed the title ~~node not exist failure during node status update should not block other nodes update~~ node not exist failure during node status update flush controller's log Aug 18, 2016

jingxu97 mentioned this issue Aug 18, 2016

Avoid failure message flush log when node no longer exist #30903

Merged

goltermann added the priority/backlog Higher priority than priority/awaiting-more-evidence. label Sep 6, 2016

goltermann modified the milestones: v1.5, v1.4 Sep 6, 2016

dims modified the milestones: v1.6, v1.5 Nov 15, 2016

ethernetdan modified the milestones: v1.7, v1.6 Mar 13, 2017

saad-ali closed this as completed Jun 7, 2017

saad-ali removed the team/cluster (deprecated - do not use) label Jun 7, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

node not exist failure during node status update flush controller's log #30898

node not exist failure during node status update flush controller's log #30898

jingxu97 commented Aug 18, 2016

saad-ali commented Aug 18, 2016

dims commented Nov 15, 2016

ethernetdan commented Mar 13, 2017

saad-ali commented Jun 7, 2017

node not exist failure during node status update flush controller's log #30898

node not exist failure during node status update flush controller's log #30898

Comments

jingxu97 commented Aug 18, 2016

saad-ali commented Aug 18, 2016

dims commented Nov 15, 2016

ethernetdan commented Mar 13, 2017

saad-ali commented Jun 7, 2017