kubeadm: Don't drain and remove the current node on kubeadm reset #42713

luxas · 2017-03-08T06:37:07Z

What this PR does / why we need it:

Which issue this PR fixes (optional, in fixes #<issue number>(, fixes #<issue_number>, ...) format, will close that issue when PR gets merged): fixes #

Special notes for your reviewer:

In v1.5, kubeadm reset would drain your node and remove it from your cluster if you specified, but now in v1.6 we can't do that due to the RBAC rules we have set up.

After conversations with @liggitt, I also agree this functionality was somehow a little mis-placed (though still very convenient to use), so we're removing it for v1.6.

It's the system administrator's duty to drain and remove nodes from the cluster, not the nodes' responsibility.

The current behavior is therefore a bug that needs to be fixed in v1.6

Release note:

kubeadm: `kubeadm reset` won't drain and remove the current node anymore

@liggitt @deads2k @jbeda @dmmcquay @pires @errordeveloper

k8s-reviewable · 2017-03-08T06:37:14Z

This change is

luxas · 2017-03-08T06:39:13Z

Sorry, my intention was not to unassign @pires, that was a race condition in the Github page

luxas · 2017-03-08T07:53:29Z

@k8s-bot unit test this

pires

/lgtm

liggitt · 2017-03-08T15:11:33Z

cmd/kubeadm/app/cmd/reset.go

 	}, nil
 }

 // Run reverts any changes made to this host by "kubeadm init" or "kubeadm join".
 func (r *Reset) Run(out io.Writer) error {

-	// Try to drain and remove the node from the cluster
-	err := drainAndRemoveNode(r.removeNode)


would you want to mark it unschedulable or taint it instead? seems like it should do something to signal that no more pods should be sent

I could do it, but I think the node will be marked NotReady by controller manager by default, right?
If we have to set the NodeNotReady taint I'll do it, but I'd like to change the minimum amount of things in this PR

jbeda · 2017-03-13T20:34:05Z

Haven't reviewed the code but...

Ideally we would do the following:

Detect if the node is drained. If it is not, then error out with instructions on what to run with admin privs.
Provide a --force flag for reseting the node without draining.

Not sure we can bring this together in time for 1.6.

luxas · 2017-03-13T21:02:24Z

@jbeda I think we should do that long-term, but it won't fit into this release.
I think this is the best we can do, just revert to the earlier behavior.

liggitt · 2017-03-14T20:53:45Z

If this was previous behavior, I'm ok with reverting to it. admins should drain their node first, but if they don't, the nodecontroller will take care of the node when it stops updating status.

jbeda · 2017-03-14T20:54:37Z

Not excited about this as it is a regression in functionality but I guess it is the best we can do on short notice.

/lgtm

luxas · 2017-03-14T20:59:01Z

Not excited about this as it is a regression in functionality but I guess it is the best we can do on short notice.

Feel exactly the same but this is better than erroring in 100 % of the cases.

Root cause it (as discussed) that with RBAC a node can't drain itself.
We should document the best practices around removing a node in the docs though. @lukemarsden

luxas · 2017-03-14T21:04:52Z

But thanks for the LGTM!

k8s-github-robot · 2017-03-14T21:17:06Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: jbeda, luxas

Needs approval from an approver in each of these OWNERS Files:

~~cmd/kubeadm/OWNERS~~ [jbeda,luxas]

You can indicate your approval by writing /approve in a comment
You can cancel your approval by writing /approve cancel in a comment

k8s-ci-robot · 2017-03-14T22:01:31Z

@luxas: The following test(s) failed:

Test name	Commit	Details	Rerun command
Jenkins non-CRI GCE e2e	`c7fc530`	link	`@k8s-bot non-cri e2e test this`

Full PR test history. Your PR dashboard. Please help us cut down on flakes by linking to an open issue when you hit one in your PR.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. I understand the commands that are listed here.

k8s-github-robot · 2017-03-14T22:59:19Z

Automatic merge from submit-queue (batch tested with PRs 43018, 42713)

k8s-ci-robot added the cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. label Mar 8, 2017

k8s-github-robot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Mar 8, 2017

k8s-github-robot assigned pires Mar 8, 2017

k8s-github-robot added size/M Denotes a PR that changes 30-99 lines, ignoring generated files. release-note Denotes a PR that will be considered when it comes time to generate release notes. labels Mar 8, 2017

luxas assigned liggitt and unassigned pires Mar 8, 2017

luxas added the kind/bug Categorizes issue or PR as related to a bug. label Mar 8, 2017

luxas added this to the v1.6 milestone Mar 8, 2017

luxas assigned pires Mar 8, 2017

kubeadm: Don't drain and remove the current node on kubeadm reset

c7fc530

luxas force-pushed the kubeadm_fix_reset branch from b803069 to c7fc530 Compare March 8, 2017 07:31

pires approved these changes Mar 8, 2017

View reviewed changes

liggitt reviewed Mar 8, 2017

View reviewed changes

jbeda self-assigned this Mar 14, 2017

k8s-ci-robot added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Mar 14, 2017

k8s-github-robot merged commit 5826b09 into kubernetes:master Mar 14, 2017

bboreham mentioned this pull request Apr 26, 2017

Pod or Job lifecycle lacks of a mechanism to define cleanup actions once you delete a pod #35183

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

kubeadm: Don't drain and remove the current node on kubeadm reset #42713

kubeadm: Don't drain and remove the current node on kubeadm reset #42713

luxas commented Mar 8, 2017 •

edited

Loading

k8s-reviewable commented Mar 8, 2017

luxas commented Mar 8, 2017

luxas commented Mar 8, 2017

pires left a comment

liggitt Mar 8, 2017

luxas Mar 13, 2017

jbeda commented Mar 13, 2017

luxas commented Mar 13, 2017

liggitt commented Mar 14, 2017

jbeda commented Mar 14, 2017

luxas commented Mar 14, 2017

luxas commented Mar 14, 2017

k8s-github-robot commented Mar 14, 2017

k8s-ci-robot commented Mar 14, 2017 •

edited

Loading

k8s-github-robot commented Mar 14, 2017

kubeadm: Don't drain and remove the current node on kubeadm reset #42713

kubeadm: Don't drain and remove the current node on kubeadm reset #42713

Conversation

luxas commented Mar 8, 2017 • edited Loading

k8s-reviewable commented Mar 8, 2017

luxas commented Mar 8, 2017

luxas commented Mar 8, 2017

pires left a comment

Choose a reason for hiding this comment

liggitt Mar 8, 2017

Choose a reason for hiding this comment

luxas Mar 13, 2017

Choose a reason for hiding this comment

jbeda commented Mar 13, 2017

luxas commented Mar 13, 2017

liggitt commented Mar 14, 2017

jbeda commented Mar 14, 2017

luxas commented Mar 14, 2017

luxas commented Mar 14, 2017

k8s-github-robot commented Mar 14, 2017

k8s-ci-robot commented Mar 14, 2017 • edited Loading

k8s-github-robot commented Mar 14, 2017

luxas commented Mar 8, 2017 •

edited

Loading

k8s-ci-robot commented Mar 14, 2017 •

edited

Loading