Fix controller-manager race condition issue which cause endpoints flush during restart #23035

xinxiaogang · 2016-03-16T08:49:19Z

Fix: Wait for endpoints to become available instead of assuming there are no endpoints. (On kube-controller-manager restart.)

k8s-bot · 2016-03-16T08:50:13Z

Can one of the admins verify that this patch is reasonable to test? (reply "ok to test", or if you trust the user, reply "add to whitelist")

This message may repeat a few times in short succession due to jenkinsci/ghprb-plugin#292. Sorry.

Otherwise, if this message is too spammy, please complain to ixdy.

k8s-bot · 2016-03-16T08:50:49Z

Can one of the admins verify that this patch is reasonable to test? (reply "ok to test", or if you trust the user, reply "add to whitelist")

This message may repeat a few times in short succession due to jenkinsci/ghprb-plugin#292. Sorry.

Otherwise, if this message is too spammy, please complain to ixdy.

k8s-bot · 2016-03-16T08:51:10Z

Can one of the admins verify that this patch is reasonable to test? (reply "ok to test", or if you trust the user, reply "add to whitelist")

This message may repeat a few times in short succession due to jenkinsci/ghprb-plugin#292. Sorry.

Otherwise, if this message is too spammy, please complain to ixdy.

k8s-github-robot · 2016-03-16T09:01:07Z

Labelling this PR as size/S

bprashanth · 2016-03-16T16:58:51Z

pkg/controller/endpoint/endpoints_controller.go

@@ -305,6 +305,21 @@ func (e *EndpointController) syncService(key string) {
 		e.queue.Add(key) // Retry
 		return
 	}
+	if len(pods.Items) == 0 {


Can you do https://github.com/kubernetes/kubernetes/blob/master/pkg/controller/replication/replication_controller.go#L500 instead?

+1, that's the correct fix.

@bprashanth Good suggestion. Updated and please review again.

bprashanth · 2016-03-16T16:59:55Z

@kubernetes/sig-api-machinery

lavalamp · 2016-03-16T20:50:28Z

This looks like a good thing to fix, just we should fix by waiting for the pod store to sync instead of by listing every pod. I think we should consider this for a cherry-pick. (Problem statement: you restart controller-manager and all of your endpoints briefly go away.)

lavalamp · 2016-03-16T20:51:08Z

(Not urgent to get into 1.2.0, too late anyway. For 1.2.1.)

…use endpoints flush during restart

k8s-github-robot · 2016-03-17T03:25:02Z

Labelling this PR as size/M

bprashanth · 2016-03-17T17:08:32Z

Thanks, LGTM

k8s-github-robot · 2016-03-17T17:15:28Z

@k8s-bot ok to test
@k8s-bot test this

pr builder appears to be missing, activating due to 'lgtm' label.

k8s-bot · 2016-03-17T17:53:07Z

GCE e2e build/test passed for commit f5c631e.

k8s-github-robot · 2016-03-17T18:06:52Z

The author of this PR is not in the whitelist for merge, can one of the admins add the 'ok-to-merge' label?

k8s-github-robot · 2016-03-17T18:24:11Z

@k8s-bot test this [submit-queue is verifying that this PR is safe to merge]

k8s-bot · 2016-03-17T19:01:41Z

GCE e2e build/test passed for commit f5c631e.

k8s-github-robot · 2016-03-17T19:05:56Z

Automatic merge from submit-queue

Auto commit by PR queue bot

lavalamp · 2016-03-17T22:33:39Z

Thanks for this! I took the liberty of editing your initial comment so it'll be clear to the people evaluating the cherry-pick what this does. :)

Auto commit by PR queue bot (cherry picked from commit cda4583)

k8s-cherrypick-bot · 2016-03-24T22:04:04Z

Commit d71c1b9 found in the "release-1.2" branch appears to be this PR. Removing the "cherrypick-candidate" label. If this s an error find help to get your PR picked.

Auto commit by PR queue bot (cherry picked from commit cda4583)

googlebot added the cla: yes label Mar 16, 2016

k8s-github-robot assigned bprashanth Mar 16, 2016

k8s-github-robot added the size/S Denotes a PR that changes 10-29 lines, ignoring generated files. label Mar 16, 2016

xinxiaogang force-pushed the xnxin-master branch from a0734a2 to c6588fa Compare March 16, 2016 10:25

bprashanth reviewed Mar 16, 2016
View reviewed changes

lavalamp added the cherrypick-candidate label Mar 16, 2016

lavalamp added this to the v1.2 milestone Mar 16, 2016

smarterclayton mentioned this pull request Mar 16, 2016

Finalize Kube items 3.2 openshift/origin#6766

Closed

85 tasks

kubernetes#23034 Fix controller-manager race condition issue which ca…

f5c631e

…use endpoints flush during restart

xinxiaogang force-pushed the xnxin-master branch from 8d3c804 to f5c631e Compare March 17, 2016 03:19

k8s-github-robot added size/M Denotes a PR that changes 30-99 lines, ignoring generated files. and removed size/S Denotes a PR that changes 10-29 lines, ignoring generated files. labels Mar 17, 2016

bprashanth added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Mar 17, 2016

k8s-github-robot added the needs-ok-to-merge label Mar 17, 2016

wojtek-t added ok-to-merge and removed needs-ok-to-merge labels Mar 17, 2016

k8s-github-robot pushed a commit that referenced this pull request Mar 17, 2016

Merge pull request #23035 from xinxiaogang/xnxin-master

cda4583

Auto commit by PR queue bot

k8s-github-robot merged commit cda4583 into kubernetes:master Mar 17, 2016

xinxiaogang deleted the xnxin-master branch March 18, 2016 01:11

bgrant0607 added the cherry-pick-approved Indicates a cherry-pick PR into a release branch has been approved by the release branch manager. label Mar 23, 2016

eparis pushed a commit to eparis/kubernetes that referenced this pull request Mar 24, 2016

Merge pull request kubernetes#23035 from xinxiaogang/xnxin-master

d71c1b9

Auto commit by PR queue bot (cherry picked from commit cda4583)

eparis mentioned this pull request Mar 24, 2016

Cherry-picks for release 1.2 - 24 Mar #23450

Merged

bgrant0607 added release-note Denotes a PR that will be considered when it comes time to generate release notes. and removed release-note Denotes a PR that will be considered when it comes time to generate release notes. labels Mar 24, 2016

k8s-cherrypick-bot removed the cherrypick-candidate label Mar 24, 2016

AlainRoy pushed a commit to vmware-archive/kubernetes-archived that referenced this pull request Mar 29, 2016

Merge pull request kubernetes#23035 from xinxiaogang/xnxin-master

19e8cd5

Auto commit by PR queue bot (cherry picked from commit cda4583)

alena1108 pushed a commit to rancher/kubernetes that referenced this pull request May 20, 2016

Merge pull request kubernetes#23035 from xinxiaogang/xnxin-master

dca9a50

Auto commit by PR queue bot (cherry picked from commit cda4583)

david-mcmahon changed the title ~~kubernetes/kubernetes#23034 Fix controller-manager race condition issue which cause endpoints flush during restart~~ Fix controller-manager race condition issue which cause endpoints flush during restart Jul 1, 2016

shyamjvs pushed a commit to shyamjvs/kubernetes that referenced this pull request Dec 1, 2016

Merge pull request kubernetes#23035 from xinxiaogang/xnxin-master

9abf10c

Auto commit by PR queue bot (cherry picked from commit cda4583)

shouhong pushed a commit to shouhong/kubernetes that referenced this pull request Feb 14, 2017

Merge pull request kubernetes#23035 from xinxiaogang/xnxin-master

4647480

Auto commit by PR queue bot (cherry picked from commit cda4583)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix controller-manager race condition issue which cause endpoints flush during restart #23035

Fix controller-manager race condition issue which cause endpoints flush during restart #23035

xinxiaogang commented Mar 16, 2016

k8s-bot commented Mar 16, 2016

k8s-bot commented Mar 16, 2016

k8s-bot commented Mar 16, 2016

k8s-github-robot commented Mar 16, 2016

bprashanth Mar 16, 2016

lavalamp Mar 16, 2016

xinxiaogang Mar 17, 2016

bprashanth commented Mar 16, 2016

lavalamp commented Mar 16, 2016

lavalamp commented Mar 16, 2016

k8s-github-robot commented Mar 17, 2016

bprashanth commented Mar 17, 2016

k8s-github-robot commented Mar 17, 2016

k8s-bot commented Mar 17, 2016

k8s-github-robot commented Mar 17, 2016

k8s-github-robot commented Mar 17, 2016

k8s-bot commented Mar 17, 2016

k8s-github-robot commented Mar 17, 2016

lavalamp commented Mar 17, 2016

k8s-cherrypick-bot commented Mar 24, 2016

Fix controller-manager race condition issue which cause endpoints flush during restart #23035

Fix controller-manager race condition issue which cause endpoints flush during restart #23035

Conversation

xinxiaogang commented Mar 16, 2016

k8s-bot commented Mar 16, 2016

k8s-bot commented Mar 16, 2016

k8s-bot commented Mar 16, 2016

k8s-github-robot commented Mar 16, 2016

bprashanth Mar 16, 2016

Choose a reason for hiding this comment

lavalamp Mar 16, 2016

Choose a reason for hiding this comment

xinxiaogang Mar 17, 2016

Choose a reason for hiding this comment

bprashanth commented Mar 16, 2016

lavalamp commented Mar 16, 2016

lavalamp commented Mar 16, 2016

k8s-github-robot commented Mar 17, 2016

bprashanth commented Mar 17, 2016

k8s-github-robot commented Mar 17, 2016

k8s-bot commented Mar 17, 2016

k8s-github-robot commented Mar 17, 2016

k8s-github-robot commented Mar 17, 2016

k8s-bot commented Mar 17, 2016

k8s-github-robot commented Mar 17, 2016

lavalamp commented Mar 17, 2016

k8s-cherrypick-bot commented Mar 24, 2016