Improve replication controller manager performance #5884
@bgrant0607 can you remind me which of these you were telling me yesterday you thought was the most important (or maybe it's something that isn't on the list)? I've forgotten.
There's also an alternative to 3, which is to list all pods once per tick instead of once per controller per tick. Feels like we're close enough to the watch-pod-status solution that we should just go for that (we default to using the podcache today, but it's feature flagged: https://github.com/GoogleCloudPlatform/kubernetes/blob/master/pkg/master/master.go#L384).
Whether indexed or not, we shouldn't be doing a list of all matching pods per replication controller. Whether watching pod status or listing pods once per tick, we need to move to a model where the appropriate replication controller's count is updated based on the existence (or deletion) of a particular pod. @fgrzadkowski Is posting of pod status from kubelet really conditional?
@bprashanth is right. Use of pod_cache is conditional, but kubelet already updates pod status when it changes (PR #5714). I've already sent PR #5854 to delete pod_cache completely.
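The incremental model described above (update a controller's count from individual pod create/delete events rather than relisting) can be sketched roughly as follows. This is a toy Go model with hypothetical names (`PodEvent`, `applyEvent`), not the actual controller-manager code.

```go
package main

import "fmt"

// PodEvent is a hypothetical, simplified watch event: a pod was created or deleted.
type PodEvent struct {
	Labels  map[string]string
	Deleted bool // true for a deletion, false for a creation
}

// matches reports whether a controller's label selector matches a pod's labels.
func matches(selector, labels map[string]string) bool {
	for k, v := range selector {
		if labels[k] != v {
			return false
		}
	}
	return true
}

// applyEvent incrementally adjusts the replica count of every controller whose
// selector matches the pod, instead of re-listing all matching pods per rc.
func applyEvent(counts map[string]int, selectors map[string]map[string]string, ev PodEvent) {
	for name, sel := range selectors {
		if !matches(sel, ev.Labels) {
			continue
		}
		if ev.Deleted {
			counts[name]--
		} else {
			counts[name]++
		}
	}
}

func main() {
	selectors := map[string]map[string]string{"frontend": {"app": "frontend"}}
	counts := map[string]int{"frontend": 0}
	// Two creations followed by one deletion leaves a net count of 1.
	applyEvent(counts, selectors, PodEvent{Labels: map[string]string{"app": "frontend"}})
	applyEvent(counts, selectors, PodEvent{Labels: map[string]string{"app": "frontend"}})
	applyEvent(counts, selectors, PodEvent{Labels: map[string]string{"app": "frontend"}, Deleted: true})
	fmt.Println(counts["frontend"]) // prints 1
}
```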
A strawman for the third problem:
Avoiding the periodic sync of the controller manager will make the system incremental, and thereby more sensitive to things like network flakes.
I already started working on this, so assigning to myself.
So am I (I'd just forgotten to assign it to myself); I'm just waiting on @lavalamp's reflector framework to start on this :) Though if you really want this, go ahead and I can find something else, or the other way around. Edit: I had assigned it to myself previously, so just taking it back now.
I hoped to finish this today, but didn't make it. Since I'll be OOO until -- sent from a mobile device
Yeah, I'm doing it. The controller framework mentioned isn't merged yet (#6546), so unless there's some urgency to get this in, we might as well work on different things and wait till the framework is in to reduce churn.
@bprashanth What's the ETA for this change? Without this change it's hard for me to move forward with profiling bottlenecks for large clusters.
@fgrzadkowski got sidetracked writing benchmarks to prove this isn't going to have a negative impact on perf. I have a PR ready and will upload it tomorrow (at least for the pod part; it seems like the controller framework still has some kinks that need ironing out).
@bprashanth what is the current status of this PR?
@bprashanth - is it fixed now?
Yep.
This is an umbrella issue to address replication controller/manager performance that might matter for 1.0. In broad strokes:

1. `stop` is limited by a 3s polling interval. This needs to be a watch. We cannot currently watch specific fields of an rc (Make the status.Replica count useful to watchers #5745). We could also watch a subresource endpoint (Create subresource end points for replication controllers #4909).
2. `status.Replicas` … reflects this
3. `stop` … notices anything

@lavalamp thoughts (on specifically the last one, since it overlaps with your controller PR)?
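The polling-vs-watch distinction for `stop` can be sketched as follows. This is a toy model, not the real client API: `waitPolling`, `waitWatching`, and the channel of replica counts are all hypothetical.

```go
package main

import (
	"fmt"
	"time"
)

// waitPolling mirrors the current behavior: re-fetch the rc's replica count
// every interval until it reaches the desired value.
func waitPolling(getReplicas func() int, want int, interval time.Duration) {
	for getReplicas() != want {
		time.Sleep(interval) // e.g. the 3s polling interval
	}
}

// waitWatching is the watch-based alternative: block on a stream of replica
// count updates and return as soon as the desired count is observed.
// It returns the number of updates it consumed before matching.
func waitWatching(updates <-chan int, want int) int {
	n := 0
	for got := range updates {
		n++
		if got == want {
			return n
		}
	}
	return n
}

func main() {
	// Simulated status updates as an rc is scaled down to zero.
	updates := make(chan int, 3)
	updates <- 2
	updates <- 1
	updates <- 0
	close(updates)
	fmt.Println("updates consumed:", waitWatching(updates, 0)) // prints: updates consumed: 3
}
```

The watch version returns as soon as the matching update arrives, instead of sleeping out the rest of a fixed polling interval.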