HPA: fixed wrong count for target replicas calculations. #34955
Conversation
CC @DirectXMan12
Jenkins unit/integration failed for commit 47451cb. Full PR test history. The magic incantation to run this job again is
One small request on a comment (it should reference a bug number so that future readers can look up the problem), but otherwise LGTM 👍
@@ -329,6 +334,12 @@ func (a *HorizontalController) reconcileAutoscaler(hpa *autoscaling.HorizontalPo
    if desiredReplicas > hpa.Spec.MaxReplicas {
        desiredReplicas = hpa.Spec.MaxReplicas
    }

    // Do not upscale too much to prevent incorrect rapid increase of the number of master replicas caused by
    // bogus CPU usage report from heapster/kubelet.
can you add a bug number here so that future readers recognize what we're talking about (I'm assuming this is to avoid the persistent issues with overflow/astronomically high CPU)?
done
    }

    return currentReplicas, &utilization, timestamp, nil
    desiredReplicas := math.Ceil(usageRatio * float64(numRunningPods))
this could effectively cause a scale down, even if it looks like a scale up (if desiredReplicas * numRunningPods < (numRunningPods + numPendingPods))? Should we deal with that, or at least log or note it somewhere?
I think it's fine: pods that are not running are not consuming CPU. We are logging the number of running pods in the DesiredReplicasComputed event below, so it is clear what we are doing.
    desiredReplicas := math.Ceil(usageRatio * float64(numRunningPods))

    a.eventRecorder.Eventf(hpa, api.EventTypeNormal, "DesiredReplicasComputed",
        "Computed the desired num of replicas: %d, on a base of %d reports (avgCPUutil: %d)", int32(desiredReplicas), numRunningPods, utilization)
Also add the current number of created (possibly not started) replicas.
done
    // Do not upscale too much to prevent incorrect rapid increase of the number of master replicas caused by
    // bogus CPU usage report from heapster/kubelet.
    if desiredReplicas > int32(math.Max(2.0*float64(currentReplicas), 4.0)) {
Make 2.0 and 4.0 a flag.
done (made consts like other HPA parameters)
    // Do not upscale too much to prevent incorrect rapid increase of the number of master replicas caused by
    // bogus CPU usage report from heapster/kubelet.
    if desiredReplicas > int32(math.Max(2.0*float64(currentReplicas), 4.0)) {
Also put the formula into a variable - no need to calculate it twice (that's room for mistakes).
done
Force-pushed from 47451cb to 678c291
Comments applied, PTAL
@@ -48,8 +48,15 @@ const (

    HpaCustomMetricsTargetAnnotationName = "alpha/target.custom-metrics.podautoscaler.kubernetes.io"
    HpaCustomMetricsStatusAnnotationName = "alpha/status.custom-metrics.podautoscaler.kubernetes.io"

    scaleUpLimitFactor  = 2
    scaleUpLimitMinimum = 1
In the previous version we had 4 here. 1 seems too little: with this setup, if 1 replica is present we could only scale it up to 2. We should be able to get to 3-4 immediately.
sorry, a typo :)
HPA: fixed wrong count for target replicas calculations (kubernetes#34821).
Force-pushed from 678c291 to f495e73
LGTM
Automatic merge from submit-queue
Commit found in the "release-1.4" branch appears to be this PR. Removing the "cherrypick-candidate" label. If this is an error, find help to get your PR picked.
…ck-of-#34955-origin-release-1.4
Automatic merge from submit-queue
Automated cherry pick of kubernetes#34955
Cherry pick of kubernetes#34955 on release-1.4.
kubernetes#34955: HPA: fixed wrong count for target replicas calculations.
HPA: fixed wrong count for target replicas calculations (#34821).