Only system-node-critical pods should be OOM Killed last #99729

ravisantoshgudimetla · 2021-03-03T21:37:30Z

What type of PR is this?

/kind feature
/kind design

What this PR does / why we need it:

Allow only system-node-critical pods to have -997 OOM score

Which issue(s) this PR fixes:

Fixes #99727

Special notes for your reviewer:

Does this PR introduce a user-facing change?

System-cluster-critical pods should not get a low OOM Score. 

Currently, both system-node-critical and system-cluster-critical pods get an OOM score of -997, making them among the last processes to be OOM killed. By definition, system-cluster-critical pods can be scheduled elsewhere if there is a resource crunch on the node, whereas system-node-critical pods cannot be rescheduled. This is why system-node-critical has a higher priority value than system-cluster-critical. This change allows only the system-node-critical priority class to have a low OOM score.

action required
If a pod with the system-cluster-critical priority class must remain among the last to be OOM killed, its priority class has to be changed to system-node-critical to preserve the existing behavior.
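
The before and after behavior described in this release note can be sketched roughly as follows. This is a simplified model, not the kubelet code: the function names and the `qosScore` fallback are hypothetical, and only the two priority values (2000000000 for system-cluster-critical, 2000001000 for system-node-critical) and the -997 score come from Kubernetes itself.

```go
package main

import "fmt"

// Priority values of the two built-in system priority classes.
const (
	systemClusterCritical int32 = 2000000000
	systemNodeCritical    int32 = 2000001000
)

// criticalOOMScoreAdj is the -997 score discussed in this PR.
const criticalOOMScoreAdj = -997

// lowScoreBefore models the old rule (an assumption for illustration):
// any pod at or above system-cluster-critical priority gets -997.
func lowScoreBefore(priority int32) bool {
	return priority >= systemClusterCritical
}

// lowScoreAfter models the new rule: only system-node-critical pods qualify.
func lowScoreAfter(priority int32) bool {
	return priority >= systemNodeCritical
}

// oomScoreAdj returns -997 for qualifying pods and otherwise falls back to
// qosScore, a hypothetical stand-in for the QoS-derived value.
func oomScoreAdj(priority int32, qosScore int, qualifies func(int32) bool) int {
	if qualifies(priority) {
		return criticalOOMScoreAdj
	}
	return qosScore
}

func main() {
	const qosScore = 500 // hypothetical score for some Burstable pod
	for _, p := range []int32{systemClusterCritical, systemNodeCritical} {
		fmt.Printf("priority=%d before=%d after=%d\n", p,
			oomScoreAdj(p, qosScore, lowScoreBefore),
			oomScoreAdj(p, qosScore, lowScoreAfter))
	}
}
```

In this model a system-cluster-critical pod moves from -997 to whatever its QoS class yields, while a system-node-critical pod keeps -997 in both cases.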

Additional documentation e.g., KEPs (Kubernetes Enhancement Proposals), usage docs, etc.:

cc @sjenning @rphillips @smarterclayton @wking

ravisantoshgudimetla · 2021-03-03T21:37:44Z

/sig node

rphillips · 2021-03-03T22:57:37Z

/priority important-soon
/retest
/triage accepted

rphillips · 2021-03-03T22:58:52Z

/lgtm

smarterclayton · 2021-03-03T23:13:24Z

Before we merge this can we reconstruct the reasoning that led to system-cluster-critical getting this oomscore? It's been long enough that I don't remember everything but there was definitely thought put into it. If we think it's correct to make a change, we need the same people who cared to also weigh in

/hold

ravisantoshgudimetla · 2021-03-04T00:14:18Z

The reasoning was mentioned here. The main goal was a way to map OOMScore to priority of the pod which already expresses criticality of the pod. It was also mentioned that we can change the heuristic later, if we decide to do so.

rphillips · 2021-03-04T15:41:11Z

/assign @sjenning

derekwaynecarr · 2021-03-04T18:31:43Z

I agree that system-node-critical should have oom_score_adj value that differentiates it as more critical than system-cluster-critical, but I also worry this change to treat the system-cluster-critical with the same score as all Burstable QoS tiers isn't ideal either.

If we set system-node-critical to -997 and system-cluster-critical to -996, it seems to have the same effect with less surprise, and still leaves us open to evolve further.
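
A minimal sketch of this tiered alternative, assuming the same priority values as the built-in classes. The function name `tieredOOMScoreAdj` and the `qosScore` fallback are hypothetical and only illustrate the suggestion; they are not taken from the kubelet.

```go
package main

import "fmt"

// Priority values of the built-in system priority classes.
const (
	clusterCriticalPriority int32 = 2000000000
	nodeCriticalPriority    int32 = 2000001000
)

// tieredOOMScoreAdj gives node-critical pods -997 and cluster-critical pods
// -996, so the two stay distinguishable while both remain far below the
// scores of ordinary pods. Other pods get qosScore, a hypothetical stand-in
// for the QoS-derived value.
func tieredOOMScoreAdj(priority int32, qosScore int) int {
	switch {
	case priority >= nodeCriticalPriority:
		return -997
	case priority >= clusterCriticalPriority:
		return -996
	default:
		return qosScore
	}
}

func main() {
	fmt.Println(tieredOOMScoreAdj(nodeCriticalPriority, 500))    // -997
	fmt.Println(tieredOOMScoreAdj(clusterCriticalPriority, 500)) // -996
	fmt.Println(tieredOOMScoreAdj(0, 500))                       // 500
}
```

A lower adjustment makes a process less likely to be picked by the kernel OOM killer, so under this scheme cluster-critical pods would tend to go just before node-critical ones, though the kernel also weighs actual memory use.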

ehashman · 2021-04-05T16:25:00Z

> @ehashman we don't backport features which this clearly is.

Ah, while I know this was tagged as feature, it looked like a bugfix to me. But yeah, I suppose this being "action required:" might make it ineligible for k8s backport.

ehashman · 2021-04-09T00:26:04Z

I think @smarterclayton's comments have been addressed, can we remove the hold?

dims · 2021-04-09T00:37:23Z

/hold cancel

we can debate on back port in the cherry pick :)

fejta-bot · 2021-04-09T04:11:18Z

/retest
This bot automatically retries jobs that failed/flaked on approved PRs (send feedback to fejta).

Review the full test history for this PR.

Silence the bot with an /lgtm cancel or /hold comment for consistent failures.

fejta-bot · 2021-04-09T08:44:19Z

/retest

ravisantoshgudimetla · 2021-04-09T12:33:37Z

Yeah, Clayton's comments are addressed, we can remove the hold. Thanks for doing it @dims

As far as the backport goes, this is a behavioral change, but judging from the implementation history it was a miss in the initial implementation, as @sjenning pointed out, so it is worth backporting. Let me know if you folks think otherwise.

ravisantoshgudimetla · 2021-04-09T12:35:27Z

/cherry-pick release-1.20

ravisantoshgudimetla · 2021-04-09T12:37:28Z

/cherry-pick release-1.21

fejta-bot · 2021-04-09T15:22:48Z

/retest

ehashman · 2021-04-09T17:01:00Z

@ravisantoshgudimetla k8s prow doesn't have cherry-pick functionality. You will need to use the ./hack/cherry_pick_pull.sh tool. More info here.

ravisantoshgudimetla · 2021-04-09T18:45:02Z

Containerd job is failing continuously with the following error:

W0409 15:54:57.780] ERROR: (gcloud.compute.scp) [/usr/bin/scp] exited with return code [1].

Hopefully it passes this time

/retest

ravisantoshgudimetla · 2021-04-09T18:45:49Z

> @ravisantoshgudimetla k8s prow doesn't have cherry-pick functionality. You will need to use the ./hack/cherry_pick_pull.sh tool. More info here.

Thank you @ehashman

It has been a long time since I worked on an upstream cherry-pick. Thank you for the pointer.

fejta-bot · 2021-04-09T21:40:25Z

/retest

fejta-bot · 2021-04-10T00:27:59Z

/retest

fejta-bot · 2021-04-10T03:58:02Z

/retest

ravisantoshgudimetla · 2021-04-10T05:22:29Z

Seems this test is broken:

#100978

/hold

Putting a hold for now.

ravisantoshgudimetla · 2021-04-11T16:27:53Z

/hold cancel

#100978 addressed.

ravisantoshgudimetla · 2021-04-11T16:28:21Z

/retest

Only system-node-critical pods should be OOM Killed last

e64576a

k8s-ci-robot added the sig/node label and removed the do-not-merge/needs-sig label Mar 3, 2021

k8s-ci-robot requested review from dims and yifan-gu March 3, 2021 21:38

k8s-ci-robot added the area/kubelet label Mar 3, 2021

k8s-ci-robot assigned rphillips Mar 3, 2021

k8s-ci-robot added the lgtm label Mar 3, 2021

k8s-ci-robot added the do-not-merge/hold label Mar 3, 2021

k8s-ci-robot assigned sjenning Mar 4, 2021

openshift-ci-robot mentioned this pull request Apr 7, 2021

Updating openshift-enterprise-hyperkube builder & base images to be consistent with ART openshift/kubernetes#559

Merged

k8s-ci-robot removed the do-not-merge/hold label Apr 9, 2021

ravisantoshgudimetla mentioned this pull request Apr 9, 2021

Automated cherry pick of #99729: Only system-node-critical pods should be OOM Killed last #100975

Closed

k8s-ci-robot added the do-not-merge/hold label Apr 10, 2021

k8s-ci-robot removed the do-not-merge/hold label Apr 11, 2021

k8s-ci-robot merged commit 0691157 into kubernetes:master Apr 11, 2021

k8s-ci-robot added this to the v1.22 milestone Apr 11, 2021

This was referenced Apr 15, 2021

Automated cherry pick of #99729: Only system-node-critical pods should be OOM Killed last #101149

Closed

Automated cherry pick of #99729: Only system-node-critical pods should be OOM Killed last #101150

Closed

rphillips mentioned this pull request Jun 24, 2021

add rphillips to sig-node-reviewers #103159

Closed

6 tasks

tnqn mentioned this pull request Apr 17, 2024

Request basic memory for antrea-controller antrea-io/antrea#6233

Merged
