Node system swap support #2602

ehashman · 2021-04-06T22:29:43Z

This is an initial draft, based on the community design doc. I wanted to make sure we're all in high-level alignment before I begin on the implementation specifics. ~~That should be the only section left TODO.~~ (2021-04-28) I've added @ike-ma's initial draft of the implementation specifics as well, so this should be ready for overall review.

I've tried to keep the scope as narrow as possible to reduce unknowns and make this as easy/clear to graduate as possible. This will necessarily mean (IMO) splitting any workload-level accounting for swap into a separate KEP.

/sig node
/cc @dims @SergeyKanzhelev @dchen1107 @derekwaynecarr @mrunalp
cc @ike-ma

nikhita · 2021-04-07T05:08:27Z

Creating a (dummy) approve comment to re-trigger CI; the approval mechanism was broken for a bit (kubernetes/test-infra#21687).

/approve

keps/sig-node/2400-node-swap/README.md

k8s-ci-robot · 2021-04-29T00:13:59Z

Thanks for your pull request. Before we can look at your pull request, you'll need to sign a Contributor License Agreement (CLA).

📝 Please follow instructions at https://git.k8s.io/community/CLA.md#the-contributor-license-agreement to sign the CLA.

It may take a couple minutes for the CLA signature to be fully registered; after that, please reply here with a new comment and we'll verify. Thanks.

If you've already signed a CLA, it's possible we don't have your GitHub username or you're using a different email address. Check your existing CLA data and verify that your email is set on your git commits.
If you signed the CLA as a corporation, please sign in with your organization's credentials at https://identity.linuxfoundation.org/projects/cncf to be authorized.
If you have done the above and are still having issues with the CLA being reported as unsigned, please log a ticket with the Linux Foundation Helpdesk: https://support.linuxfoundation.org/
Should you encounter any issues with the Linux Foundation Helpdesk, send a message to the backup e-mail support address at: login-issues@jira.linuxfoundation.org

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. I understand the commands that are listed here.

ehashman · 2021-04-29T00:28:30Z

@ike-ma it appears you haven't signed the Kubernetes CLA, can you please follow the instructions in the comment above to do so and run /check-cla when you're done?

keps/sig-node/2400-node-swap/README.md

bobbypage · 2021-05-06T05:23:56Z

keps/sig-node/2400-node-swap/README.md

+### Non-Goals
+
+- Provisioning swap. Swap must already be available on the system.
+- Setting [swappiness]. This can already be set on a system-wide level outside of Kubernetes.


What is meant by "system-wide level" here? Is it referring to vm.swappiness sysctl?

If that's the case, the downside I see is that it's system-wide. For example, maybe there are some use cases where you want certain workloads to swap often (and as such set a high swappiness), but for system workloads (e.g those in /system.slice, i.e. kubelet, container runtime, etc), you don't want them to swap at all (and want a zero swappiness). If it's a system-wide setting, I guess you lose the ability to make those types of adjustments.

Not sure how important that is and probably a post alpha thing, but maybe something to consider :)

ehashman · 2021-05-07

This KEP doesn't enable setting per-workload swap utilization, so I think it's YAGNI on per-workload swappiness configuration in the CRI. When we're ready to add per-workload swap configuration to k8s, then we should add it IMO.

> but for system workloads (e.g those in /system.slice, i.e. kubelet, container runtime, etc), you don't want them to swap at all (and want a zero swappiness).

Typically one wouldn't run the container runtime, kubelet, etc. using k8s though, they're prerequisites for a node that run directly on the host. So potentially this is still individually configurable for them, no?

bobbypage · 2021-05-07

> When we're ready to add per-workload swap configuration to k8s, then we should add it IMO.

Makes sense!

> So potentially this is still individually configurable for them, no?

I guess what you would do in that case is set the vm.swappiness sysctl so that it applies system-wide, but then you would have to edit all of your systemd units for kubelet, container runtime, etc. and override their swap settings. It might be helpful to note somewhere if that's the recommended path to configure swappiness, since I would imagine most existing kubelet/container runtime units don't override swap settings.

edit: actually, looking at the systemd config options, I don't think it exposes overriding swap for individual units... https://www.freedesktop.org/software/systemd/man/systemd.resource-control.html
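
For reference, the system-wide value discussed in this thread is the one Linux exposes at `/proc/sys/vm/swappiness`. A minimal sketch of reading it follows, in Python for brevity. The function name and the `path` parameter are hypothetical, chosen only for illustration; nothing here is part of the KEP or the kubelet.

```python
from pathlib import Path


def read_swappiness(path: str = "/proc/sys/vm/swappiness") -> int:
    """Return the system-wide vm.swappiness value (Linux only).

    `path` is a parameter (an assumption of this sketch) only so the
    function can be pointed at a test file; on a real node the default
    is the procfs entry backing the vm.swappiness sysctl.
    """
    return int(Path(path).read_text().strip())
```

Because this value applies to the whole node, it cannot express the "high swappiness for workloads, zero for system services" split described above.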

keps/sig-node/2400-node-swap/README.md

deads2k · 2021-05-11T14:13:35Z

PRR looks good. An overall note when going to beta: I'm not clear from the proposal as a whole how a pod author

- indicates the container does or does not want to use swap
- knows the binary is using swap
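
On the second point, one way a process can observe its own swap usage on Linux today, independent of anything in this KEP, is the `VmSwap` field of `/proc/<pid>/status`. A minimal sketch, assuming the usual `VmSwap:   <n> kB` line format; the function name is hypothetical:

```python
from typing import Optional


def parse_vm_swap_kib(status_text: str) -> Optional[int]:
    """Extract the VmSwap value (in kB) from /proc/<pid>/status contents.

    Returns None when the field is absent, as it is for kernel threads.
    """
    for line in status_text.splitlines():
        if line.startswith("VmSwap:"):
            # Line format: "VmSwap:      1234 kB"
            return int(line.split()[1])
    return None
```

A process could call `parse_vm_swap_kib(open("/proc/self/status").read())` to check itself. This says nothing about how a pod author would opt in or out of swap, which remains the open question raised here.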

deads2k · 2021-05-11T14:13:46Z

PRR

/approve

derekwaynecarr

A number of requests throughout, let me know what you think.

keps/sig-node/2400-node-swap/README.md

derekwaynecarr

A few more requests, I think this is close to ready.

Let me know what you think.

derekwaynecarr · 2021-05-12T23:06:08Z

keps/sig-node/2400-node-swap/README.md

+1. Add a feature gate `NodeSwapEnabled` to enable swap support.
+1. Leave the default value of kubelet flag `--fail-on-swap` to `true`, to avoid
+   changing default behaviour.
+1. Introduce a new kubelet config parameter, `MemorySwapLimit`.


I would prefer to never introduce the MemorySwapLimit parameter.

I would update this section to say:

- kubelet detects available swap on startup (all available swap is basically allocatable)
- kubelet enables reservation of swap to reduce node allocatable swap (alpha or beta period)
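
The two steps suggested above can be sketched roughly as follows. This is an illustration only, not the kubelet's implementation: it assumes swap is detected from the `SwapTotal` line of `/proc/meminfo`, and the function names and the `reserved_bytes` parameter are hypothetical.

```python
def swap_total_bytes(meminfo_text: str) -> int:
    """Return total swap in bytes from /proc/meminfo contents (0 if none)."""
    for line in meminfo_text.splitlines():
        if line.startswith("SwapTotal:"):
            # Line format: "SwapTotal:  2097148 kB"; the unit is KiB.
            return int(line.split()[1]) * 1024
    return 0


def allocatable_swap_bytes(meminfo_text: str, reserved_bytes: int = 0) -> int:
    """Detected swap minus a (hypothetical) reservation, floored at zero."""
    return max(swap_total_bytes(meminfo_text) - reserved_bytes, 0)
```

With no reservation, all detected swap is allocatable, which matches the first bullet; a nonzero reservation models the second.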

derekwaynecarr · 2021-05-12T23:08:15Z

keps/sig-node/2400-node-swap/README.md

+   changing default behaviour.
+1. Introduce a new kubelet config parameter, `MemorySwapLimit`.
+1. Introduce a new CRI parameter, `memory_swap_limit_in_bytes`.
+1. Integrate new kubelet config and pass values to CRI for container creation.


This needs a little more clarification.

The container runtime will write the swap settings to the container-level cgroup. This will include ephemeral containers. We need to account for the proper setting that is applied at the pod cgroup boundary. Please list addressing this as a blocker to beta for the feature.
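
To illustrate why the pod-level cgroup matters: cgroup limits are hierarchical, so a container can use at most as much swap as the tightest limit among its own cgroup and its ancestors allows, and it shares any pod-level budget with its sibling containers. A minimal sketch of that upper bound, with `None` standing in for an unlimited setting (written as `max` in cgroup v2). The function name and the example values are hypothetical and not taken from the KEP.

```python
from typing import Iterable, Optional


def effective_limit(limits: Iterable[Optional[int]]) -> Optional[int]:
    """Tightest limit along a cgroup path, listed root first, leaf last.

    None means "no limit set at this level"; the result is None only
    when no level on the path sets a limit.
    """
    finite = [limit for limit in limits if limit is not None]
    return min(finite) if finite else None
```

For example, `effective_limit([None, 512, 1024])` models an unlimited parent, a pod cgroup capped at 512, and a container cgroup capped at 1024: the pod-level value is the one that binds, which is why a setting written only at the container level is not the whole picture.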

keps/sig-node/2400-node-swap/README.md

ehashman · 2021-05-13T00:10:14Z

I believe all comments have been addressed. PTAL!

SergeyKanzhelev · 2021-05-13T06:00:50Z

/lgtm

mrunalp · 2021-05-13T15:56:22Z

keps/sig-node/2400-node-swap/README.md

+    int64 memory_swap_limit_in_bytes = 9;
+...
+}
+```


We may want to include this as part of CRI stats as well.

derekwaynecarr

/approve
/lgtm

thanks for the updates, look forward to experimenting with this as it evolves.

k8s-ci-robot · 2021-05-13T17:02:12Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: deads2k, derekwaynecarr, ehashman, nikhita

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~keps/prod-readiness/OWNERS~~ [deads2k,ehashman]
~~keps/sig-node/OWNERS~~ [derekwaynecarr]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

carlisia · 2021-05-19T01:59:07Z

Hello @ehashman 👋, 1.22 Docs Shadow here.
This enhancement is marked as ‘Needs Docs’ for the 1.22 release.

Please follow the steps detailed in the documentation to open a PR against dev-1.22 branch in the k/website repo. This PR can be just a placeholder at this time and must be created before Fri July 9, 11:59 PM PDT.

Also, take a look at Documenting for a release to familiarize yourself with the docs requirement for the release.

Thank you! 🙏

k8s-ci-robot requested review from dchen1107, derekwaynecarr, dims, mrunalp and SergeyKanzhelev April 6, 2021 22:29

k8s-ci-robot added the sig/node, cncf-cla: yes, and size/XL labels Apr 6, 2021

ehashman mentioned this pull request Apr 6, 2021

Kubelet/Kubernetes should work with Swap Enabled kubernetes/kubernetes#53533

Closed

ehashman mentioned this pull request Apr 7, 2021

Node memory swap support #2400

Open

60 tasks

SergeyKanzhelev reviewed Apr 12, 2021

keps/sig-node/2400-node-swap/README.md (outdated, resolved)

SergeyKanzhelev reviewed Apr 12, 2021

keps/sig-node/2400-node-swap/README.md (resolved)

SergeyKanzhelev reviewed Apr 12, 2021

keps/sig-node/2400-node-swap/README.md (outdated, resolved)

SergeyKanzhelev reviewed Apr 12, 2021

keps/sig-node/2400-node-swap/README.md (outdated, resolved)

SergeyKanzhelev reviewed Apr 12, 2021

keps/sig-node/2400-node-swap/README.md (resolved)

anguslees reviewed Apr 13, 2021

keps/sig-node/2400-node-swap/README.md (outdated, resolved)

keps/sig-node/2400-node-swap/README.md (outdated, resolved)

ehashman force-pushed the kep-2400 branch from 62806e2 to f7c16be April 19, 2021 22:19

k8s-ci-robot added the kind/kep label Apr 19, 2021

ike-ma reviewed Apr 20, 2021

keps/sig-node/2400-node-swap/README.md (resolved)

keps/sig-node/2400-node-swap/README.md (resolved)

keps/sig-node/2400-node-swap/README.md (resolved)

keps/sig-node/2400-node-swap/README.md (outdated, resolved)

ike-ma mentioned this pull request Apr 26, 2021

REQUEST: New membership for ike-ma kubernetes/org#2657

Closed

Add draft for node swap KEP

a1941aa

ehashman force-pushed the kep-2400 branch from f7c16be to 31acf14 April 29, 2021 00:13

k8s-ci-robot added the do-not-merge/invalid-commit-message label Apr 29, 2021

k8s-ci-robot added the cncf-cla: no label and removed the cncf-cla: yes label Apr 29, 2021

ehashman force-pushed the kep-2400 branch from 31acf14 to 8acb98b April 29, 2021 00:25

k8s-ci-robot removed the do-not-merge/invalid-commit-message label Apr 29, 2021

bobbypage reviewed May 6, 2021

deads2k reviewed May 7, 2021

keps/sig-node/2400-node-swap/README.md (outdated, resolved)

deads2k reviewed May 7, 2021

keps/sig-node/2400-node-swap/README.md (outdated, resolved)

deads2k reviewed May 7, 2021

keps/sig-node/2400-node-swap/README.md (resolved)

deads2k reviewed May 7, 2021

keps/sig-node/2400-node-swap/README.md (resolved)

deads2k reviewed May 7, 2021

keps/sig-node/2400-node-swap/README.md (resolved)

deads2k reviewed May 7, 2021

keps/sig-node/2400-node-swap/README.md (resolved)

ehashman added 2 commits May 7, 2021 14:22

Reflow to 80-char line width

84adca7

Address PRR and other review comments

64e639a

derekwaynecarr requested changes May 11, 2021

Update based on reviewer feedback

277c51a

ehashman requested a review from derekwaynecarr May 11, 2021 22:53

derekwaynecarr requested changes May 12, 2021

derekwaynecarr reviewed May 12, 2021

keps/sig-node/2400-node-swap/README.md (outdated, resolved)

mrunalp reviewed May 12, 2021

keps/sig-node/2400-node-swap/README.md (outdated, resolved)

Address next round of reviewer feedback

b5c0dae

k8s-ci-robot assigned SergeyKanzhelev May 13, 2021

k8s-ci-robot added the lgtm label May 13, 2021

mrunalp reviewed May 13, 2021

derekwaynecarr approved these changes May 13, 2021

k8s-ci-robot added the approved label May 13, 2021

k8s-ci-robot merged commit abc88e7 into kubernetes:master May 13, 2021

k8s-ci-robot added this to the v1.22 milestone May 13, 2021

iholder101 mentioned this pull request Jul 19, 2023

[KEP-2400] Node memory swap support (replacement for inactive issue #2400) #4128

Closed

35 tasks
