Memory manager support for Windows nodes #128560

marosset · 2024-11-05T02:16:40Z

/kind feature

What type of PR is this?

What this PR does / why we need it:

This PR adds support for memory manager on Windows nodes and add a new BestEffort policy which allows giving preferences to the Windows OS for which CPUs to bind containers to.
Windows cannot guarantee scheduling workloads to specific NUMA nodes so instead we query the OS for which CPUs are part of a given NUMA node and specify that when starting containers.

Which issue(s) this PR fixes:

Part of #125262

To enable memory manager support the kubelet configuration would look something like

cpuManagerPolicy: static
memoryManagerPolicy: BestEffort
systemReserved: 
  cpu: 500m
  memory: 500Mi
featureGates:
  MemoryManager: true
  WindowsCPUAndMemoryAffinity: true

Special notes for your reviewer:

Does this PR introduce a user-facing change?

Added Windows support for the node memory manager.

Additional documentation e.g., KEPs (Kubernetes Enhancement Proposals), usage docs, etc.:

KEP: https://github.com/kubernetes/enhancements/tree/master/keps/sig-windows/4885-windows-cpu-and-memory-affinity

/sig windows node
/milestone v1.32
/area kubelet
/assign @jsturtevant

pkg/kubelet/cm/container_manager_windows.go

pkg/kubelet/cm/internal_container_lifecycle_windows.go

pkg/kubelet/cm/memorymanager/memory_manager.go

pkg/kubelet/cm/memorymanager/policy_best_effort.go

ffromani · 2024-11-05T16:41:03Z

pkg/kubelet/cm/memorymanager/policy_best_effort.go

+
+// bestEffortPolicy is implementation of the policy interfact for the BestEffort policy
+type bestEffortPolicy struct {
+	static *staticPolicy


Is not obvious to me what do we gain by wrapping the static policy. Could you please elaborate a bit?

For Windows we want the new BestEffort policy to have the same logic as the existing static policy.
I figured that doing this would reduce a lot of code duplication.

I can update the comments here to reflect that or maybe I can have the kubelet arguments specify BestEffortPolicy and then create a staticPolicy instance here for Windows.

Do you all have a preference @ffromani @jsturtevant ?

Updating comments and wrapping seems to make sense to me. This will allow to adjust more easily if we learn of reasons to adjust.

I'll need to refresh my memory over what we discussed in the KEP before I can answer. Right now I can surely say comments is a great starting point.

ok, first: I've nothing against reusing the code like this. We didn't do often in the codebase, but why not?
The issue I see however is that from my interpretation of the KEP I would have expected different hint generation on windows based on https://github.com/kubernetes/enhancements/tree/master/keps/sig-windows/4885-windows-cpu-and-memory-affinity#windows-memory-considerations

I think we can settle this with a good set of testcases, at very least unit tests. which seems lacking to me?
Granted, the policy is actually a thin wrapper over the static policy, which has its own tests; the tests I recommend to add are to ensure and document the expected behavior on windows.

The tests should cover preferably also the topology manager merge hint process. It will be similar in spirit to what you are adding to internal_container_lifecycle_windows_test.go, but covering the generic topology manager flow.

There's likely not enough time to do this for this cycle. Plus I want to be practical: it's an alpha feature, disabled by default, and the early feedback from having some functionality in is important as well. So I don't think it is blocking, but I DO think this is beta-blocking (to discuss this topic in detail at least) and possibly we can need another alpha iteration to sort this out depending on the aforementioned feedback

The issue I see however is that from my interpretation of the KEP I would have expected different hint generation on windows based on https://github.com/kubernetes/enhancements/tree/master/keps/sig-windows/4885-windows-cpu-and-memory-affinity#windows-memory-considerations

I didn't see a difference in what we would provide for a hint. The major difference is in the semantic meaning of the hint, which on Windows we can't guarantee that it the hint will be respected, only make that suggestion. What kind of difference are you thinking?

There's likely not enough time to do this for this cycle. Plus I want to be practical: it's an alpha feature, disabled by default, and the early feedback from having some functionality in is important as well. So I don't think it is blocking, but I DO think this is beta-blocking (to discuss this topic in detail at least) and possibly we can need another alpha iteration to sort this out depending on the aforementioned feedback

I appreciate this sentiment. We do have a user we are working with that is willing to provide feedback but is looking for it to be in an Alpha release. From my understanding of the challenges on Linux, these types of features need some real feedback to get them dialed in, and I anticipate we will get feedback that we will need to incorporate back into the design.

The issue I see however is that from my interpretation of the KEP I would have expected different hint generation on windows based on https://github.com/kubernetes/enhancements/tree/master/keps/sig-windows/4885-windows-cpu-and-memory-affinity#windows-memory-considerations

I didn't see a difference in what we would provide for a hint. The major difference is in the semantic meaning of the hint, which on Windows we can't guarantee that it the hint will be respected, only make that suggestion. What kind of difference are you thinking?

I'm basing my expectations on what I saw so far on linux for the cpumanager policy. I would like to see if for example returning a different set of NUMA bitmasks or different values for Preferred field.

If we add a thin wrapper over the Static policy, we are saying that this policy should behave 1:1 like the static policy, and we are unnecessarily coupling the behavior. What I would rather see is a specification (and unit tests are a good vehicle for that) of the expected behavior, more precise than (yes, I'm oversimplifying here) "whatever Static policy does, but not enforcing".

Lacking specification is something that biten us already multiple already, so we should IMO go in the direction to have the spec spelled out more precisely, and the testcase well defined. If, after that, we can reuse the implementation, great.

If nothing else (which I can accept!) this should be discussed and elaborated in the KEP design details.
I'm happy to elaborate further in the next cycle. Again, more details in the KEP to elaborate more precisely the behavior can possibly be all we need.

There's likely not enough time to do this for this cycle. Plus I want to be practical: it's an alpha feature, disabled by default, and the early feedback from having some functionality in is important as well. So I don't think it is blocking, but I DO think this is beta-blocking (to discuss this topic in detail at least) and possibly we can need another alpha iteration to sort this out depending on the aforementioned feedback

I appreciate this sentiment. We do have a user we are working with that is willing to provide feedback but is looking for it to be in an Alpha release. From my understanding of the challenges on Linux, these types of features need some real feedback to get them dialed in, and I anticipate we will get feedback that we will need to incorporate back into the design.

Fine, so we will evaluate during the 1.33 cycle if we can move to beta (taking into account my oen feedback above) or if we need another alpha iteration.

The rest of the code overall LGTM, I'll have another pass ASAP but from my recollections I don't have anything major on my notes.

ffromani · 2024-11-05T16:41:46Z

/triage accepted
/priority important-soon

initial priority assessment subject to be changed

pkg/kubelet/cm/internal_container_lifecycle_windows.go

ffromani · 2024-11-05T18:11:16Z

/retitle Memory manager support for Windows nodes

ffromani · 2024-11-05T18:12:34Z

@marosset adding the policy name ultimates surfaces to the user and to the kubeletconfig API object so this change MAY require api-review, please take this into account.

sftim · 2024-11-06T10:53:28Z

Changelog suggestion

-Windows: Support memory manager on Windows 
+Added Windows support for the node memory manager.

pkg/kubelet/cm/internal_container_lifecycle_windows.go

pkg/kubelet/cm/memorymanager/memory_manager.go

ffromani · 2024-11-06T15:32:04Z

pkg/kubelet/cm/memorymanager/policy_best_effort.go

+
+// bestEffortPolicy is implementation of the policy interfact for the BestEffort policy
+type bestEffortPolicy struct {
+	static *staticPolicy


ok, first: I've nothing against reusing the code like this. We didn't do often in the codebase, but why not?
The issue I see however is that from my interpretation of the KEP I would have expected different hint generation on windows based on https://github.com/kubernetes/enhancements/tree/master/keps/sig-windows/4885-windows-cpu-and-memory-affinity#windows-memory-considerations

I think we can settle this with a good set of testcases, at very least unit tests. which seems lacking to me?
Granted, the policy is actually a thin wrapper over the static policy, which has its own tests; the tests I recommend to add are to ensure and document the expected behavior on windows.

The tests should cover preferably also the topology manager merge hint process. It will be similar in spirit to what you are adding to internal_container_lifecycle_windows_test.go, but covering the generic topology manager flow.

There's likely not enough time to do this for this cycle. Plus I want to be practical: it's an alpha feature, disabled by default, and the early feedback from having some functionality in is important as well. So I don't think it is blocking, but I DO think this is beta-blocking (to discuss this topic in detail at least) and possibly we can need another alpha iteration to sort this out depending on the aforementioned feedback

pkg/kubelet/cm/memorymanager/policy_best_effort.go

marosset · 2024-11-07T16:55:49Z

/lgtm cancel

need to rebase on top of #128657

Ack - I'll rebase on that as soon as it merges.

ffromani

/lgtm

thanks for the rebase and sorry for the churn. You use the recently added containerMap.Clone(), so looks fine again to me.

k8s-ci-robot · 2024-11-07T19:58:09Z

LGTM label has been added.

Git tree hash: 28376828105d5f848a8c1e75791828342d5b5d5c

pkg/kubelet/cm/memorymanager/memory_manager.go

k8s-ci-robot · 2024-11-07T20:11:49Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: marosset, mrunalp

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~pkg/kubelet/OWNERS~~ [mrunalp]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

ffromani · 2024-11-07T20:18:38Z

beta-blocking (or perhaps as later bugfix?) we need to update cmd/kubelet/app/options/options.go

mrunalp · 2024-11-07T20:21:47Z

/lgtm

k8s-ci-robot · 2024-11-07T20:21:53Z

LGTM label has been added.

Git tree hash: 9343fc1fc9ca096c90740b4240d16aae84833eed

jsturtevant · 2024-11-07T20:45:10Z

/lgtm

tengqm · 2024-11-08T00:35:21Z

Do we have documentation updates (including the feature gate) to the website?

marosset · 2024-11-08T00:47:10Z

Do we have documentation updates (including the feature gate) to the website?

We'll use kubernetes/website#48469 to update docs for both cpu manager (#125296) and memory manager (this PR) support for windows.

k8s-ci-robot assigned jsturtevant Nov 5, 2024

k8s-ci-robot added release-note Denotes a PR that will be considered when it comes time to generate release notes. size/XL Denotes a PR that changes 500-999 lines, ignoring generated files. labels Nov 5, 2024

k8s-ci-robot added this to the v1.32 milestone Nov 5, 2024

k8s-ci-robot requested review from kannon92 and wzshiming November 5, 2024 02:17

ffromani reviewed Nov 5, 2024

View reviewed changes

jsturtevant reviewed Nov 5, 2024

View reviewed changes

pkg/kubelet/cm/internal_container_lifecycle_windows.go Outdated Show resolved Hide resolved

jsturtevant reviewed Nov 5, 2024

View reviewed changes

pkg/kubelet/cm/internal_container_lifecycle_windows.go Outdated Show resolved Hide resolved

k8s-ci-robot changed the title ~~Memeory manager support for Windows nodes~~ Memory manager support for Windows nodes Nov 5, 2024

k8s-ci-robot added size/L Denotes a PR that changes 100-499 lines, ignoring generated files. and removed size/XL Denotes a PR that changes 500-999 lines, ignoring generated files. labels Nov 5, 2024

ffromani mentioned this pull request Nov 6, 2024

blog: Memory Manager goes GA kubernetes/website#48578

Merged

ffromani reviewed Nov 6, 2024

View reviewed changes

jsturtevant reviewed Nov 6, 2024

View reviewed changes

pkg/kubelet/cm/memorymanager/policy_best_effort.go Outdated Show resolved Hide resolved

k8s-ci-robot requested review from jsturtevant and mrunalp November 7, 2024 15:36

marosset force-pushed the win-mem-manager branch 2 times, most recently from ab90eb1 to 284fa5a Compare November 7, 2024 19:54

ffromani reviewed Nov 7, 2024

View reviewed changes

k8s-ci-robot added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Nov 7, 2024

mrunalp reviewed Nov 7, 2024

View reviewed changes

pkg/kubelet/cm/memorymanager/memory_manager.go Outdated Show resolved Hide resolved

mrunalp approved these changes Nov 7, 2024

View reviewed changes

k8s-ci-robot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Nov 7, 2024

marosset force-pushed the win-mem-manager branch from 284fa5a to 05a8977 Compare November 7, 2024 20:19

k8s-ci-robot removed the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Nov 7, 2024

k8s-ci-robot requested review from ffromani and mrunalp November 7, 2024 20:19

k8s-ci-robot added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Nov 7, 2024

Memory manager support for Windows nodes

ad138d5

marosset force-pushed the win-mem-manager branch from 05a8977 to ad138d5 Compare November 7, 2024 20:44

k8s-ci-robot merged commit 3c9380c into kubernetes:master Nov 7, 2024
15 checks passed

marosset mentioned this pull request Nov 8, 2024

Windows CPU and Memory Affinity kubernetes/enhancements#4885

Open

9 tasks

marosset deleted the win-mem-manager branch November 14, 2024 23:37

liggitt removed the api-review Categorizes an issue or PR as actively needing an API review. label Nov 21, 2024

marosset mentioned this pull request Dec 10, 2024

cleaned release notes final draft kubernetes/sig-release#2689

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Memory manager support for Windows nodes #128560

Memory manager support for Windows nodes #128560

marosset commented Nov 5, 2024 •

edited

Loading

ffromani Nov 5, 2024

marosset Nov 5, 2024

jsturtevant Nov 5, 2024

ffromani Nov 5, 2024

ffromani Nov 6, 2024

jsturtevant Nov 6, 2024

ffromani Nov 6, 2024 •

edited

Loading

ffromani commented Nov 5, 2024

ffromani commented Nov 5, 2024

ffromani commented Nov 5, 2024 •

edited

Loading

sftim commented Nov 6, 2024

ffromani Nov 6, 2024

marosset commented Nov 7, 2024

ffromani left a comment

k8s-ci-robot commented Nov 7, 2024

k8s-ci-robot commented Nov 7, 2024

ffromani commented Nov 7, 2024

mrunalp commented Nov 7, 2024

k8s-ci-robot commented Nov 7, 2024

jsturtevant commented Nov 7, 2024

tengqm commented Nov 8, 2024

marosset commented Nov 8, 2024 •

edited

Loading

Memory manager support for Windows nodes #128560

Memory manager support for Windows nodes #128560

Conversation

marosset commented Nov 5, 2024 • edited Loading

What type of PR is this?

What this PR does / why we need it:

Which issue(s) this PR fixes:

Special notes for your reviewer:

Does this PR introduce a user-facing change?

Additional documentation e.g., KEPs (Kubernetes Enhancement Proposals), usage docs, etc.:

ffromani Nov 5, 2024

Choose a reason for hiding this comment

marosset Nov 5, 2024

Choose a reason for hiding this comment

jsturtevant Nov 5, 2024

Choose a reason for hiding this comment

ffromani Nov 5, 2024

Choose a reason for hiding this comment

ffromani Nov 6, 2024

Choose a reason for hiding this comment

jsturtevant Nov 6, 2024

Choose a reason for hiding this comment

ffromani Nov 6, 2024 • edited Loading

Choose a reason for hiding this comment

ffromani commented Nov 5, 2024

ffromani commented Nov 5, 2024

ffromani commented Nov 5, 2024 • edited Loading

sftim commented Nov 6, 2024

ffromani Nov 6, 2024

Choose a reason for hiding this comment

marosset commented Nov 7, 2024

ffromani left a comment

Choose a reason for hiding this comment

k8s-ci-robot commented Nov 7, 2024

k8s-ci-robot commented Nov 7, 2024

ffromani commented Nov 7, 2024

mrunalp commented Nov 7, 2024

k8s-ci-robot commented Nov 7, 2024

jsturtevant commented Nov 7, 2024

tengqm commented Nov 8, 2024

marosset commented Nov 8, 2024 • edited Loading

marosset commented Nov 5, 2024 •

edited

Loading

ffromani Nov 6, 2024 •

edited

Loading

ffromani commented Nov 5, 2024 •

edited

Loading

marosset commented Nov 8, 2024 •

edited

Loading