KEP-127: user namespace support for stateless pods #116377

giuseppe · 2023-03-08T15:14:57Z

What type of PR is this?

/kind feature

What this PR does / why we need it:

it adds support for user namespace for stateless pods

Which issue(s) this PR fixes:

Special notes for your reviewer:

rebased on top of: #116249

Does this PR introduce a user-facing change?

By enabling the `UserNamespacesStatelessPodsSupport` feature gate in kubelet, you can now run a stateless pod in a separate user namespace

Additional documentation e.g., KEPs (Kubernetes Enhancement Proposals), usage docs, etc.:

Support User Namespaces in pods enhancements#127

giuseppe · 2023-03-08T15:15:22Z

/sig node

rata

LGTM

sftim · 2023-03-08T16:14:18Z

❓ do we publish the UID range that a Pod is using? eg: add that information into the .status of the Pod?

giuseppe · 2023-03-08T21:32:23Z

question do we publish the UID range that a Pod is using? eg: add that information into the .status of the Pod?

no, that information is not exposed at the moment

pacoxu · 2023-03-13T09:43:24Z

/priority important-soon
/triage accepted

pkg/kubelet/kuberuntime/kuberuntime_container_linux.go

add the definitions for the ID mappings to use at runtime for the volume mount. This is supported only on Linux where idmapped mounts are used to perform the runtime mapping. The new fields are mapped directly to the field in the OCI runtime specs: https://github.com/opencontainers/runtime-spec/blob/main/config.md#posix-platform-mounts The CRI runtime will pass the mappings to the OCI runtime as-is. Related to KEP-127. Signed-off-by: Giuseppe Scrivano <gscrivan@redhat.com>

Signed-off-by: Giuseppe Scrivano <gscrivan@redhat.com>

Now KEP-127 relies on idmap mounts to do the ID translation and we won't do any chowns in the kubelet. This patch just removes the usage of GetHostIDsForPod() in operationexecutor to do the chown, and also removes the GetHostIDsForPod() method from the kubelet volume interface. Signed-off-by: Rodrigo Campos <rodrigoca@microsoft.com>

Latest changes to KEP-127 removed that phase, so let's stop reserving those IDs for that. While we are there, we replace 0 for 0*65536 as before we had a bug that we were not multiplying the index, to avoid bugs in the future. Signed-off-by: Rodrigo Campos <rodrigoca@microsoft.com>

To that end, we need to add one kubelet getter listPodsFromDisk(). Other than that, it is a pretty trivial move. Signed-off-by: Rodrigo Campos <rodrigoca@microsoft.com>

mrunalp

/lgtm

k8s-ci-robot · 2023-03-14T04:19:42Z

LGTM label has been added.

Git tree hash: f5d61df8aeccd78ecd8e9b66f0d063b71702228f

mrunalp · 2023-03-14T04:20:12Z

@thockin @gnufied ptal

rata · 2023-03-14T10:22:02Z

/retest

gnufied · 2023-03-14T14:31:53Z

cc @dobsonj

gnufied · 2023-03-14T14:46:13Z

pkg/volume/util/operationexecutor/operation_generator.go

-				return volumetypes.NewOperationContext(eventErr, detailedErr, migrated)
-			}
-		}
-
 		// Execute mount


So the idea here is to, instead of kubelet giving permissions to mapped uid/gid, we are going to ask CRI to idmap these mount points when creating bind mounts? So no change in volumemanager code is necessary?

yes exactly, the idea is to use idmapped mounts (through the CRI) to cancel the effect on volumes of running inside a user namespace

gnufied · 2023-03-14T14:52:59Z

I think change is fine for alpha implementation with understanding that current limitation of idmapped mounts will prevent this from working in case of brown field volumes with non-uniform uid/gid permissions. But we are currently not targeting those anyways.

Lets merge this, but I also want to hear from @dobsonj who has been tinkering in this area.

/approve
/hold

k8s-ci-robot · 2023-03-14T14:53:21Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: giuseppe, gnufied, mrunalp, rata

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~pkg/controller/volume/OWNERS~~ [gnufied]
~~pkg/kubelet/OWNERS~~ [mrunalp]
~~pkg/volume/OWNERS~~ [gnufied]
~~staging/src/k8s.io/cri-api/pkg/OWNERS~~ [mrunalp]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

dobsonj · 2023-03-14T16:17:24Z

I think this looks fine, I'd just ask that we document which container runtimes support this so it's clear for users of this feature.
/unhold

k8s-ci-robot requested review from humblec and mrunalp March 8, 2023 15:17

rata approved these changes Mar 8, 2023

View reviewed changes

giuseppe mentioned this pull request Mar 9, 2023

cri-api: add mappings for volumes #116249

Closed

rata reviewed Mar 13, 2023

View reviewed changes

pkg/kubelet/kuberuntime/kuberuntime_container_linux.go Outdated Show resolved Hide resolved

giuseppe and others added 5 commits March 13, 2023 22:21

kubelet: use idmapped mounts for all volumes

9075404

Signed-off-by: Giuseppe Scrivano <gscrivan@redhat.com>

kubelet: Move userns manager to its own package

ec0410a

To that end, we need to add one kubelet getter listPodsFromDisk(). Other than that, it is a pretty trivial move. Signed-off-by: Rodrigo Campos <rodrigoca@microsoft.com>

giuseppe force-pushed the rata/userns branch from acea900 to ec0410a Compare March 13, 2023 21:28

marosset mentioned this pull request Mar 14, 2023

Support User Namespaces in pods kubernetes/enhancements#127

Open

26 tasks

mrunalp approved these changes Mar 14, 2023

View reviewed changes

k8s-ci-robot assigned mrunalp Mar 14, 2023

k8s-ci-robot added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Mar 14, 2023

gnufied reviewed Mar 14, 2023

View reviewed changes

k8s-ci-robot added the do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. label Mar 14, 2023

k8s-ci-robot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Mar 14, 2023

k8s-ci-robot removed the do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. label Mar 14, 2023

k8s-ci-robot merged commit 6a111be into kubernetes:master Mar 14, 2023

k8s-ci-robot added this to the v1.27 milestone Mar 14, 2023

pacoxu mentioned this pull request Jun 25, 2023

update cri-api change in v1.27 #118845

Merged

praiskup mentioned this pull request Jul 24, 2023

A new option --buildroot-image= similar to bootstrap_image rpm-software-management/mock#1159

Closed

1 task

yawqi mentioned this pull request Nov 7, 2023

support user namespace kata-containers/kata-containers#8170

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

KEP-127: user namespace support for stateless pods #116377

KEP-127: user namespace support for stateless pods #116377

giuseppe commented Mar 8, 2023

giuseppe commented Mar 8, 2023

rata left a comment

sftim commented Mar 8, 2023

giuseppe commented Mar 8, 2023

pacoxu commented Mar 13, 2023

mrunalp left a comment

k8s-ci-robot commented Mar 14, 2023

mrunalp commented Mar 14, 2023

rata commented Mar 14, 2023

gnufied commented Mar 14, 2023

gnufied Mar 14, 2023

giuseppe Mar 14, 2023

gnufied commented Mar 14, 2023

k8s-ci-robot commented Mar 14, 2023

dobsonj commented Mar 14, 2023

KEP-127: user namespace support for stateless pods #116377

KEP-127: user namespace support for stateless pods #116377

Conversation

giuseppe commented Mar 8, 2023

What type of PR is this?

What this PR does / why we need it:

Which issue(s) this PR fixes:

Special notes for your reviewer:

Does this PR introduce a user-facing change?

Additional documentation e.g., KEPs (Kubernetes Enhancement Proposals), usage docs, etc.:

giuseppe commented Mar 8, 2023

rata left a comment

Choose a reason for hiding this comment

sftim commented Mar 8, 2023

giuseppe commented Mar 8, 2023

pacoxu commented Mar 13, 2023

mrunalp left a comment

Choose a reason for hiding this comment

k8s-ci-robot commented Mar 14, 2023

mrunalp commented Mar 14, 2023

rata commented Mar 14, 2023

gnufied commented Mar 14, 2023

gnufied Mar 14, 2023

Choose a reason for hiding this comment

giuseppe Mar 14, 2023

Choose a reason for hiding this comment

gnufied commented Mar 14, 2023

k8s-ci-robot commented Mar 14, 2023

dobsonj commented Mar 14, 2023