All containers restart after kubelet upgrade to v1.31 #129385

Open
liuxu623 opened this issue Dec 24, 2024 · 16 comments
Labels
needs-kind: Indicates a PR lacks a `kind/foo` label and requires one.
needs-triage: Indicates an issue or PR lacks a `triage/foo` label and requires one.
sig/node: Categorizes an issue or PR as relevant to SIG Node.

Comments

@liuxu623
Contributor

What happened?

All containers restart after kubelet upgrade to v1.31

What did you expect to happen?

Containers shouldn't restart after kubelet upgrade

How can we reproduce it (as minimally and precisely as possible)?

upgrade kubelet from v1.30 to v1.31

Anything else we need to know?

#124220 (comment)

Kubernetes version

$ kubectl version
# paste output here

Cloud provider

OS version

# On Linux:
$ cat /etc/os-release
# paste output here
$ uname -a
# paste output here

# On Windows:
C:\> wmic os get Caption, Version, BuildNumber, OSArchitecture
# paste output here

Install tools

Container runtime (CRI) and version (if applicable)

Related plugins (CNI, CSI, ...) and versions (if applicable)

@liuxu623 liuxu623 added the kind/bug Categorizes issue or PR as related to a bug. label Dec 24, 2024
@k8s-ci-robot k8s-ci-robot added the needs-sig Indicates an issue or PR lacks a `sig/foo` label and requires one. label Dec 24, 2024
@k8s-ci-robot
Contributor

This issue is currently awaiting triage.

If a SIG or subproject determines this is a relevant issue, they will accept it by applying the triage/accepted label and provide further guidance.

The triage/accepted label can be added by org members by writing /triage accepted in a comment.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

@k8s-ci-robot k8s-ci-robot added the needs-triage Indicates an issue or PR lacks a `triage/foo` label and requires one. label Dec 24, 2024
@liggitt
Member

liggitt commented Dec 24, 2024

Keeping containers running while upgrading the kubelet across minor versions is not supported. See https://kubernetes.io/releases/version-skew-policy/#kubelet-1

Note:
Before performing a minor version kubelet upgrade, drain pods from that node. In-place minor version kubelet upgrades are not supported.

@liggitt
Member

liggitt commented Dec 24, 2024

/remove-kind bug

@k8s-ci-robot k8s-ci-robot added needs-kind Indicates a PR lacks a `kind/foo` label and requires one. and removed kind/bug Categorizes issue or PR as related to a bug. labels Dec 24, 2024
@Chaunceyctx
Contributor

Chaunceyctx commented Dec 25, 2024

The change was #124220. Key point: the hash value stored in the existing container's annotation will not match the hash value calculated by the upgraded kubelet.
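
For illustration, a minimal Go sketch of why a change in the hashing scheme across kubelet versions triggers restarts even though the pod spec itself did not change. This is not the actual kubelet code; the spec type and the oldHash/newHash helpers are made up to show the mechanism.

package main

import (
	"fmt"
	"hash/fnv"
)

// containerSpec is a simplified stand-in for a pod's container definition.
type containerSpec struct {
	Name  string
	Image string
}

// oldHash mimics a pre-upgrade kubelet that feeds the whole spec into the hash.
func oldHash(c containerSpec) uint64 {
	h := fnv.New64a()
	fmt.Fprintf(h, "%#v", c) // every field contributes to the result
	return h.Sum64()
}

// newHash mimics a post-upgrade kubelet that feeds only the image into the hash.
func newHash(c containerSpec) uint64 {
	h := fnv.New64a()
	fmt.Fprint(h, c.Image) // only the image contributes
	return h.Sum64()
}

func main() {
	c := containerSpec{Name: "app", Image: "nginx:1.27"}

	// Hash recorded by the old kubelet when it started the container
	// (conceptually, the value stored in the container's annotation).
	stored := oldHash(c)

	// Hash recomputed by the upgraded kubelet at sync time.
	current := newHash(c)

	// The pod spec is unchanged, but the hashing scheme changed, so the
	// values differ and the sync loop treats the container as changed.
	if current != stored {
		fmt.Println("hash mismatch: container would be restarted")
	}
}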

@liuxu623
Contributor Author

The change was #124220. Key point: the hash value stored in the existing container's annotation will not match the hash value calculated by the upgraded kubelet.

@liggitt @Chaunceyctx Currently, only the image is used to calculate the hash. I don't understand why we have to use hash to determine whether to restart the container instead of directly comparing the image.
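
As a rough sketch of the alternative suggested here (again not the kubelet's real types or sync logic, just an illustration): compare the image the container was started with against the image currently requested in the pod spec, and restart only on a real difference.

package main

import "fmt"

// runningContainer is an illustrative stand-in for the kubelet's view of a
// container it has already started.
type runningContainer struct {
	Name  string
	Image string // image the container was started with
}

// needsRestart compares the only field that can change in place for a running
// pod, instead of comparing opaque hashes of the whole spec.
func needsRestart(running runningContainer, desiredImage string) bool {
	return running.Image != desiredImage
}

func main() {
	c := runningContainer{Name: "app", Image: "nginx:1.27"}
	fmt.Println(needsRestart(c, "nginx:1.27")) // false: nothing changed, keep it running
	fmt.Println(needsRestart(c, "nginx:1.28")) // true: image was updated, restart
}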

@liuxu623
Contributor Author

Is it necessary to keep the container hash annotation?

@thisisharrsh

I don't understand why we have to use hash to determine whether to restart the container instead of directly comparing the image.

Using a hash ensures that any significant change in the container's configuration, whether it's the image or other critical attributes, triggers a restart to maintain consistency and reliability.

@thisisharrsh

While comparing the image directly might seem sufficient, a hash-based approach provides a comprehensive and efficient mechanism to detect changes across all configuration parameters, avoiding potential bugs and ensuring predictable behavior.

@thisisharrsh

In my view, this is why we are using the hash value.

@thisisharrsh

/sig node

@k8s-ci-robot k8s-ci-robot added sig/node Categorizes an issue or PR as relevant to SIG Node. and removed needs-sig Indicates an issue or PR lacks a `sig/foo` label and requires one. labels Dec 25, 2024
@ffromani
Contributor

/cc

@dixudx
Member

dixudx commented Dec 26, 2024

I totally understand that upgrading in place is not recommended and that it is really hard for the community to maintain.

But for large clusters it is not easy to strictly follow the drain-then-upgrade steps. At the very least we should introduce an opt-in flag/gate that lets users keep using the old hash value, which would avoid unnecessary pod restarts during upgrades. We should make a compromise and let cluster administrators make the decision.

Moreover, such pod restarts due to a changed spec hash are unnecessary, since kube-apiserver already validates pod updates and only allows a few fields, such as the image, to be updated. On the kubelet side these hash values are mostly just carried as annotations; we'd better not rely on them to decide whether to restart or kill a container during an upgrade.
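
To make the proposal concrete, here is a hypothetical sketch of what an opt-in fallback to the legacy hash could look like. No such flag or gate exists in the kubelet today; all names below are invented. The container is only treated as changed if its stored hash matches neither the current scheme nor, when the opt-in is set, the legacy one.

package main

import "fmt"

// acceptLegacyHash stands in for the proposed opt-in flag/gate; it is purely
// hypothetical and does not exist in the kubelet.
var acceptLegacyHash = true

// containerChanged reports whether the stored hash indicates a real spec
// change. currentScheme and legacyScheme are the hashes the running kubelet
// would compute with its new and old hashing schemes, respectively.
func containerChanged(stored, currentScheme, legacyScheme uint64) bool {
	if stored == currentScheme {
		return false
	}
	if acceptLegacyHash && stored == legacyScheme {
		// The hash differs only because the scheme changed across kubelet
		// versions, not because the pod spec changed: keep the container.
		return false
	}
	return true
}

func main() {
	// The annotation was written by the old kubelet (legacy scheme = 42),
	// while the upgraded kubelet now computes 7 for the same spec.
	fmt.Println(containerChanged(42, 7, 42)) // false with the opt-in: no restart
}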

@liuxu623
Contributor Author

I don't understand why we have to use hash to determine whether to restart the container instead of directly comparing the image.

Using a hash ensures that any significant change in the container's configuration, whether it's the image or other critical attributes, triggers a restart to maintain consistency and reliability.

I think we should clarify which situations should cause Pod restarts. So far there is only one: updating the image. If other situations appear in the future, we can consider using a hash, but for now I think the hash is unnecessary.

@thisisharrsh

I think we should clarify which situations should cause Pod restarts. So far there is only one: updating the image. If other situations appear in the future, we can consider using a hash, but for now I think the hash is unnecessary.

Thank you for pointing this out!
I agree that simplifying the restart logic to focus solely on image updates is a better way to move forward on this issue. By removing unnecessary complexity, we can make the behaviour more predictable and user-friendly.

@thisisharrsh

But we should also keep in mind the potential future scenarios where additional restart triggers may be introduced.

@ffromani
Contributor

But for large clusters it is not easy to strictly follow the drain-then-upgrade steps. At the very least we should introduce an opt-in flag/gate that lets users keep using the old hash value, which would avoid unnecessary pod restarts during upgrades. We should make a compromise and let cluster administrators make the decision.

In the kubelet resource management area there are quite a few behaviors that were questionable from the beginning, or temporary solutions that stuck around forever, or, more generally, behaviors established incidentally rather than deliberately. Despite not being covered by any backward compatibility promise, they have gone unchanged for many years and users effectively depend on them.

These behaviors are not guaranteed to be stable, yet they are treated as such. So in sig-node we are contemplating, and sometimes already proposing, adding feature gates to protect actual bugfixes or changes that in theory should not require them.

All this preamble is to say that this case resonates with that experience, and I see why a feature gate can be appealing. The thing is: it's too late for that (and reverting would be way worse; I agree with that stance). Retroactively adding a feature gate after 8 months doesn't look great, and I'm not sure how helpful it would be. Perhaps (warning: uninformed musing, I didn't review the original change) the change could have been protected by a feature gate in the first place, and we can be more vigilant in the future?

OTOH, there seems to be a slowly (slowly?) emerging trend around these cases, and listening to users is paramount, so I guess we should probably try to think about a more systemic approach.

Last but not least, feature gate proliferation is not ideal either. It leads to a very fragmented experience, makes our testing matrix explode, and sometimes only postpones the issues. I'm struggling to find a good solution here.
