-
Notifications
You must be signed in to change notification settings - Fork 39.9k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
e2e: node: memory manager: wait for pod pending deletion before do the actual test #128894
base: master
Are you sure you want to change the base?
e2e: node: memory manager: wait for pod pending deletion before do the actual test #128894
Conversation
The metrics tests, memory manager first and foremost, expect a controlled environment on which they run. This is why they are labeled Serial. The key factor here is the predictability: the test expect to be in full control of the node state; if any other actor mutates the node state, that makes for a almost guaranteed false negative. To reduce the chance to flake, we add an extra check before to run the actual tests: detect pods pending deletion, and wait for them to be gone. Signed-off-by: Francesco Romani <fromani@redhat.com>
This issue is currently awaiting triage. If a SIG or subproject determines this is a relevant issue, they will accept it by applying the The Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. |
[APPROVALNOTIFIER] This PR is NOT APPROVED This pull-request has been approved by: ffromani The full list of commands accepted by this bot can be found here.
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
/hold if we like this direction, I want to update all the other metrics tests |
What type of PR is this?
/kind bug
/kind flake
What this PR does / why we need it:
The metrics tests, memory manager first and foremost, expect a controlled environment on which they run.
This is why they are labeled Serial. The key factor here is the predictability: the test expect
to be in full control of the node state; if any other actor mutates the node state, that makes for a almost guaranteed
false negative.
To reduce the chance to flake, we add an extra check before to run the actual tests: detect pods pending deletion, and
wait for them to be gone.
Which issue(s) this PR fixes:
Fixes #128869
Special notes for your reviewer:
see slack thread: https://kubernetes.slack.com/archives/C0BP8PW9G/p1732023371237679
Does this PR introduce a user-facing change?