Failure cluster [c199077e...]: Pod InPlace Resize Container pod-resize-resource-quota-test #128840
Comments
/triage accepted |
/assign |
Though I haven't reached the root cause, this issue doesn't look to have been caused by In ci-kubernetes-e2e-gci-gce #1858341567471816704: Referring to kube-controller-manager.log, Resource Quota Monitor detected pod updates:
However, Resource Quota Controller could not update the resource quota immediately:
It looks that kubernetes/pkg/controller/resourcequota/resource_quota_controller.go Lines 471 to 500 in 35d098a
So, the workers that update resource quotas in Resource Quota Controller were blocked waiting to acquire the lock: kubernetes/pkg/controller/resourcequota/resource_quota_controller.go Lines 263 to 270 in 35d098a
|
The "Expected an error to have occurred. Got: : nil" looks suspiciously like the flake that I got in an E2E test for DRA. Does the test assume that creating some object is reflected in the v1.ResourceQuota immediately and then quota is enforced by the apiserver? Further debugging of the DRA failure led to the (so far tentative) conclusion that the quota mechanism is not perfectly reliable - see #128899 (comment). |
At a glance, there seems to be another race with resource quota. This issue looks to have been caused by a test of CRDValidationRatcheting:
In kube-apiserver.log, CRD
As described in #128840 (comment), Resource Quota Controller looks to have been trying to sync the deleted resource
|
Failure cluster c199077e785ee1358fcf
Error text:
Another failure:
Recent failures:
11/18/2024, 3:48:51 AM ci-kubernetes-e2e-gci-gce
11/17/2024, 5:34:50 PM ci-kubernetes-e2e-gci-gce-ip-alias
11/17/2024, 11:42:35 AM ci-containerd-e2e-ubuntu-gce
11/17/2024, 5:48:18 AM ci-kubernetes-e2e-kind-alpha-beta-features
11/16/2024, 12:34:32 PM ci-containerd-e2e-ubuntu-gce
/kind failing-test
/kind flake
/sig node