Surface info of failed plugins during PreFilter, Filter and Permit #98041

Huang-Wei · 2021-01-14T00:48:41Z

What type of PR is this?

/kind feature
/sig scheduling

What this PR does / why we need it:

When the framework runs plugins, the information about which plugin a Pod failed on is currently discarded, and only a high-level Status is returned. To resolve #94009, we need to know (1) which plugins are interested in which cluster events, and (2) which plugins a queued Pod failed on. This PR addresses the latter.
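
For illustration only (not code from this PR): a minimal sketch of how a framework loop could tag the returned status with the name of the plugin that rejected the Pod. The types here are simplified stand-ins; the `FilterPlugin` interface shape, the `FailedPlugin()` getter, `RunFilterPlugins` and `fakePlugin` are assumptions made for the example, and only `SetFailedPlugin` appears in the diff under review.

```go
package main

// Code is a simplified stand-in for the scheduler framework's status code.
type Code int

const (
	Success Code = iota
	Error
	Unschedulable
	UnschedulableAndUnresolvable
)

// Status is a simplified stand-in for framework.Status.
type Status struct {
	code         Code
	reasons      []string
	failedPlugin string
}

// NewStatus builds a Status with the given code and reasons.
func NewStatus(code Code, reasons ...string) *Status {
	return &Status{code: code, reasons: reasons}
}

// SetFailedPlugin records the name of the plugin that produced this status.
func (s *Status) SetFailedPlugin(plugin string) { s.failedPlugin = plugin }

// FailedPlugin returns the recorded plugin name (hypothetical getter).
func (s *Status) FailedPlugin() string { return s.failedPlugin }

// IsSuccess treats a nil Status as success.
func (s *Status) IsSuccess() bool { return s == nil || s.code == Success }

// FilterPlugin is a hypothetical, minimal plugin interface for this sketch.
type FilterPlugin interface {
	Name() string
	Filter(pod, node string) *Status
}

// RunFilterPlugins runs plugins in order and returns the first non-success
// status, tagged with the name of the plugin that returned it.
func RunFilterPlugins(plugins []FilterPlugin, pod, node string) *Status {
	for _, pl := range plugins {
		if st := pl.Filter(pod, node); !st.IsSuccess() {
			st.SetFailedPlugin(pl.Name())
			return st
		}
	}
	return nil
}

// fakePlugin always returns a fixed status; it exists only for the example.
type fakePlugin struct {
	name string
	st   *Status
}

func (f fakePlugin) Name() string                    { return f.name }
func (f fakePlugin) Filter(pod, node string) *Status { return f.st }
```

With plugins A (success), B (Unschedulable) and C (Unschedulable) run in that order, the returned status would carry "B" as its failed plugin, since the loop stops at the first rejection.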

Which issue(s) this PR fixes:

Pre-PR of #94009.

Special notes for your reviewer:

This PR starts by surfacing failed plugins only for PreFilter, Filter and Permit; if deemed necessary, we will extend the scope to other plugins.

Does this PR introduce a user-facing change?:

NONE

k8s-ci-robot · 2021-01-14T00:48:43Z

Skipping CI for Draft Pull Request.
If you want CI signal for your change, please convert it to an actual PR.
You can still manually trigger a test run with /test all

k8s-ci-robot · 2021-01-14T00:48:48Z

@Huang-Wei: This issue is currently awaiting triage.

If a SIG or subproject determines this is a relevant issue, they will accept it by applying the triage/accepted label and provide further guidance.

The triage/accepted label can be added by org members by writing /triage accepted in a comment.

pkg/scheduler/core/generic_scheduler.go

Huang-Wei · 2021-01-20T21:29:20Z

/retest

@ahg-g @alculquicondor It's ready for review. This is the first PR to resolve #94009.

ahg-g · 2021-01-26T14:36:46Z

pkg/scheduler/framework/interface.go

@@ -127,6 +131,16 @@ func (s *Status) Message() string {
 	return strings.Join(s.reasons, ", ")
 }

+// SetFailedPlugin sets the given plugin name to s.failedPlugin.
+func (s *Status) SetFailedPlugin(plugin string) {
+	s.failedPlugin = plugin


Since nil is a valid status, should we check for s == nil and return early, here and in other functions as well?

In theory yes, but in practice it is not that necessary because we don't call this on nil or a concrete Success status. (If deemed necessary, guarding AppendReason() matters even more than guarding this.)

ok, we can do that as a follow up.
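
For illustration only: the nil guard discussed in this thread was deferred to a follow-up, so the sketch below is not what this PR does. It shows one way pointer-receiver methods could tolerate a nil Status. The struct is a simplified stand-in, and the `FailedPlugin()` getter is an assumption added so the effect can be observed.

```go
package main

// Status is a simplified stand-in for framework.Status.
type Status struct {
	reasons      []string
	failedPlugin string
}

// SetFailedPlugin is a no-op on a nil Status instead of panicking.
func (s *Status) SetFailedPlugin(plugin string) {
	if s == nil {
		return
	}
	s.failedPlugin = plugin
}

// AppendReason is guarded the same way.
func (s *Status) AppendReason(reason string) {
	if s == nil {
		return
	}
	s.reasons = append(s.reasons, reason)
}

// FailedPlugin returns "" for a nil Status (hypothetical getter).
func (s *Status) FailedPlugin() string {
	if s == nil {
		return ""
	}
	return s.failedPlugin
}
```

Go allows calling a pointer-receiver method on a nil pointer, so the guard has to live inside each method; without it, the field write would panic at runtime.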

ahg-g · 2021-01-26T14:38:15Z

pkg/scheduler/framework/interface.go

@@ -109,6 +109,10 @@ type Status struct {
 	code    Code
 	reasons []string
 	err     error
+	// failedPlugin is an optional field that records the plugin name a Pod failed by.
+	// It's only set by the framework when a Unschedulable or UnschedulableAndUnresolvable


shouldn't we also include Error?

I read Error more as an abnormal fault that is usually not associated with the plugin's semantics, such as "node not found", "internal labelSelector deducing error", etc. So my original idea was that for the Error type we don't set failedPlugin, so at runtime:

if a Pod returns an error on all nodes, its aggregated failedPlugin set is nil, so we move this Pod upon any cluster event unconditionally

if a Pod returns a non-error status on some nodes, we intersect the plugin names associated with a cluster event with those failedPlugins

This may prevent intermittent internal errors from disabling the Pod's move. WDYT?
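
For illustration only: a minimal sketch of the move decision described in this comment, assuming both inputs are plain sets of plugin names. The function `shouldMove`, the helper `setOf` and the plugin names used with them are hypothetical; the actual queueing logic is out of scope for this PR.

```go
package main

// shouldMove reports whether a queued Pod should be moved upon a cluster event.
//
// failedPlugins holds the plugin names the Pod failed on, aggregated across
// nodes. It is empty when every node returned Error, in which case the Pod is
// moved unconditionally.
// eventPlugins holds the plugin names interested in the cluster event.
func shouldMove(failedPlugins, eventPlugins map[string]struct{}) bool {
	if len(failedPlugins) == 0 {
		return true
	}
	// Move only if the two sets intersect.
	for name := range eventPlugins {
		if _, ok := failedPlugins[name]; ok {
			return true
		}
	}
	return false
}

// setOf builds a set of names; it exists only to keep the example short.
func setOf(names ...string) map[string]struct{} {
	s := make(map[string]struct{}, len(names))
	for _, n := range names {
		s[n] = struct{}{}
	}
	return s
}
```

Under these assumptions, a Pod that failed on PluginA is moved by an event that PluginA cares about, is left alone by an event that only PluginB cares about, and is always moved when its failed set is empty.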

Just for the sake of separation of concerns, shouldn't the logic receiving the status decide how to deal with different codes?

Hmm, even if we set failedPlugin upon Error, that info wouldn't be carried all the way to the logic which decides whether to move pods:

Upon an error in (Pre)Filter, diagnosis will be discarded:

feasibleNodes, diagnosis, err := g.findNodesThatFitPod(ctx, fwk, state, pod)
if err != nil {
	return result, err
}

Moreover, during Filter(), the error is returned immediately once we hit the first Error (the other Filter() calls running in parallel are cancelled). So even if we carried the failedPlugin (on error) all the way to the Pod moving logic, this failedPlugin might be misleading.
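
For illustration only: a minimal sketch of the "first error cancels the rest" behaviour described above, assuming one goroutine per node and a shared cancellable context. `filterNodes`, `filterFunc`, `failOn` and `errInternal` are hypothetical names and do not mirror the scheduler's actual parallelization code.

```go
package main

import (
	"context"
	"errors"
	"sync"
)

// errInternal is a hypothetical internal fault, e.g. "node not found".
var errInternal = errors.New("node not found")

// filterFunc is a hypothetical stand-in for running all Filter plugins on one node.
type filterFunc func(ctx context.Context, node string) error

// filterNodes checks nodes in parallel. The first error cancels the shared
// context, so nodes that have not started yet are skipped.
func filterNodes(nodes []string, filter filterFunc) (checked []string, err error) {
	ctx, cancel := context.WithCancel(context.Background())
	defer cancel()

	var (
		mu sync.Mutex
		wg sync.WaitGroup
	)
	for _, node := range nodes {
		wg.Add(1)
		go func(node string) {
			defer wg.Done()
			if ctx.Err() != nil {
				return // another node already hit an error
			}
			ferr := filter(ctx, node)
			mu.Lock()
			defer mu.Unlock()
			checked = append(checked, node)
			if ferr != nil && err == nil {
				err = ferr
				cancel()
			}
		}(node)
	}
	wg.Wait()
	return checked, err
}

// failOn returns a filterFunc that fails with errInternal on the named node only.
func failOn(bad string) filterFunc {
	return func(_ context.Context, node string) error {
		if node == bad {
			return errInternal
		}
		return nil
	}
}
```

Which nodes end up in `checked` depends on goroutine scheduling, which is the point made in the comment: after an Error, the set of evaluated nodes (and so any plugin name attached to them) is incomplete and can vary between runs.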

But you have access to that when building the "diagnosis". In any case, my point is that "failed" != unschedulable. So we either include all failure modes, or name the field to indicate its limited scope.

In any case, my point is that "failed" != unschedulable.

That's a fair point. I think I will keep failedPlugin in Status (more specifically, set it only for Error, Unschedulable and UnschedulableAndUnresolvable, and not for Success, Wait and Skip). In diagnosis, I will rename failedPlugin to unschedulablePlugins.

changing the name to unschedulablePlugins sounds good to me.
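
For illustration only: a minimal sketch of the split agreed on above, where Status keeps a failed plugin for every failure code while the diagnosis collects only the unschedulable ones. The types are simplified stand-ins; `buildDiagnosis` and `PluginNames` are hypothetical helpers, and excluding Error from the set is an assumption drawn from the new field name.

```go
package main

import "sort"

// Code is a simplified stand-in for the scheduler framework's status code.
type Code int

const (
	Success Code = iota
	Error
	Unschedulable
	UnschedulableAndUnresolvable
)

// Status is a simplified stand-in for framework.Status.
type Status struct {
	code         Code
	failedPlugin string
}

// NodeToStatusMap maps a node name to the status the Pod got on that node.
type NodeToStatusMap map[string]*Status

// Diagnosis is a simplified stand-in for the scheduler's diagnosis record.
type Diagnosis struct {
	NodeToStatusMap      NodeToStatusMap
	UnschedulablePlugins map[string]struct{}
}

// buildDiagnosis aggregates plugin names from Unschedulable and
// UnschedulableAndUnresolvable statuses; Error statuses are skipped (assumption).
func buildDiagnosis(statuses NodeToStatusMap) Diagnosis {
	d := Diagnosis{
		NodeToStatusMap:      statuses,
		UnschedulablePlugins: map[string]struct{}{},
	}
	for _, st := range statuses {
		if st == nil || st.failedPlugin == "" {
			continue
		}
		switch st.code {
		case Unschedulable, UnschedulableAndUnresolvable:
			d.UnschedulablePlugins[st.failedPlugin] = struct{}{}
		}
	}
	return d
}

// PluginNames returns the collected plugin names in sorted order.
func (d Diagnosis) PluginNames() []string {
	names := make([]string, 0, len(d.UnschedulablePlugins))
	for n := range d.UnschedulablePlugins {
		names = append(names, n)
	}
	sort.Strings(names)
	return names
}
```

Given three nodes whose statuses are Unschedulable from PluginA, Error from PluginB and UnschedulableAndUnresolvable from PluginC, this sketch collects PluginA and PluginC and leaves PluginB out.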

pkg/scheduler/framework/runtime/framework.go

Huang-Wei · 2021-01-27T01:43:32Z

/retest

ahg-g · 2021-01-28T19:44:09Z

/lgtm
/approve
/hold

not sure if you are waiting for others, so holding.

k8s-ci-robot · 2021-01-28T19:44:44Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: ahg-g, Huang-Wei

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~pkg/scheduler/OWNERS~~ [Huang-Wei,ahg-g]
~~test/integration/scheduler/OWNERS~~ [Huang-Wei,ahg-g]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

alculquicondor · 2021-01-28T20:01:36Z

Giving a quick look after the fire has finished :)

alculquicondor

Is the PR description outdated? It seems that we did add it to Permit already.

pkg/scheduler/framework/interface.go

alculquicondor · 2021-01-28T20:05:22Z

pkg/scheduler/core/generic_scheduler_test.go

+					NodeToStatusMap: framework.NodeToStatusMap{
+						"1": framework.NewStatus(framework.UnschedulableAndUnresolvable, "injected unschedulable status"),
+						"2": framework.NewStatus(framework.UnschedulableAndUnresolvable, "injected unschedulable status"),
+					},


leave a TODO as comment in the code too.

alculquicondor · 2021-01-28T20:05:31Z

pkg/scheduler/core/generic_scheduler_test.go


 	if err != nil {
 		t.Errorf("unexpected error: %v", err)
 	}

-	if len(nodeToStatusMap) != len(nodes) {
-		t.Errorf("unexpected failed status map: %v", nodeToStatusMap)
+	if len(diagnosis.NodeToStatusMap) != len(nodes) {


leave as comment in code

alculquicondor · 2021-01-28T20:27:58Z

/lgtm

Huang-Wei · 2021-01-28T21:11:52Z

Thanks @ahg-g @alculquicondor @chendave all for the review!

/hold cancel
/retest

k8s-ci-robot added needs-triage Indicates an issue or PR lacks a `triage/foo` label and requires one. needs-priority Indicates a PR lacks a `priority/foo` label and requires one. approved Indicates a PR has been approved by an approver from all required OWNERS files. labels Jan 14, 2021

k8s-ci-robot requested review from damemi and k82cn January 14, 2021 00:49

Huang-Wei marked this pull request as ready for review January 14, 2021 01:06

k8s-ci-robot removed the do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress. label Jan 14, 2021

Huang-Wei changed the title ~~Surface info of failed plugins during PerFilter and Filter~~ [WIP] Surface info of failed plugins during PerFilter and Filter Jan 14, 2021

k8s-ci-robot added the do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress. label Jan 14, 2021

chendave reviewed Jan 14, 2021

pkg/scheduler/core/generic_scheduler.go (outdated)

k8s-ci-robot added the needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. label Jan 14, 2021

Huang-Wei force-pushed the sched-enqueue-1 branch from 81da742 to 1dea606 Compare January 14, 2021 19:34

k8s-ci-robot added needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. and removed needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. labels Jan 14, 2021

Huang-Wei force-pushed the sched-enqueue-1 branch from 1dea606 to b41f817 Compare January 19, 2021 23:52

k8s-ci-robot removed the needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. label Jan 19, 2021

Huang-Wei changed the title ~~[WIP] Surface info of failed plugins during PerFilter and Filter~~ Surface info of failed plugins during PerFilter and Filter Jan 20, 2021

k8s-ci-robot removed the do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress. label Jan 20, 2021

This was referenced Jan 20, 2021

Store a cluster event to plugin map in SchedulerQueue #98241

Merged

Avoid moving pods out of unschedulable status unconditionally #94009

Closed

PoC: ServiceAffinity enqueue #98248

Closed

ahg-g reviewed Jan 26, 2021

k8s-ci-robot added the needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. label Jan 26, 2021

Huang-Wei force-pushed the sched-enqueue-1 branch from 2afff6f to aba4947 Compare January 26, 2021 21:11

k8s-ci-robot removed the needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. label Jan 26, 2021

Huang-Wei force-pushed the sched-enqueue-1 branch from aba4947 to c92cddc Compare January 26, 2021 23:16

k8s-ci-robot added size/L Denotes a PR that changes 100-499 lines, ignoring generated files. and removed size/XL Denotes a PR that changes 500-999 lines, ignoring generated files. labels Jan 26, 2021

Huang-Wei force-pushed the sched-enqueue-1 branch from c92cddc to e7311b1 Compare January 28, 2021 19:29

k8s-ci-robot added the do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. label Jan 28, 2021

k8s-ci-robot assigned ahg-g Jan 28, 2021

k8s-ci-robot added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Jan 28, 2021

alculquicondor reviewed Jan 28, 2021

Surface info of failed plugins during PerFilter and Filter

f8a6bdb

Huang-Wei force-pushed the sched-enqueue-1 branch from e7311b1 to f8a6bdb Compare January 28, 2021 20:21

k8s-ci-robot removed the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Jan 28, 2021

k8s-ci-robot assigned alculquicondor Jan 28, 2021

k8s-ci-robot added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Jan 28, 2021

k8s-ci-robot removed the do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. label Jan 28, 2021

k8s-ci-robot merged commit f402e47 into kubernetes:master Jan 29, 2021

k8s-ci-robot added this to the v1.21 milestone Jan 29, 2021

Huang-Wei deleted the sched-enqueue-1 branch February 5, 2021 19:14

Huang-Wei changed the title ~~Surface info of failed plugins during PerFilter and Filter~~ Surface info of failed plugins during PerFilter, Filter and Permit Mar 7, 2021

Huang-Wei changed the title ~~Surface info of failed plugins during PerFilter, Filter and Permit~~ Surface info of failed plugins during PreFilter, Filter and Permit Mar 7, 2021

Huang-Wei mentioned this pull request Mar 17, 2021

[Umbrella] Fine-grained pods enqueuing in scheduler #100347

Closed
