scheduler: preallocation for NodeToStatusMap #124714

sanposhiho · 2024-05-07T00:03:29Z

What type of PR is this?

/kind bug
/kind regression

What this PR does / why we need it:

Improve the throughput by a preallocation for NodeToStatusMap.

Which issue(s) this PR fixes:

Part of (hopefully fix) #124709

Edit: This was reverted in #125197 prior to release of 1.31.0

Special notes for your reviewer:

Does this PR introduce a user-facing change?

Fix a performance regression in 1.30.0 for scheduling daemonset pods to reach 300 pods/s, if the configured qps allows it.

Additional documentation e.g., KEPs (Kubernetes Enhancement Proposals), usage docs, etc.:

k8s-ci-robot · 2024-05-07T00:03:38Z

This issue is currently awaiting triage.

If a SIG or subproject determines this is a relevant issue, they will accept it by applying the triage/accepted label and provide further guidance.

The triage/accepted label can be added by org members by writing /triage accepted in a comment.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

sanposhiho · 2024-05-07T00:03:42Z

/cc @alculquicondor

alculquicondor · 2024-05-07T13:03:15Z

pkg/scheduler/schedule_one.go


 	allNodes, err := sched.nodeInfoSnapshot.NodeInfos().List()
 	if err != nil {
-		return nil, diagnosis, err
+		return nil, framework.Diagnosis{
+			NodeToStatusMap: make(framework.NodeToStatusMap),


I don't think .List would fail for the snapshot, but couldn't it be left nil?

+1 and maybe we can defer the allocation until unsuccess real happens.

couldn't it be left nil?

Ah, yes we can just leave it nil.

@kerthcet Can you elaborate about your proposal?

I mean we can make the map (allocate the memory) when pod unschedulable real happens rather than preallocate, especially for 5k nodes, however, scheduling runs in serial, so the benefit is small but will make the code more complicated, so this is not a good suggestion. 😢

alculquicondor · 2024-05-07T13:04:08Z

This patch allows us to reach 300 pods/s again

alculquicondor · 2024-05-07T13:04:52Z

/lgtm
/approve

Can you open a cherry-pick for release-1.30?

k8s-ci-robot · 2024-05-07T13:04:58Z

LGTM label has been added.

Git tree hash: cabde4b4d4acb424ef24fafe1fce6f24db0deda6

k8s-ci-robot · 2024-05-07T13:05:09Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: alculquicondor, sanposhiho

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~pkg/scheduler/OWNERS~~ [alculquicondor,sanposhiho]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

saschagrunert · 2024-05-08T08:28:47Z

Is it possible that we regressed something timing related here?

See #124743

sanposhiho · 2024-05-08T08:38:42Z

@saschagrunert Well... This PR is just changing the internal data structure's field to be preallocated.
I'm not very familiar with the failing test though, I cannot think of any scenarios that this PR could affect the behaviors of sig-node's stuff (actually even in the scheduler except perf improvement).

sanposhiho · 2024-05-08T13:09:05Z

Can you open a cherry-pick for release-1.30?

Done; #124753

alculquicondor · 2024-05-08T17:21:54Z

/release-note-edit

Fix throughput when scheduling daemonset pods to reach 300 pods/s, if the configured qps allows it.

…124714-upstream-release-1.30 Automated cherry pick of #124714: scheduler: preallocation for NodeToStatusMap

scheduler: preallocation for NodeToStatusMap

9fcd791

k8s-ci-robot added the needs-priority Indicates a PR lacks a `priority/foo` label and requires one. label May 7, 2024

k8s-ci-robot requested a review from alculquicondor May 7, 2024 00:03

k8s-ci-robot added sig/scheduling Categorizes an issue or PR as relevant to SIG Scheduling. and removed do-not-merge/needs-sig Indicates an issue or PR lacks a `sig/foo` label and requires one. labels May 7, 2024

k8s-ci-robot requested review from chendave and kerthcet May 7, 2024 00:04

k8s-ci-robot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label May 7, 2024

AxeZhan mentioned this pull request May 7, 2024

Throughput degradation scheduling daemonset pods #124709

Closed

alculquicondor reviewed May 7, 2024

View reviewed changes

k8s-ci-robot assigned alculquicondor May 7, 2024

k8s-ci-robot added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label May 7, 2024

k8s-ci-robot merged commit e798b9c into kubernetes:master May 7, 2024
14 checks passed

k8s-ci-robot added this to the v1.31 milestone May 7, 2024

chengjoey mentioned this pull request May 8, 2024

hotfix when a plugin (in-tree or out-of-tree) return non-existent/illegal nodes, the pod scheduling flow will abort immediately. #124559

Merged

sanposhiho mentioned this pull request May 8, 2024

Automated cherry pick of #124714: scheduler: preallocation for NodeToStatusMap #124753

Merged

k8s-ci-robot added release-note Denotes a PR that will be considered when it comes time to generate release notes. and removed release-note-none Denotes a PR that doesn't merit a release note. labels May 8, 2024

k8s-ci-robot added a commit that referenced this pull request May 10, 2024

Merge pull request #124753 from sanposhiho/automated-cherry-pick-of-#…

4183998

…124714-upstream-release-1.30 Automated cherry pick of #124714: scheduler: preallocation for NodeToStatusMap

sanposhiho mentioned this pull request May 11, 2024

cleanup: eliminate unncessary NodeToStatusMap creation #124822

Merged

gabesaba mentioned this pull request May 29, 2024

[scheduler] absent key in NodeToStatusMap implies UnschedulableAndUnresolvable #125197

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

scheduler: preallocation for NodeToStatusMap #124714

scheduler: preallocation for NodeToStatusMap #124714

sanposhiho commented May 7, 2024 •

edited by liggitt

Loading

k8s-ci-robot commented May 7, 2024

sanposhiho commented May 7, 2024

alculquicondor May 7, 2024

kerthcet May 7, 2024

sanposhiho May 8, 2024

kerthcet May 10, 2024 •

edited

Loading

alculquicondor commented May 7, 2024 •

edited

Loading

alculquicondor commented May 7, 2024

k8s-ci-robot commented May 7, 2024

k8s-ci-robot commented May 7, 2024

saschagrunert commented May 8, 2024

sanposhiho commented May 8, 2024 •

edited

Loading

sanposhiho commented May 8, 2024 •

edited

Loading

alculquicondor commented May 8, 2024

scheduler: preallocation for NodeToStatusMap #124714

scheduler: preallocation for NodeToStatusMap #124714

Conversation

sanposhiho commented May 7, 2024 • edited by liggitt Loading

What type of PR is this?

What this PR does / why we need it:

Which issue(s) this PR fixes:

Special notes for your reviewer:

Does this PR introduce a user-facing change?

Additional documentation e.g., KEPs (Kubernetes Enhancement Proposals), usage docs, etc.:

k8s-ci-robot commented May 7, 2024

sanposhiho commented May 7, 2024

alculquicondor May 7, 2024

Choose a reason for hiding this comment

kerthcet May 7, 2024

Choose a reason for hiding this comment

sanposhiho May 8, 2024

Choose a reason for hiding this comment

kerthcet May 10, 2024 • edited Loading

Choose a reason for hiding this comment

alculquicondor commented May 7, 2024 • edited Loading

alculquicondor commented May 7, 2024

k8s-ci-robot commented May 7, 2024

k8s-ci-robot commented May 7, 2024

saschagrunert commented May 8, 2024

sanposhiho commented May 8, 2024 • edited Loading

sanposhiho commented May 8, 2024 • edited Loading

alculquicondor commented May 8, 2024

sanposhiho commented May 7, 2024 •

edited by liggitt

Loading

kerthcet May 10, 2024 •

edited

Loading

alculquicondor commented May 7, 2024 •

edited

Loading

sanposhiho commented May 8, 2024 •

edited

Loading

sanposhiho commented May 8, 2024 •

edited

Loading