[Core] Speed up single_client_tasks_sync #41743

jjyao · 2023-12-08T19:12:02Z

Why are these changes needed?

Currently, IntelGPUAcceleratorManager.get_current_node_num_accelerators() is called every time we look up the accelerator manager for the GPU resource. This call is expensive and slows down single_client_tasks_sync. This PR changes the lookup so that the method is called only once.

Related issue number

#41695

Checks

I've signed off every commit (by using the -s flag, i.e., git commit -s) in this PR.
I've run scripts/format.sh to lint the changes in this PR.
I've included any doc changes needed for https://docs.ray.io/en/master/.
- I've added any new APIs to the API Reference. For example, if I added a
  method in Tune, I've added it in doc/source/tune/api/ under the
  corresponding .rst file.
I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/
Testing Strategy
- Unit tests
- Release tests
- This PR is not tested :(

Signed-off-by: Jiajun Yao <jeromeyjj@gmail.com>

jjyao · 2023-12-08T20:43:37Z

Microbenchmark: https://buildkite.com/ray-project/release/builds/3607#018c4af2-38cb-468f-9403-f6bc9cccd1db

single_client_tasks_sync = [1028.731851730263, 6.294947883118997]
multi_client_tasks_async = [25705.306014103495, 2792.8083138099164]

rickyyx · 2023-12-08T22:25:20Z

python/ray/_private/accelerators/__init__.py

+        )
+    except AttributeError:
+        # Lazy initialization.
+        resource_name_to_accelerator_manager = {


Oh, does this get cached across invocations? I thought a function didn't keep attributes cached like this.

Yes, this basically implements the equivalent of a function-level static variable in C++.
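
For readers unfamiliar with the pattern discussed in this thread, here is a minimal sketch of lazy initialization through a function attribute. It is not the code from this PR: `expensive_probe`, `get_manager_table`, `_table`, and `probe_calls` are hypothetical names chosen for illustration, and the probe only stands in for an expensive call such as the one described above.

```python
# Counts how often the (hypothetical) expensive probe runs.
probe_calls = {"count": 0}


def expensive_probe():
    """Hypothetical stand-in for an expensive hardware query."""
    probe_calls["count"] += 1
    return 2  # pretend the node has two accelerators


def get_manager_table():
    """Return a lookup table that is built only on the first call."""
    try:
        # Fast path: the attribute exists after the first call.
        return get_manager_table._table
    except AttributeError:
        # Lazy initialization: runs once, then the result is stored
        # as an attribute on the function object itself.
        get_manager_table._table = {"GPU": expensive_probe()}
        return get_manager_table._table


if __name__ == "__main__":
    first = get_manager_table()
    second = get_manager_table()
    # Both calls return the same dict; the probe ran a single time.
    print(first is second, probe_calls["count"])
```

Because Python functions are ordinary objects, the attribute persists for the lifetime of the function, which is what makes it behave like a C++ function static. Note that this sketch is not synchronized: two threads racing on the first call could each run the probe once.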

Speed up single_client_tasks_sync

1763278

up

3e320cc

jjyao assigned rickyyx Dec 8, 2023

jjyao requested a review from rickyyx December 8, 2023 22:11

rickyyx reviewed Dec 8, 2023

jjyao requested a review from rickyyx December 8, 2023 23:42

rickyyx approved these changes Dec 9, 2023

rickyyx merged commit ab36f94 into ray-project:master Dec 9, 2023
14 of 15 checks passed

jjyao deleted the jjyao/slow branch December 9, 2023 04:31

jjyao added a commit that referenced this pull request Dec 9, 2023

[Core] Speed up single_client_tasks_sync (#41743)

a4cdc00

jjyao mentioned this pull request Dec 11, 2023

[Core] Perf regression #41695

Closed
