New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

[CoreML] support coreml model cache #23065

Merged

wejoncy merged 36 commits into main from jicwen/coreml_cache

Dec 31, 2024

Contributor

wejoncy commented Dec 10, 2024 •

edited

Loading

Description

Refactor compute plan profiling

Support cache coreml model to speed up session initialization. this is only support by user provided entry and user responsible to manage the cache

With the cache, session initialization time can be reduced by 50% or more:

model	before	after
yolo11.onnx	0.6s	0.1s
yolo11-fp16.onnx	1.8s	0.1s

Motivation and Context

wejoncy requested a review from skottmckay

December 10, 2024 10:26

wejoncy marked this pull request as ready for review

December 10, 2024 10:26

wejoncy linked an issue

that may be closed by this pull request

CoreML - Writing CoreML Model on every inference session creation #21761

Closed

wejoncy force-pushed the jicwen/coreml_cache branch from 5bfc8eb to fc9db07 Compare

December 10, 2024 11:10


          support coreml model cache

1d1c874

wejoncy force-pushed the jicwen/coreml_cache branch from d539da2 to 1d1c874 Compare

December 10, 2024 12:42

wejoncy and others added 3 commits

December 11, 2024 11:37


          improve

b7888c4

fix

f492fee


          better hash

7b11848

wejoncy force-pushed the jicwen/coreml_cache branch from 81c2b9e to 7b11848 Compare

December 16, 2024 06:16


          refactor output -path

4a5772f

skottmckay reviewed

View reviewed changes

include/onnxruntime/core/providers/coreml/coreml_provider_factory.h Outdated Show resolved Hide resolved

onnxruntime/core/providers/coreml/coreml_options.cc Outdated Show resolved Hide resolved

include/onnxruntime/core/providers/coreml/coreml_provider_factory.h Outdated Show resolved Hide resolved

include/onnxruntime/core/providers/coreml/coreml_provider_factory.h Outdated Show resolved Hide resolved

onnxruntime/core/providers/coreml/builders/model_builder.cc Outdated Show resolved Hide resolved

onnxruntime/core/providers/coreml/builders/model_builder.cc Outdated Show resolved Hide resolved

onnxruntime/core/providers/coreml/builders/model_builder.cc Outdated Show resolved Hide resolved

onnxruntime/core/providers/coreml/builders/model_builder.cc Outdated Show resolved Hide resolved

onnxruntime/core/providers/coreml/coreml_execution_provider.cc Outdated Show resolved Hide resolved

onnxruntime/core/providers/coreml/coreml_options.cc Outdated Show resolved Hide resolved

skottmckay reviewed

View reviewed changes

onnxruntime/core/providers/coreml/coreml_execution_provider.cc Outdated Show resolved Hide resolved

wejoncy and others added 15 commits

December 16, 2024 17:26


          address comments

b57aa28


          remove extra check

723b2dd


          Apply suggestions from code review

4f0ac2a

Co-authored-by: Scott McKay <skottmckay@gmail.com>


          improve doc

781e42e


          typo

26775b4


          check cache-key

89317c6


          validate alpha-number

773dce0

fix

e82f3e4


          format

d3d25b9


          fix bug

d053dc0


          format

2779e3d


          renaming

c7194ad


          max 64 chars

8204e64


          polish cache path

9c9374c

fix

8faf178

skottmckay reviewed

View reviewed changes

include/onnxruntime/core/providers/coreml/coreml_provider_factory.h Outdated Show resolved Hide resolved

include/onnxruntime/core/providers/coreml/coreml_provider_factory.h Outdated Show resolved Hide resolved

include/onnxruntime/core/providers/coreml/coreml_provider_factory.h Outdated Show resolved Hide resolved

onnxruntime/core/providers/coreml/coreml_execution_provider.cc Outdated Show resolved Hide resolved

onnxruntime/core/providers/coreml/builders/model_builder.cc Outdated Show resolved Hide resolved

onnxruntime/core/providers/coreml/builders/model_builder.cc Show resolved Hide resolved


          Update include/onnxruntime/core/providers/coreml/coreml_provider_fact…

e4e3547

…ory.h

Co-authored-by: Scott McKay <skottmckay@gmail.com>


          Update include/onnxruntime/core/providers/coreml/coreml_provider_fact…

e49112c

…ory.h

Co-authored-by: Scott McKay <skottmckay@gmail.com>

github-actions bot reviewed

View reviewed changes

include/onnxruntime/core/providers/coreml/coreml_provider_factory.h Outdated Show resolved Hide resolved


          disable caching in runtime.

d7b867c

skottmckay reviewed

View reviewed changes

onnxruntime/core/providers/coreml/builders/model_builder.cc Outdated Show resolved Hide resolved

onnxruntime/core/providers/coreml/builders/model_builder.cc Outdated Show resolved Hide resolved

onnxruntime/core/providers/coreml/builders/model_builder.cc Outdated Show resolved Hide resolved

onnxruntime/core/providers/coreml/coreml_execution_provider.cc Outdated Show resolved Hide resolved

onnxruntime/core/providers/coreml/coreml_options.h Outdated Show resolved Hide resolved

wejoncy and others added 5 commits

December 20, 2024 16:50


          Apply suggestions from code review

7c466a1

Co-authored-by: Scott McKay <skottmckay@gmail.com>


          address comments

d1e7633

fix

a5ffe03


          format

5518e38


          format

70075e5

wejoncy requested a review from skottmckay

December 23, 2024 07:57

Contributor

skottmckay commented Dec 23, 2024

Are there any unit tests for the new code? We should be able to test that the expected cache files are created in the right places and that things like invalid cache key values are rejected.

skottmckay reviewed

View reviewed changes

onnxruntime/core/providers/coreml/coreml_execution_provider.cc Show resolved Hide resolved

wejoncy and others added 2 commits

December 24, 2024 13:38


          Update onnxruntime/core/providers/coreml/coreml_execution_provider.cc

dc52361

Co-authored-by: Scott McKay <skottmckay@gmail.com>


          add test for model cache

31e6c68

Contributor Author

wejoncy commented Dec 24, 2024 •

edited

Loading

Are there any unit tests for the new code? We should be able to test that the expected cache files are created in the right places and that things like invalid cache key values are rejected.

Added unit test for three cases where hash is valid or invalid.

wejoncy added 2 commits

December 24, 2024 15:18

ut

78b2a4b

ut

5f7bddc

github-actions bot reviewed

View reviewed changes

Contributor

github-actions bot left a comment •

edited by wejoncy

Loading

,


          lint

f9db65f

microsoft deleted a comment from github-actions bot

skottmckay previously approved these changes

View reviewed changes

Contributor

skottmckay left a comment

wejoncy added 2 commits

December 30, 2024 03:37


          Merge branch 'main' into jicwen/coreml_cache

391f914


          fix test

201368d

wejoncy dismissed skottmckay’s stale review via

201368d

December 30, 2024 04:07

wejoncy requested a review from skottmckay

December 30, 2024 07:39

skottmckay approved these changes

View reviewed changes

wejoncy merged commit 8687011 into main

96 checks passed

wejoncy deleted the jicwen/coreml_cache branch

December 31, 2024 01:29

tarekziade pushed a commit to tarekziade/onnxruntime that referenced this pull request


          [CoreML] support coreml model cache (microsoft#23065)

aa1a9a8

### Description
Refactor compute plan profiling

Support cache coreml model to speed up session initialization. this is
only support by user provided entry and user responsible to manage the
cache


With the cache, session initialization time can be reduced by 50% or
more:
|model| before| after|
|--|--|--|
|yolo11.onnx| 0.6s|0.1s|
|yolo11-fp16.onnx|1.8s|0.1s|


### Motivation and Context
<!-- - Why is this change required? What problem does it solve?
- If it fixes an open issue, please link to the issue here. -->

---------

Co-authored-by: wejoncy <wejoncy@.com>
Co-authored-by: Scott McKay <skottmckay@gmail.com>

guschmue pushed a commit that referenced this pull request


          [CoreML] support coreml model cache (#23065)

ede03de

### Description
Refactor compute plan profiling

Support cache coreml model to speed up session initialization. this is
only support by user provided entry and user responsible to manage the
cache


With the cache, session initialization time can be reduced by 50% or
more:
|model| before| after|
|--|--|--|
|yolo11.onnx| 0.6s|0.1s|
|yolo11-fp16.onnx|1.8s|0.1s|


### Motivation and Context
<!-- - Why is this change required? What problem does it solve?
- If it fixes an open issue, please link to the issue here. -->

---------

Co-authored-by: wejoncy <wejoncy@.com>
Co-authored-by: Scott McKay <skottmckay@gmail.com>

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet