Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Symmetric quantized convolution kernel ARM64 #9772

Merged
merged 7 commits into from
Dec 14, 2021

Conversation

chenfucn
Copy link
Contributor

@chenfucn chenfucn commented Nov 16, 2021

Description: Describe your changes.
Adding a symmetric quantized convolution kernel for ARM64

Pre-dot CPU (Pixel 3a)

Model baseline this change
Cartoongan 1543.21 1530.02
Efficient Net 103.886 100.332
Mobilenet Edge TPU 33.3259 33.0789
Mobilenet v1 19.177 19.995

Post-dot CPU (Pixel 5a)

Model baseline this change
Cartoongan 734.349 708.554
Efficient Net 57.5911 56.9976
Mobilenet Edge TPU 11.3029 10.7636
Mobilenet v1 8.24782 8.05804

Note:
Indirect conv performs worse for shallow convs (input channels are small). This is much more so for low end pre-dot CPUs, where only 128 or deeper conv is faster with indirect conv. With DOT-CPUs, 32 deep conv is already faster

@chenfucn chenfucn force-pushed the cfu_qconv branch 6 times, most recently from 3c70547 to 252d28a Compare November 30, 2021 03:32
@chenfucn chenfucn force-pushed the cfu_qconv branch 14 times, most recently from b640e22 to f6f0519 Compare December 4, 2021 00:43
@chenfucn chenfucn force-pushed the cfu_qconv branch 4 times, most recently from 50fd111 to 5efe775 Compare December 6, 2021 21:40
yufenglee
yufenglee previously approved these changes Dec 8, 2021
Copy link
Member

@yufenglee yufenglee left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

:shipit:

Copy link
Member

@yufenglee yufenglee left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

:shipit:

@chenfucn chenfucn merged commit cd0af7a into microsoft:master Dec 14, 2021
@chenfucn chenfucn deleted the cfu_qconv branch December 14, 2021 18:28
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants