
[Feature] enable bf16 in AmpOptimWrapper #960

Merged Mar 1, 2023 (18 commits)

Conversation

C1rN09 (Collaborator) commented Feb 24, 2023

Thanks for your contribution; we appreciate it a lot. The following instructions will make your pull request healthier and help it receive feedback more easily. If you do not understand some items, don't worry: just open the pull request and ask the maintainers for help.

Motivation

Enable `torch.bfloat16` in `AmpOptimWrapper` and in config files.

Modification

Added a new optional argument `dtype` to `AmpOptimWrapper`; it can be a `str` or a `torch.dtype`.

BC-breaking (Optional)

No

Use cases (Optional)

    optim_wrapper = dict(
        type='AmpOptimWrapper',
        dtype='bfloat16',  # one of None, 'float32', 'float16', 'bfloat16', ...
        ...
    )
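As a sketch of what the new argument accepts, a string such as `'bfloat16'` can be mapped onto the matching `torch.dtype` attribute. The helper below is illustrative only (`resolve_dtype` and `VALID_DTYPES` are hypothetical names, not the actual mmengine implementation):

```python
import torch

# Hypothetical whitelist of string dtypes; the real AmpOptimWrapper may accept a
# different set.
VALID_DTYPES = ('float16', 'bfloat16', 'float32', 'float64')

def resolve_dtype(dtype):
    """Accept a string like 'bfloat16' or a torch.dtype and return a torch.dtype."""
    if isinstance(dtype, torch.dtype):
        return dtype
    if isinstance(dtype, str) and dtype in VALID_DTYPES:
        # e.g. getattr(torch, 'bfloat16') -> torch.bfloat16
        return getattr(torch, dtype)
    raise ValueError(
        f'dtype must be a torch.dtype or one of {VALID_DTYPES}, got {dtype!r}')
```

A wrapper normalized this way can pass the resulting `torch.dtype` straight to `torch.autocast(..., dtype=...)`.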

Test results

mmcls::resnet50_8xb256-rsb-a3-100e_in1k

| setting | iter_time | memory | acc_top1 | acc_top5 |
| --- | --- | --- | --- | --- |
| no amp | 0.4230 | 21665 | 78.30 | 93.80 |
| amp float16 | 0.4239 | 11481 | -- | -- |
| amp bfloat16 | 1.0812 | 11494 | -- | -- |
| amp bfloat16 + cudnn8 | 0.4251 | 11478 | 78.42 | 93.85 |

Hint: Amp with `dtype=torch.bfloat16` performs poorly on convolutions because PyTorch does not route them through cuDNN by default. Enable cuDNN v8 bfloat16 convolutions with the environment variable `TORCH_CUDNN_V8_API_ENABLED=1`.
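To apply the hint above, set the variable in the environment before launching training. The `echo` is only there to confirm the setting; the training command shown in the note is a hypothetical entry point, not part of this PR:

```shell
# Route bfloat16 convolutions through the cuDNN v8 API (flag name taken from
# the hint above; recognized by recent PyTorch builds).
export TORCH_CUDNN_V8_API_ENABLED=1
echo "TORCH_CUDNN_V8_API_ENABLED=${TORCH_CUDNN_V8_API_ENABLED}"
```

In practice you would prefix your launch command, e.g. `TORCH_CUDNN_V8_API_ENABLED=1 python <train_script> <config>`.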

mmdet::retinanet_r50_fpn_1x_coco

Failed because `torch.bfloat16` is not supported by `F.interpolate`.

Checklist

  1. Pre-commit or other linting tools are used to fix the potential lint issues.
  2. The modification is covered by complete unit tests. If not, please add more unit tests to ensure correctness.
  3. If the modification has potential influence on downstream projects, this PR should be tested with downstream projects, like MMDet or MMCls.
  4. The documentation has been modified accordingly, like docstring or example tutorials.

@codecov

codecov bot commented Feb 24, 2023

Codecov Report

❗ No coverage uploaded for pull request base (main@d8abf9a).
Patch has no changes to coverable lines.

❗ Current head bc3313a differs from pull request most recent head 4c350c6. Consider uploading reports for the commit 4c350c6 to get more accurate results

Additional details and impacted files
@@           Coverage Diff           @@
##             main     #960   +/-   ##
=======================================
  Coverage        ?   76.55%           
=======================================
  Files           ?      138           
  Lines           ?    10846           
  Branches        ?     2167           
=======================================
  Hits            ?     8303           
  Misses          ?     2186           
  Partials        ?      357           
Flag Coverage Δ
unittests 76.55% <0.00%> (?)

Flags with carried forward coverage won't be shown.


@C1rN09 C1rN09 marked this pull request as ready for review February 24, 2023 08:31
@C1rN09 C1rN09 requested a review from zhouzaida as a code owner February 24, 2023 08:31
HAOCHENYE previously approved these changes Feb 28, 2023
CLAassistant commented Mar 1, 2023

CLA assistant check
All committers have signed the CLA.

@zhouzaida zhouzaida merged commit 2ed8e34 into open-mmlab:main Mar 1, 2023
4 participants