Add MobileNetV3 Architecture in TorchVision #3182

datumbox · 2020-12-16T17:50:25Z

Partially fixes #1676

Depends and cherrypicks commits from #3177

The current temporary pre-trained model was trained:

python -m torch.distributed.launch --nproc_per_node=8 --use_env train.py\
--model mobilenet_v3_large --epochs 600 --opt rmsprop --batch-size 128 --lr 0.064\ 
--wd 0.00001 --lr-step-size 2 --lr-gamma 0.973 --auto-augment imagenet --random-erase 0.2

Submitted batch job 34241491

Then we took the 3 last checkpoints (epochs 549, 528, 408) that improved the Acc@1 and averaged their parameters using the following script:

# from https://github.com/pytorch/fairseq/blob/master/scripts/average_checkpoints.py
import collections
import torch


def average_checkpoints(inputs):
    params_dict = collections.OrderedDict()
    params_keys = None
    new_state = None
    num_models = len(inputs)
    for fpath in inputs:
        with open(fpath, "rb") as f:
            state = torch.load(
                f,
                map_location=(
                    lambda s, _: torch.serialization.default_restore_location(s, "cpu")
                ),
            )
        # Copies over the settings from the first checkpoint
        if new_state is None:
            new_state = state
        model_params = state["model"]
        model_params_keys = list(model_params.keys())
        if params_keys is None:
            params_keys = model_params_keys
        elif params_keys != model_params_keys:
            raise KeyError(
                "For checkpoint {}, expected list of params: {}, "
                "but found: {}".format(f, params_keys, model_params_keys)
            )
        for k in params_keys:
            p = model_params[k]
            if isinstance(p, torch.HalfTensor):
                p = p.float()
            if k not in params_dict:
                params_dict[k] = p.clone()
                # NOTE: clone() is needed in case of p is a shared parameter
            else:
                params_dict[k] += p
    averaged_params = collections.OrderedDict()
    for k, v in params_dict.items():
        averaged_params[k] = v
        if averaged_params[k].is_floating_point():
            averaged_params[k].div_(num_models)
        else:
            averaged_params[k] //= num_models
    new_state["model"] = averaged_params
    return new_state


def avg(epochs, filename):
    paths = ["model_{}.pth".format(i) for i in epochs]
    weights = average_checkpoints(paths)
    torch.save(weights, filename.format(len(epochs)))

avg([549, 528, 408], "model_best{}avg.pth")

Validated with:

python -m torch.distributed.launch --nproc_per_node=8 --use_env train.py\
 --model mobilenet_v3_large --test-only --pretrained

Submitted batch job 34643680

Accuracy metrics:
Acc@1 74.042 Acc@5 91.340

Speed Benchmark: 0.0411 sec per image on CPU

…sses and methods.

# Conflicts: # torchvision/models/mobilenet.py # torchvision/models/quantization/mobilenet.py

torchvision/models/mobilenetv3.py

codecov · 2020-12-16T18:53:07Z

Codecov Report

Merging #3182 (5030435) into master (4cbe714) will increase coverage by 0.30%.
The diff coverage is 94.95%.

@@            Coverage Diff             @@
##           master    #3182      +/-   ##
==========================================
+ Coverage   73.49%   73.79%   +0.30%     
==========================================
  Files         101      102       +1     
  Lines        9235     9354     +119     
  Branches     1477     1490      +13     
==========================================
+ Hits         6787     6903     +116     
- Misses       1991     1993       +2     
- Partials      457      458       +1

Impacted Files	Coverage Δ
torchvision/models/mobilenetv3.py	`94.91% <94.91%> (ø)`
torchvision/models/mobilenet.py	`100.00% <100.00%> (ø)`
torchvision/models/mobilenetv2.py	`86.51% <0.00%> (+3.37%)`	⬆️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 4cbe714...9a758a8. Read the comment docs.

vfdev-5

Nice PR @datumbox !
Few nits...

torchvision/models/mobilenetv3.py

datumbox · 2020-12-17T12:48:02Z

The failing builds seem unrelated. See issue #3183

datumbox

I added a few comments to assist review.

datumbox · 2021-01-05T12:50:25Z

torchvision/models/mobilenetv3.py

+# TODO: add pretrained
+model_urls = {
+    "mobilenet_v3_large": None,
+    "mobilenet_v3_small": None,


Pending S3 bucket access and training finalization.

datumbox · 2021-01-05T12:51:37Z

references/classification/train.py

+            model.parameters(), lr=args.lr, momentum=args.momentum, weight_decay=args.weight_decay)
+    elif opt_name == 'rmsprop':
+        optimizer = torch.optim.RMSprop(model.parameters(), lr=args.lr, momentum=args.momentum,
+                                        weight_decay=args.weight_decay, eps=0.0316, alpha=0.9)


These hardcoded params are crucial for convergence =. They can be turned into args.

datumbox · 2021-01-05T19:10:29Z

I will merge this PR on a separate branch to continue with the changes necessary for Object Detection. I'll send a new PR on master once all changes are final.

* Add MobileNetV3 Architecture in TorchVision (#3182) * Adding implementation of network architecture * Adding rmsprop support on the train.py * Adding auto-augment and random-erase in the training scripts. * Adding support for reduced tail on MobileNetV3. * Tagging blocks with comments. * Adding documentation, pre-trained model URL and a minor refactoring. * Handling better untrained supported models.

Summary: * Add MobileNetV3 Architecture in TorchVision (#3182) * Adding implementation of network architecture * Adding rmsprop support on the train.py * Adding auto-augment and random-erase in the training scripts. * Adding support for reduced tail on MobileNetV3. * Tagging blocks with comments. * Adding documentation, pre-trained model URL and a minor refactoring. * Handling better untrained supported models. Reviewed By: datumbox Differential Revision: D25954557 fbshipit-source-id: f7d72a81a2ec92cbbbf3bd86c68ae0a426626cc7

datumbox added 15 commits December 15, 2020 22:39

partial implementation network architecture

22b9b12

Simplify implementation and adding blocks.

0cff6fb

Refactoring the code to make it more readable.

fd54fdf

Adding first conv layers.

834b185

Moving mobilenet.py to mobilenetv2.py

1edd16b

Adding mobilenet.py for BC.

2f52f0d

Extending ConvBNReLU for reuse.

bb2ec9e

Moving mobilenet.py to mobilenetv2.py

e95ee5c

Adding mobilenet.py for BC.

2ebe8ba

Extending ConvBNReLU for reuse.

0c31a33

Reduce import scope on mobilenet to only the public and versioned cla…

db7522b

…sses and methods.

Merge branch 'refactoring/mobilenetv2_bc_move' into models/mobilenetv3

fdbcec7

# Conflicts: # torchvision/models/mobilenet.py # torchvision/models/quantization/mobilenet.py

Further simplify by reusing MobileNetv2 methods.

16f55f5

Adding the remaining implementation of mobilenetv3.

8162fa4

Adding tests, docs and init methods.

8615585

facebook-github-bot added the cla signed label Dec 16, 2020

datumbox added 2 commits December 16, 2020 17:59

Refactoring and fixing formatter.

8664fde

Fixing type issues.

cfa20b7

datumbox commented Dec 16, 2020

View reviewed changes

torchvision/models/mobilenetv3.py Outdated Show resolved Hide resolved

datumbox commented Dec 16, 2020

View reviewed changes

torchvision/models/mobilenetv3.py Outdated Show resolved Hide resolved

datumbox and others added 2 commits December 16, 2020 20:03

Using build-in Hardsigmoid and Hardswish.

c189ae1

Merge branch 'master' into models/mobilenetv3

5030435

vfdev-5 reviewed Dec 17, 2020

View reviewed changes

torchvision/models/mobilenetv3.py Outdated Show resolved Hide resolved

torchvision/models/mobilenetv3.py Show resolved Hide resolved

Code review nits.

9a758a8

Putting inplace on Dropout.

25f8b26

datumbox force-pushed the models/mobilenetv3 branch 3 times, most recently from 385e077 to 585374c Compare December 20, 2020 20:26

datumbox force-pushed the models/mobilenetv3 branch 2 times, most recently from 6ba7f15 to d912443 Compare December 30, 2020 20:16

Adding rmsprop support on the train.py

5198385

datumbox force-pushed the models/mobilenetv3 branch from d912443 to 5198385 Compare January 1, 2021 12:08

Adding auto-augment and random-erase in the training scripts.

e4d130f

datumbox force-pushed the models/mobilenetv3 branch from b415a70 to e4d130f Compare January 3, 2021 20:48

Merge branch 'master' into models/mobilenetv3

5d0a664

datumbox mentioned this pull request Jan 5, 2021

TorchVision Roadmap - 2021 H1 #3221

Closed

13 tasks

datumbox commented Jan 5, 2021

View reviewed changes

Adding support for reduced tail on MobileNetV3.

c0a13a2

datumbox force-pushed the models/mobilenetv3 branch from 403396c to c0a13a2 Compare January 5, 2021 14:57

datumbox changed the base branch from master to mobilenetv3 January 5, 2021 18:31

datumbox changed the title ~~[WIP] Add MobileNetV3 in TorchVision~~ Add MobileNetV3 Architecture in TorchVision Jan 5, 2021

Tagging blocks with comments.

2414d2d

datumbox merged commit aea1191 into pytorch:mobilenetv3 Jan 5, 2021

datumbox deleted the models/mobilenetv3 branch January 5, 2021 19:18

This was referenced Jan 12, 2021

Add MobileNetV3 architecture with Classification & Detection models #3243

Closed

Add MobileNetV3 architecture for Classification #3252

Merged

This was referenced Mar 18, 2021

Add vanilla DeepSpeech model pytorch/audio#1399

Merged

Adding Module Models pytorch/audio#446

Closed

datumbox mentioned this pull request Sep 29, 2021

Add RegNet Architecture in TorchVision #4403

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add MobileNetV3 Architecture in TorchVision #3182

Add MobileNetV3 Architecture in TorchVision #3182

datumbox commented Dec 16, 2020 •

edited

Loading

codecov bot commented Dec 16, 2020 •

edited

Loading

vfdev-5 left a comment

datumbox commented Dec 17, 2020

datumbox left a comment

datumbox Jan 5, 2021

datumbox Jan 5, 2021

datumbox commented Jan 5, 2021

Add MobileNetV3 Architecture in TorchVision #3182

Add MobileNetV3 Architecture in TorchVision #3182

Conversation

datumbox commented Dec 16, 2020 • edited Loading

codecov bot commented Dec 16, 2020 • edited Loading

Codecov Report

vfdev-5 left a comment

Choose a reason for hiding this comment

datumbox commented Dec 17, 2020

datumbox left a comment

Choose a reason for hiding this comment

datumbox Jan 5, 2021

Choose a reason for hiding this comment

datumbox Jan 5, 2021

Choose a reason for hiding this comment

datumbox commented Jan 5, 2021

datumbox commented Dec 16, 2020 •

edited

Loading

codecov bot commented Dec 16, 2020 •

edited

Loading