Configs in this directory implement the legacy configs used by MMDetection V1.x and its model zoos.
To help users convert their models from V1.x to MMDetection V2.0, we provide V1.x configs for running inference with the converted V1.x models. Because MMDetection V2.0 introduces BC-breaking changes relative to MMDetection V1.x, running inference with the same model weights in the two versions produces different results. The absolute difference is within 1% AP, as shown in the table below.
To upgrade the model version, the users need to do the following steps.
There are three main differences in the model weights between the V1.x and V2.0 codebases.
- Since the class order in every detector's classification branch is reordered, all the legacy model weights need to go through the conversion process.
- The regression and segmentation heads no longer contain the background channel. Weights in these background channels should be removed to fit the current codebase.
- For two-stage detectors, the weights need to be upgraded since MMDetection V2.0 refactors all the two-stage detectors with `RoIHead`.
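The first two differences can be illustrated on a toy per-class layout. This is a minimal sketch, not the logic of the provided script: it assumes the V1.x layout stores the background entry at index 0 and that V2.0 classification branches keep background as the last entry. The helper names `move_background_last` and `drop_background` are hypothetical.

```python
def move_background_last(per_class):
    """Reorder [bg, c1, ..., cK] into [c1, ..., cK, bg] (assumed cls-branch change)."""
    return per_class[1:] + per_class[:1]


def drop_background(per_class):
    """Drop the leading background entry (assumed reg/mask-head change)."""
    return per_class[1:]


# Toy layout standing in for the per-class channels of a V1.x head.
v1_layout = ["background", "person", "bicycle", "car"]

cls_layout = move_background_last(v1_layout)
mask_layout = drop_background(v1_layout)

print(cls_layout)
print(mask_layout)
```

Real weights are tensors rather than lists, so the same slicing would be applied along the output-channel dimension.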
Users can apply the same modifications to their self-implemented detectors. We provide a script, `tools/upgrade_model_version.py`, to convert the model weights in the V1.x model zoo.
python tools/upgrade_model_version.py ${OLD_MODEL_PATH} ${NEW_MODEL_PATH} --num-classes ${NUM_CLASSES}
- OLD_MODEL_PATH: the path to load the model weights in 1.x version.
- NEW_MODEL_PATH: the path to save the converted model weights in 2.0 version.
- NUM_CLASSES: the number of classes of the original model weights. It is usually 81 for the COCO dataset and 21 for the VOC dataset. The number of classes in V2.0 models should be one fewer than in the V1.x models.
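As a rough illustration of how the arguments fit together, the sketch below assembles the command line for a COCO model and computes the V2.0 class count. The checkpoint file names and both helper functions are hypothetical; only the script path and the `--num-classes` flag come from the usage line above.

```python
import shlex


def v2_num_classes(v1_num_classes):
    """V2.0 class count is one fewer than the V1.x count."""
    return v1_num_classes - 1


def build_upgrade_command(old_model_path, new_model_path, num_classes):
    """Assemble the argument list for tools/upgrade_model_version.py."""
    return [
        "python", "tools/upgrade_model_version.py",
        old_model_path, new_model_path,
        "--num-classes", str(num_classes),
    ]


# Hypothetical checkpoint names for a COCO model trained with V1.x.
command = build_upgrade_command("old_v1.pth", "new_v2.pth", 81)
print(shlex.join(command))
print(v2_num_classes(81), v2_num_classes(21))
```

Note that `--num-classes` takes the V1.x count (81 for COCO), not the V2.0 count.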
After converting the model weights, check out the v1.2 release to find the corresponding config file that uses the legacy settings.
The V1.x models usually need these three legacy modules: `LegacyAnchorGenerator`, `LegacyDeltaXYWHBBoxCoder`, and `RoIAlign(align=False)`.
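In a config file, these modules would typically be selected through `type` entries. The fragments below are only a sketch: the surrounding key names (`anchor_generator`, `bbox_coder`) and the `aligned` argument spelling are assumptions that may differ between releases, and the other arguments each module requires are omitted. The configs in this directory are the reference.

```python
# Hypothetical override fragments; key names are assumptions and the
# remaining required arguments (strides, ratios, sizes, ...) are omitted.

# Dense head: legacy anchor generation and legacy box encoding.
legacy_dense_head = dict(
    anchor_generator=dict(type='LegacyAnchorGenerator'),
    bbox_coder=dict(type='LegacyDeltaXYWHBBoxCoder'),
)

# RoI extractor layer for two-stage detectors: RoIAlign with the
# alignment option switched off, as V1.x models expect.
legacy_roi_layer = dict(type='RoIAlign', aligned=False)

print(legacy_dense_head)
print(legacy_roi_layer)
```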
Models using ResNet Caffe backbones also need to change the name of the pretrained weights and the corresponding `img_norm_cfg`.
An example is in `retinanet_r50_caffe_fpn_1x_coco_v1.py`.
Then use the config to test the model weights. For most models, the obtained results should be close to those in V1.x.
We provide configs of some common structures in this directory.
The performance changes after converting the models in this directory are listed below.
Method | Style | Lr schd | V1.x box AP | V1.x mask AP | V2.0 box AP | V2.0 mask AP | Download |
---|---|---|---|---|---|---|---|
Mask R-CNN R-50-FPN | pytorch | 1x | 37.3 | 34.2 | 36.8 | 33.9 | model |
RetinaNet R-50-FPN | caffe | 1x | 35.8 | - | 35.4 | - | |
RetinaNet R-50-FPN | pytorch | 1x | 35.6 | - | 35.2 | - | model |
Cascade Mask R-CNN R-50-FPN | pytorch | 1x | 41.2 | 35.7 | 40.8 | 35.6 | model |
SSD300-VGG16 | caffe | 120e | 25.7 | - | 25.4 | - | model |
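The table can be checked against the "within 1% AP" statement with a few lines of Python. This is a small sketch that copies the numbers from the rows above; the variable names are illustrative only.

```python
# (method, style, V1.x box AP, V1.x mask AP, V2.0 box AP, V2.0 mask AP)
# None marks a metric the table does not report.
results = [
    ("Mask R-CNN R-50-FPN", "pytorch", 37.3, 34.2, 36.8, 33.9),
    ("RetinaNet R-50-FPN", "caffe", 35.8, None, 35.4, None),
    ("RetinaNet R-50-FPN", "pytorch", 35.6, None, 35.2, None),
    ("Cascade Mask R-CNN R-50-FPN", "pytorch", 41.2, 35.7, 40.8, 35.6),
    ("SSD300-VGG16", "caffe", 25.7, None, 25.4, None),
]

drops = []
for method, style, v1_box, v1_mask, v2_box, v2_mask in results:
    drops.append(round(v1_box - v2_box, 1))
    if v1_mask is not None:
        drops.append(round(v1_mask - v2_mask, 1))

max_drop = max(drops)
print(f"largest AP drop after conversion: {max_drop}")
```

Every listed drop is below 1 AP, with the largest being the Mask R-CNN box AP.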