lang2seg

Prerequisites

Python 2.7
Pytorch 0.2 or 0.3
CUDA 8.0
Mask R-CNN: Follow the instructions of the mask-faster-rcnn repo, preparing everything needed for pyutils/mask-faster-rcnn.
REFER API and data: Use the download links of REFER and go to the foloder running make. Follow data/README.md to prepare images and refcoco/refcoco+/refcocog annotations.
COCO training set should be downloaded in pyutils/mask-faster-rcnn/data/coco/images/train2014.

Preprocessing

The processed data is uploaded in cache/prepro/.

Training

<DATASET> <SPLITBY> pairs contain: refcoco unc/refcoco+ unc/refcocog umd/refcocog google
Output model will be saved at <DATASET>_<SPLITBY>/output_<OUTPUT_POSTFIX>. If there are trained models in this directory, the model of the latest iteratioin will be loaded.
The iteration when learning rate decay is specified as STEPSIZE in train_*.sh.

Train the baseline segmentation model with only 1 dynamic filter:

./experiments/scripts/train_baseline.sh <GPUID> <DATASET> <SPLITBY> <OUTPUT_POSTFIX>

The Mask R-CNN model is in pyutils/mask-faster-rcnn/lib/nets/resnet_v1.py and pyutils/mask-faster-rcnn/lib/nets/network.py.

Train the model with spatial dynamic filters:

./experiments/scripts/train_spatial.sh <GPUID> <DATASET> <SPLITBY> <OUTPUT_POSTFIX>

The Mask R-CNN model is in pyutils/mask-faster-rcnn/lib/nets/resnet_v1_7f.py and pyutils/mask-faster-rcnn/lib/nets/network_7f.py.

Train the model with spatial dynamic filters and caption loss:

./experiments/scripts/train_cycle.sh <GPUID> <DATASET> <SPLITBY> <OUTPUT_POSTFIX> att2in2 <CAPTION_LOSS_WEIGHT>

The pretrained Mask R-CNN model should be placed at <DATASET>_<SPLITBY>/output_<OUTPUT_POSTFIX>. If there are multiple models in the directory, the model of the latest iteration will be loaded.

The pretrained caption model should be placed at <DATASET>_<SPLITBY>/caption_log_res5_2/, named as model-best.pth and infos-best.pkl.

The Mask R-CNN model is in pyutils/mask-faster-rcnn/lib/nets/resnet_v1_cycle_res5_2.py and pyutils/mask-faster-rcnn/lib/nets/network_cycle_res5_2.py.

Train the model with spatial dynamic filters and response loss:

./experiments/scripts/train_response.sh <GPUID> <DATASET> <SPLITBY> <OUTPUT_POSTFIX>

The Mask R-CNN model is in pyutils/mask-faster-rcnn/lib/nets/resnet_v1_7f_response.py and pyutils/mask-faster-rcnn/lib/nets/network_7f_response.py.

Train the model with spatial dynamic filters, response loss and caption loss:

./experiments/scripts/train_cycle_response.sh <GPUID> <DATASET> <SPLITBY> <OUTPUT_POSTFIX> att2in2 <CAPTION_LOSS_WEIGHT>

The pretrained Mask R-CNN model should be placed at <DATASET>_<SPLITBY>/output_<OUTPUT_POSTFIX>. If there are multiple models in the directory, the model of the latest iteration will be loaded.

The pretrained caption model should be placed at <DATASET>_<SPLITBY>/caption_log_response/, named as model-best.pth and infos-best.pkl.

The Mask R-CNN model is in pyutils/mask-faster-rcnn/lib/nets/resnet_v1_cycle_response.py and pyutils/mask-faster-rcnn/lib/nets/network_cycle_response.py.

Evaluation

Evaluate the baseline segmentation model:

./experiments/scripts/eval_baseline.sh <GPUID> <DATASET> <SPLITBY> <OUTPUT_POSTFIX> <MODEL_ITER>

Evaluate the model at <DATASET>_<SPLITBY>/output_<OUTPUT_POSTFIX>, of trained iteration <MODEL_ITER>.

Detection and segmentation results will be saved at experiments/det_results.txt and experiments/mask_results.txt respectively.

Evaluate the model with spatial dynamic filters (and caption loss):

./experiments/scripts/eval_spatial.sh <GPUID> <DATASET> <SPLITBY> <OUTPUT_POSTFIX> <MODEL_ITER>

Evaluate the model with spatial dynamic filters and response loss (and caption loss):

./experiments/scripts/eval_response.sh <GPUID> <DATASET> <SPLITBY> <OUTPUT_POSTFIX> <MODEL_ITER>

Name		Name	Last commit message	Last commit date
Latest commit History 28 Commits
cache/prepro		cache/prepro
experiments		experiments
lib		lib
pyutils		pyutils
refcoco_unc		refcoco_unc
refcocog_umd		refcocog_umd
tools		tools
.gitignore		.gitignore
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

lang2seg

Prerequisites

Preprocessing

Training

Evaluation

About

Releases

Packages

Languages

License

wenz116/lang2seg

Folders and files

Latest commit

History

Repository files navigation

lang2seg

Prerequisites

Preprocessing

Training

Evaluation

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages