AttentionMask

AttentionMask: Attentive, Efficient Object Proposal Generation Focusing on Small Objects (ACCV 2018, accepted as oral)

We propose a novel approach for class-agnostic object proposal generation, which is efficient and especially well-suited to detect small objects. Efficiency is achieved by scale-specific objectness attention maps which focus the processing on promising parts of the image and reduce the amount of sampled windows strongly. This leads to a system, which is 33% faster than the state-of-the-art and clearly outperforming state-of-the-art in terms of average recall. Secondly, we add a module for detecting small objects, which are often missed by recent models. We show that this module improves the average recall for small objects by about 53%.

The system is based on FastMask.

If you find this software useful in your research, please cite our paper.

@inproceedings{WilmsFrintropACCV2018,
title = {{AttentionMask}: Attentive, Efficient Object Proposal Generation Focusing on Small Objects},
author = {Christian Wilms and Simone Frintrop},
booktitle = {Asien Conference on Computer Vision (ACCV)},
year = {2018}
}

Requirements

Ubuntu 16.04
Cuda 9.0
Python 2.7
OpenCV-Python
Python packages: scipy, numpy, python-cjson, setproctitle, scikit-image
COCOApi
Caffe (already part of this git)
Alchemy (already part of this git)

Installation

We assume Ubuntu 16.04 with Cuda 9.0, Python 2.7 and pip already installed

First, install OpenCV-Python:

$ sudo apt-get install python-opencv

Then, clone and install COCOApi as described here.

Now, clone this repository, install the Python packages from requirements.txt if necessary and install the requirements of Caffe (PyCaffe) following the official instructions. Edit the Makefile.config according to your system settings.

$ git clone https://github.com/chwilms/AttentionMask
$ cd AttentionMask
$ pip install -r requirements.txt
$ cd caffe
$ make pycaffe -j6
$ cd ..

Create new subdirectories for weights params and results results:

$ mkdir params results

Usage

After sucessfull installation, AttentionMask can immediatly be used without any training. Just download the weights (and the COCO dataset). However, you can also train AttentionMask with your own data. Note however, that your own data should be in the COCO format.

Download dataset

Download the train2014 and val2014 splits from COCO dataset. The train2014 split is exclusively used for training, while the first 5000 image from the val2014 split are used for testing. After downloading, extract the data in the following structure:

AttentionMask
|
---- data
     |
     ---- coco
          |
          ---- annotations
          |    |
          |    ---- instances_train2014.json
          |    |
          |    ---- instances_val2014.json
          |
          ---- train2014
          |    |
          |    ---- COCO_train2014_000000000009.jpg
          |    |
          |    ---- ...
          |
          ---- val2014
               |
               ---- COCO_val2014_000000000042.jpg
               |
               ---- ...

Download weights

For inference, you have to download the model weights for one of the final AttentionMask models: AttentionMask-8-128, AttentionMask-8-192, AttentionMask-16-192

If you want to do training yourself, download the initial ImageNet weights for the ResNet. Weight files should be moved into the params subdirectory.

Inference

There are two options for inference. You can either generate proposals for the COCO dataset (or any other dataset following that format) or you can generate proposals for one image.

COCO dataset

For inference on the COCO dataset, use the testAttentionMask.py script with the gpu id, the model name, the weights and the dataset you want to test on (e.g., val2014):

$ python testAttentionMask.py 0 attentionMask-8-128 --init_weights attentionmask-8-128final.caffemodel --dataset val2014 --end 5000

By default, only the first 5000 images of a dataset are used.

Individual image

If you want to test AttentionMask on one of your images, call the demo.py script with the path to your image, the gpu id, the model name and the weights:

$ python demo.py 0 attentionMask-8-128 <your image path here> --init_weights=attentionmask-8-128final.caffemodel

As a result you get an image with the best 20 proposals as overlays. If you want to dive deeper into the set of proposals, you can store them all using the ret_masks variable in the script with the ret_scores variable for the objectness scores.

Evaluation

For evaluation on the COCO dataset, you can use the evalCOCO.py script with the model name and the dataset used. --useSegm is a flag for using segmentation masks instead of bounding boxes.

$ python evalCOCO.py attentionMask-8-128 --dataset val2014 --useSegm True --end 5000

By default, only the first 5000 images of a dataset are used.

Training

To train AttentionMask on a dataset, you can use the train.sh script. It iterates over several epochs and saves as well as evaluates the result of each epoch (outputs in trainEval.txt). For validation, currently the frist 5000 images of the training set are used, however, we encourage you to use a split of val2014 that is disjunct to the first 5000 images as your own validation set. Lowering the learning rate has to be done manually. We lowered the learning rate after 3 consecutive epochs of not improving results on our validation set. The environmental variable EPOCH determines the next epoch to be run and is automatically incremented.

$ export EPOCH=1
$ ./train.sh

Training on your own dataset

If you want to change the dataset form COCO to something else you have to follow the subsequent steps.

You have to provide the annotations in COCO-style. COCO-style means the annotation file has to be a json file similar to the COCO annotations. There are many tools on the web to change or create annotations accordingly.
Change the shuffledData.txt. The only purpose of this file is to keep the data preprocessing in the data layer and the box selection layer in sync (loading the identical image, determining if the image should be flipped or slightly zoomed). Therefore this file keeps a randomly shuffled list of all indices of the dataset. In case of COCO dataset it is a list of numbers from 0 to 82080 (82081 images in training set). Additionally for each number there is a random flag (0 or 1) for horizontally flipping the image for training as well as a number between 0 and 69 as a tiny zoom. Tiny zoom is a small number that is added on top of the max edge length to get some more variety in image sizes (check fetch() and fetch_image() in base_coco_ssm_spider.py or boxSelectionLayerMP.py for details). All three values are separated by a semicolon and each line has one entry.
In config.py adjust ANNOTATION_TYPE and IMAGE_SET according to your new dataset. Furthermore, you may have to adjust ANNOTATION_FILE_FORMAT for the path to the annotations or IMAGE_PATH_FORMAT for the path to the images. The image format strings are used in alchemy/data/coco.py for locating the images. Changes may have to be applied there as well, e.g., if the image file name does not start with the dataset name.
The solver (models/*.solver.prototxt) has to be adapted if the dataset is of different length than COCO. Change snapshot, display and average_loss according to the number of images in your dataset.
Change the value of the --step parameter when calling the training script to the number of images in your dataset.

Name		Name	Last commit message	Last commit date
Latest commit History 35 Commits
alchemy		alchemy
caffe		caffe
configs		configs
models		models
python_layers/data		python_layers/data
spiders		spiders
LICENSE		LICENSE
README.md		README.md
config.py		config.py
demo.py		demo.py
evalCOCO.py		evalCOCO.py
example.png		example.png
infogainH_ones.binaryproto		infogainH_ones.binaryproto
requirements.txt		requirements.txt
shuffledData.txt		shuffledData.txt
testAttentionMask.py		testAttentionMask.py
train.sh		train.sh
trainAttentionMask.py		trainAttentionMask.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AttentionMask

Requirements

Installation

Usage

Download dataset

Download weights

Inference

COCO dataset

Individual image

Evaluation

Training

Training on your own dataset

About

Releases

Packages

Contributors 2

Languages

License

chwilms/AttentionMask

Folders and files

Latest commit

History

Repository files navigation

AttentionMask

Requirements

Installation

Usage

Download dataset

Download weights

Inference

COCO dataset

Individual image

Evaluation

Training

Training on your own dataset

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages