See the main README.md
.
Images and annotations would not be warped but instead re-scaled so that the long sides are 513.
If you want to try the official voc12 caffemodel, please convert with the convert.py --dataset voc12
and use this configuration file.