This is the official repository for the ICLR 2024 paper "Towards Seamless Adaptation of Pre-trained Models for Visual Place Recognition".
This repo follows the Visual Geo-localization Benchmark; you can refer to VPR-datasets-downloader to prepare the datasets.
The dataset should be organized in a directory tree as follows:

```
├── datasets_vg
│   └── datasets
│       └── pitts30k
│           └── images
│               ├── train
│               │   ├── database
│               │   └── queries
│               ├── val
│               │   ├── database
│               │   └── queries
│               └── test
│                   ├── database
│                   └── queries
```
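Before launching training, it can help to confirm the tree matches the expected layout. Below is a minimal, hypothetical helper (not part of this repo's scripts; the training code performs its own checks) that reports any missing split directories under `<datasets_folder>/<dataset_name>/images`:

```python
import os

# Expected sub-directories under <datasets_folder>/<dataset_name>/images,
# following the tree shown above.
EXPECTED = [
    os.path.join(split, sub)
    for split in ("train", "val", "test")
    for sub in ("database", "queries")
]

def check_dataset_tree(images_root):
    """Return the list of expected sub-directories missing under images_root."""
    return [d for d in EXPECTED if not os.path.isdir(os.path.join(images_root, d))]
```

An empty return value means the six `database`/`queries` folders are all in place.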
Before training, you should download the pre-trained foundation model DINOv2 (ViT-L/14) here.
Finetuning on MSLS

```shell
python3 train.py --datasets_folder=/path/to/your/datasets_vg/datasets --dataset_name=msls --queries_per_epoch=30000 --foundation_model_path /path/to/pre-trained/dinov2_vitl14_pretrain.pth
```
Further finetuning on Pitts30k

```shell
python3 train.py --datasets_folder=/path/to/your/datasets_vg/datasets --dataset_name=pitts30k --queries_per_epoch=5000 --resume /path/to/finetuned/msls/model/SelaVPR_msls.pth
```

To evaluate the finetuned model on Pitts30k, run:

```shell
python3 eval.py --datasets_folder=/path/to/your/datasets_vg/datasets --dataset_name=pitts30k --resume /path/to/finetuned/pitts30k/model/SelaVPR_pitts30k.pth
```
The model finetuned on MSLS (for diverse scenes).
| DOWNLOAD | MSLS-val | | | Nordland-test | | | St. Lucia | | |
|:-:|:-:|:-:|:-:|:-:|:-:|:-:|:-:|:-:|:-:|
| | R@1 | R@5 | R@10 | R@1 | R@5 | R@10 | R@1 | R@5 | R@10 |
| LINK | 90.8 | 96.4 | 97.2 | 85.2 | 95.5 | 98.5 | 99.8 | 100.0 | 100.0 |
The model further finetuned on Pitts30k (only for urban scenes).
| DOWNLOAD | Tokyo24/7 | | | Pitts30k | | | Pitts250k | | |
|:-:|:-:|:-:|:-:|:-:|:-:|:-:|:-:|:-:|:-:|
| | R@1 | R@5 | R@10 | R@1 | R@5 | R@10 | R@1 | R@5 | R@10 |
| LINK | 94.0 | 96.8 | 97.5 | 92.8 | 96.8 | 97.7 | 95.7 | 98.8 | 99.2 |
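The R@1/R@5/R@10 numbers above are standard Recall@N: a query counts as correct if at least one of its top-N retrieved database images is a ground-truth positive. A minimal sketch of the metric (a hypothetical helper for illustration, not this repo's evaluation code):

```python
def recall_at_n(ranked_db_indices, positives_per_query, n_values=(1, 5, 10)):
    """Percentage of queries whose top-N retrieved images contain a true positive.

    ranked_db_indices: one list of database indices per query, best match first.
    positives_per_query: one set of ground-truth positive indices per query.
    """
    recalls = {}
    for n in n_values:
        hits = sum(
            1
            for ranking, positives in zip(ranked_db_indices, positives_per_query)
            if any(idx in positives for idx in ranking[:n])
        )
        recalls[n] = 100.0 * hits / len(ranked_db_indices)
    return recalls
```

For example, with two queries where only the first finds its positive at rank 2, Recall@1 is 0.0 and Recall@2 is 50.0.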
Parts of this repo are inspired by the following repositories:
Visual Geo-localization Benchmark
If you find this repo useful for your research, please consider citing the paper:
```
@inproceedings{selavpr,
  title={Towards Seamless Adaptation of Pre-trained Models for Visual Place Recognition},
  author={Lu, Feng and Zhang, Lijun and Lan, Xiangyuan and Dong, Shuting and Wang, Yaowei and Yuan, Chun},
  booktitle={The Twelfth International Conference on Learning Representations},
  year={2024}
}
```