
Dual-Path Convolutional Image-Text Embedding

This repository contains the code for our paper Dual-Path Convolutional Image-Text Embedding. Thank you for your attention.

The complete code will be uploaded in two weeks. I am adding illustrations and comments to the code to make it easier to use. You can check my progress below.

Checklist

  • Get word2vec weight

  • Data Preparation (Flickr30k)

  • Train on Flickr30k

  • Test on Flickr30k

  • Data Preparation (MSCOCO)

  • Train on MSCOCO

  • Test on MSCOCO

  • Data Preparation (CUHK-PEDES)

  • Train on CUHK-PEDES

  • Test on CUHK-PEDES

  • Run the code on another machine

Prepare Data

  1. Extract word2vec weights. Follow the instructions in ./word2vector_matlab.

  2. Prepare the dataset. Follow the instructions in ./dataset. You can choose one dataset to run. The three datasets need different preprocessing; instructions are provided for Flickr30k, MSCOCO, and CUHK-PEDES.

  3. Download the model pre-trained on ImageNet and put it into './data':

(bash)
wget http://www.vlfeat.org/matconvnet/models/imagenet-resnet-50-dag.mat

Alternatively, you may try VGG16 or VGG19.
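
Once the weights are in './data', you can sanity-check the download from MATLAB. The following is a minimal sketch, assuming MatConvNet is already compiled and on the MATLAB path; the URL is the one given above, and the DagNN loading call is standard MatConvNet usage rather than code from this repository.

(matlab)
% Download the ImageNet pre-trained ResNet-50 into ./data (skipped if already present).
if ~exist('data', 'dir'), mkdir('data'); end
url = 'http://www.vlfeat.org/matconvnet/models/imagenet-resnet-50-dag.mat';
dst = fullfile('data', 'imagenet-resnet-50-dag.mat');
if ~exist(dst, 'file')
    websave(dst, url);
end

% Load the network with MatConvNet's DagNN wrapper as a quick sanity check.
net = dagnn.DagNN.loadobj(load(dst));
fprintf('Loaded a network with %d layers.\n', numel(net.layers));

The VGG16 and VGG19 weights can be fetched the same way from the MatConvNet model zoo (the files are named imagenet-vgg-verydeep-16.mat and imagenet-vgg-verydeep-19.mat).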

Train

  • For Flickr30k, run train_flickr_word2_1_pool.m for Stage I training.

Then run train_flickr_word_Rankloss_shift_hard for Stage II training (a minimal invocation sketch is given after this list).

  • For MSCOCO, run train_coco_word2_1_pool.m for Stage I training.

Then run train_coco_Rankloss_shift_hard.m for Stage II training.

  • For CUHK-PEDES, run train_cuhk_word2_1_pool.m for Stage I training.

Then run train_cuhk_word_Rankloss_shift for Stage II training.
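
The two stages are simply run one after the other from the MATLAB prompt. Below is a minimal sketch for Flickr30k, using the script names from the list above; the vl_setupnn path is an assumption about where your MatConvNet checkout lives.

(matlab)
% Set up MatConvNet (adjust the path to your own checkout; this path is an assumption).
run('matconvnet/matlab/vl_setupnn.m');

% Stage I: train the dual-path network starting from the ImageNet pre-trained weights.
train_flickr_word2_1_pool;

% Stage II: ranking-loss training (per the script name).
train_flickr_word_Rankloss_shift_hard;

The MSCOCO and CUHK-PEDES runs follow the same pattern with their respective scripts.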

Test

Select one model and have fun!

  • For Flickr30k, run test/extract_pic_feature_word2_plus_52.m to extract the image and text features. Note that you need to change the model path in the code.

  • For MSCOCO, run test_coco/extract_pic_feature_word2_plus.m to extract the image and text features. Note that you need to change the model path in the code.

  • For CUHK-PEDES, run test_cuhk/extract_pic_feature_word2_plus_52.m to extract the image and text features. Note that you need to change the model path in the code.
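
Once the image and text features are extracted, retrieval reduces to a nearest-neighbour search between the two embeddings. The sketch below is generic evaluation code, not this repository's scripts: img_feat and txt_feat are hypothetical D x N matrices standing in for whatever the extraction script saves, and it assumes the i-th text description matches the i-th image.

(matlab)
% img_feat, txt_feat: D x N matrices of image / text embeddings (hypothetical names;
% load them from the output of the extraction script).
img_feat = bsxfun(@rdivide, img_feat, sqrt(sum(img_feat.^2, 1)));  % L2-normalise columns
txt_feat = bsxfun(@rdivide, txt_feat, sqrt(sum(txt_feat.^2, 1)));
sim = txt_feat' * img_feat;                % N x N cosine similarity (text query vs. image)

% Text-to-image Recall@1: fraction of text queries whose top-ranked image is the true match.
[~, best] = max(sim, [], 2);
recall_at_1 = mean(best == (1:size(sim, 1))');
fprintf('Text-to-image Recall@1: %.2f%%\n', 100 * recall_at_1);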