Please visit http://shannon.cs.illinois.edu/DenotationGraph/ and download Flickr30k.
(matlab) split_Flickr30k
(matlab) resize_image
(matlab) prepare_imdb %you need to change the full path in this script
(matlab) train_txt
(matlab) make_dictionary
In this step, we also get rid of rare words, which are not included in GoogleNews word2vector.
(matlab) clear_txt
(matlab) prepare_wordcnn_feature2