[datasets] Extend the range of public datasets supported in docTR #587
Closed
Description
Currently, we support FUNSD
, CORD
and SROIE
but we should look at extending the range of supported datasets. Among others, we could include handwritten, and in-the-wild situations.
Here is a list of datasets you can usually find in OCR-related benchmarks:
- IIIT-5k (https://cvit.iiit.ac.in/research/projects/cvit-projects/the-iiit-5k-word-dataset) IIIT-5K dataset integration #589
- SVT (http://vision.ucsd.edu/~kai/svt/) SVT dataset integration #597 reopening #597 SVT dataset integration #620
- IC03 (http://www.iapr-tc11.org/mediawiki/index.php?title=ICDAR_2003_Robust_Reading_Competitions) ICDAR2003 dataset integration #653
- IC13 (http://dagdata.cvc.uab.es/icdar2013competition/?ch=2&com=downloads) ICDAR2013 dataset integration #662
- SVHN (http://ufldl.stanford.edu/housenumbers/) SVHN dataset integration #634
- SynthText (https://github.com/ankush-me/SynthText) SynthText dataset integration #624
- IMGUR5K (https://github.com/facebookresearch/IMGUR5K-Handwriting-Dataset) Imgur5k dataset integration #785
Of course, the list goes on