Datasets

Conceptual Captions link
Flickr 30k link
TIMIT Speech corpus link

I think that we should use the flickr dataset as the 30k images should really be enough in the limited time we have.

Once you have downloaded the flickr dataset extract it and run the resize script that's located in the flickr30k_images folder, from the folder in question

bash resize_images.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Datasets

Files

README.md

Latest commit

History

README.md

File metadata and controls

Datasets