Simple Baseline for Visual Question Answering

We describe a very simple bag-of-words baseline for visual question answering. The baseline is described in the arXiv paper http://arxiv.org/pdf/1512.02167.pdf. The code was developed by Bolei Zhou and Yuandong Tian.

To train the model using the code, the following data of the VQA dataset are needed:

Contact Bolei Zhou ([email protected]) if you have any questions.

Please cite our arXiv note if you use our code:

B. Zhou, Y. Tian, S. Sukhbaatar, A. Szlam, R. Fergus. Simple Baseline for Visual Question Answering. arXiv:1512.02167
