VQA-LOL
Visual Question Answering under the Lens of Logic


Tejas Gokhale*


Pratyay Banerjee*


Chitta Baral


Yezhou Yang




Paper

Dataset

Source Code

Cite This Work




VQA models struggle at negation, antonyms, conjunction, disjunction!

We provide dataset, a new model, and detailed analysis!


Our dataset contains questions composed with negation, antonyms, conjunctions, and disjunctions.
Our model learns to identify the type of question and they type of logical connective in the question to aid question-answering.

Reference

Gokhale, T., Banerjee, P., Baral, C., & Yang, Y. (2020). VQA-LOL: Visual Question Answering under the Lens of Logic.
Bibtex