Why Did the Chicken Cross the Road? Rephrasing and Analyzing Ambiguous Questions in VQA

2022-11-14 16:45:42

Elias Stengel-Eskin, Jimena Guallar-Blasco, Yi Zhou, Benjamin Van Durme

arXiv_CL

arXiv_CL VQA QA Ontology

Abstract

Resolving ambiguities in questions is key to successfully answering them. Focusing on questions about images, we create a dataset of ambiguous examples; we annotate these examples, grouping the answers by the underlying question they address and rephrasing the question for each group to reduce ambiguity. An analysis of our data reveals a linguistically aligned ontology of reasons for ambiguity in visual questions. We then develop an English question-generation model which, as we demonstrate via automatic and human evaluation, produces less ambiguous questions. We further show that the question-generation objective we use allows the model to integrate answer-group information without any direct supervision.
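
To make the annotation scheme concrete, here is a minimal sketch of how one annotated example might be represented: an ambiguous question, its answers grouped by the underlying question they address, and one rephrased question per group. The class names, field names, image identifier, and sample answers are all hypothetical illustrations invented for this sketch; they are not the dataset's actual schema or contents.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class AnswerGroup:
    """Answers that address the same underlying reading of a question (hypothetical schema)."""
    rephrased_question: str  # less ambiguous rewrite for this group
    answers: List[str] = field(default_factory=list)


@dataclass
class AmbiguousExample:
    """One ambiguous visual question with its annotated answer groups (hypothetical schema)."""
    image_id: str
    original_question: str
    groups: List[AnswerGroup] = field(default_factory=list)

    def rephrasing_for(self, answer: str) -> Optional[str]:
        """Return the rephrased question of the group containing `answer`, if any."""
        for group in self.groups:
            if answer in group.answers:
                return group.rephrased_question
        return None


# Invented sample data: the title question read as asking for a purpose or a cause.
example = AmbiguousExample(
    image_id="hypothetical-0001",
    original_question="Why did the chicken cross the road?",
    groups=[
        AnswerGroup(
            rephrased_question="For what purpose did the chicken cross the road?",
            answers=["to get to the other side", "to find food"],
        ),
        AnswerGroup(
            rephrased_question="What caused the chicken to cross the road?",
            answers=["a car scared it", "it was chased"],
        ),
    ],
)
```

Under this layout, `example.rephrasing_for("to find food")` looks up the group an answer belongs to and returns that group's rephrased question, which mirrors the idea of conditioning a rewrite on an answer group.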

URL

https://arxiv.org/abs/2211.07516

PDF

https://arxiv.org/pdf/2211.07516.pdf