OK-VQA: A Visual Question Answering Benchmark Requiring External Knowledge

Year: 2019

Authors: Kenneth Marino, Mohammad Rastegari, Ali Farhadi, Roozbeh Mottaghi

Venue: 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Type: Publication

Abstract: Visual Question Answering (VQA) in its ideal form lets us study reasoning in the joint space of vision and language and serves as a proxy for the AI task of scene understanding. However, most VQA benchmarks to date are focused on questions such as simple counting, visual attributes, and object detection that do not require reasoning or knowledge beyond what is in the image. In this paper, we address the task of knowledge-based visual question answering and provide a benchmark, called OK-VQA, where the image content is not sufficient to answer t...

Topics: Artificial intelligence, Information retrieval

DOI: 10.1109/cvpr.2019.00331 (Found 2 versions)

External links: Crossref, OpenAIRE

Topic-specific impact indicators

Popularity: This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
Influence: This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
Citation Count: This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
Impulse: This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
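
As a rough illustration of the two count-based indicators described above, the following minimal sketch computes a citation count and an impulse-style early-citation count on a toy citation network. The article names, publication years, and the three-year window are assumptions made for this example only; they are not taken from this record and are not presented as the service's actual method. Popularity and Influence rely on network-wide ranking and are not sketched here.

```python
from collections import Counter

# Toy citation network (hypothetical data): each pair is (citing, cited).
publication_year = {"A": 2019, "B": 2020, "C": 2021, "D": 2024}
citations = [("B", "A"), ("C", "A"), ("D", "A"), ("C", "B"), ("D", "B")]


def citation_count(edges):
    """Total number of citations each article receives (its in-degree)."""
    return Counter(cited for _citing, cited in edges)


def impulse(edges, years, window=3):
    """Citations received within `window` years of publication.

    The window length is an assumption made for this sketch.
    """
    counts = Counter()
    for citing, cited in edges:
        # Count only citations made shortly after the cited article appeared.
        if 0 <= years[citing] - years[cited] <= window:
            counts[cited] += 1
    return counts


total = citation_count(citations)
early = impulse(citations, publication_year)
print("citation count:", dict(total))
print("impulse (3-year window):", dict(early))
```

In this toy network, article A is cited three times overall, but only two of those citations fall inside the assumed window, which is the kind of distinction the Impulse indicator is meant to capture.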