Related papers: VProChart: Answering Chart Question through Visual…

200 papers

Visually-situated languages such as charts and plots are omnipresent in real-world documents. These graphical depictions are human-readable and are often analyzed in visually-rich documents to address a variety of questions that necessitate…

Artificial Intelligence · Computer Science 2023-10-31 Anran Wu , Luwei Xiao , Xingjiao Wu , Shuwen Yang , Junjie Xu , Zisong Zhuang , Nian Xie , Cheng Jin , Liang He

Bar charts are an effective way to convey numeric information, but today's algorithms cannot parse them. Existing methods fail when faced with even minor variations in appearance. Here, we present DVQA, a dataset that tests many aspects of…

Computer Vision and Pattern Recognition · Computer Science 2018-03-30 Kushal Kafle , Brian Price , Scott Cohen , Christopher Kanan

Chart question answering (CQA) is a task used for assessing chart comprehension, which is fundamentally different from understanding natural images. CQA requires analyzing the relationships between the textual and the visual components of a…

Computer Vision and Pattern Recognition · Computer Science 2022-07-12 Matan Levy , Rami Ben-Ari , Dani Lischinski

Chart question answering (CQA) is a newly proposed visual question answering (VQA) task where an algorithm must answer questions about data visualizations, e.g. bar charts, pie charts, and line graphs. CQA requires capabilities that…

Computer Vision and Pattern Recognition · Computer Science 2020-07-23 Kushal Kafle , Robik Shrestha , Brian Price , Scott Cohen , Christopher Kanan

Information visualizations such as bar charts and line charts are very common for analyzing data and discovering critical insights. Often people analyze charts to answer questions that they have in mind. Answering such questions can be…

Computation and Language · Computer Science 2022-05-24 Enamul Hoque , Parsa Kavehzadeh , Ahmed Masry

Charts are very popular for analyzing data. When exploring charts, people often ask a variety of complex reasoning questions that involve several logical and arithmetic operations. They also commonly refer to visual features of a chart in…

Computation and Language · Computer Science 2022-03-22 Ahmed Masry , Do Xuan Long , Jia Qing Tan , Shafiq Joty , Enamul Hoque

Recently, Vision Language Models (VLMs) have increasingly emphasized document visual grounding to achieve better human-computer interaction, accessibility, and detailed understanding. However, its application to visualizations such as…

Computer Vision and Pattern Recognition · Computer Science 2025-06-19 Alexander Vogel , Omar Moured , Yufan Chen , Jiaming Zhang , Rainer Stiefelhagen

We propose a novel framework that leverages Visual Question Answering (VQA) models to automate the evaluation of LLM-generated data visualizations. Traditional evaluation methods often rely on human judgment, which is costly and unscalable,…

Computer Vision and Pattern Recognition · Computer Science 2024-09-30 James Ford , Xingmeng Zhao , Dan Schumacher , Anthony Rios

In the fields of computer vision and natural language processing, multimodal chart question-answering, especially involving color, structure, and textless charts, poses significant challenges. Traditional methods, which typically involve…

Computer Vision and Pattern Recognition · Computer Science 2024-04-03 Jingxuan Wei , Nan Xu , Guiyong Chang , Yin Luo , BiHui Yu , Ruifeng Guo

Recent multimodal LLMs have shown promise in chart-based visual question answering, but their performance declines sharply on unannotated charts, those requiring precise visual interpretation rather than relying on textual shortcuts. To…

Artificial Intelligence · Computer Science 2026-01-08 Rachneet Kaur , Nishan Srishankar , Zhen Zeng , Sumitra Ganesh , Manuela Veloso

Charts are ubiquitous, as people often use them to analyze data, answer questions, and discover critical insights. However, performing complex analytical tasks with charts requires significant perceptual and cognitive effort. Chart Question…

Charts are very popular for analyzing data, visualizing key insights and answering complex reasoning questions about data. To facilitate chart-based data analysis using natural language, several downstream tasks have been introduced…

Computation and Language · Computer Science 2023-10-12 Ahmed Masry , Parsa Kavehzadeh , Xuan Long Do , Enamul Hoque , Shafiq Joty

Multimodal vision-language models (VLMs) continue to achieve ever-improving scores on chart understanding benchmarks. Yet, we find that this progress does not fully capture the breadth of visual reasoning capabilities essential for…

Computer Vision and Pattern Recognition · Computer Science 2025-12-09 Kushin Mukherjee , Donghao Ren , Dominik Moritz , Yannick Assogba

Chart Question Answering (CQA) aims at answering questions based on the visual chart content, which plays an important role in chart summarization, business data analysis, and data report generation. CQA is a challenging multi-modal task…

Computer Vision and Pattern Recognition · Computer Science 2024-09-05 Lingling Zhang , Muye Huang , QianYing Wang , Yaxian Wang , Wenjun Wu , Jun Liu

We present a new dataset for chart question answering (CQA) constructed from visualization notebooks. The dataset features real-world, multi-view charts paired with natural language questions grounded in analytical narratives. Unlike prior…

Computation and Language · Computer Science 2025-07-03 Maeve Hutchinson , Radu Jianu , Aidan Slingsby , Jo Wood , Pranava Madhyastha

Visual Question Answering (VQA) is a challenging task that has received increasing attention from both the computer vision and the natural language processing communities. Given an image and a question in natural language, it requires…

Computer Vision and Pattern Recognition · Computer Science 2016-07-21 Qi Wu , Damien Teney , Peng Wang , Chunhua Shen , Anthony Dick , Anton van den Hengel

We introduce GQA, a new dataset for real-world visual reasoning and compositional question answering, seeking to address key shortcomings of previous VQA datasets. We have developed a strong and robust question engine that leverages scene…

Computation and Language · Computer Science 2019-07-12 Drew A. Hudson , Christopher D. Manning

We introduce InterChart, a diagnostic benchmark that evaluates how well vision-language models (VLMs) reason across multiple related charts, a task central to real-world applications such as scientific reporting, financial analysis, and…

Computation and Language · Computer Science 2026-05-04 Anirudh Iyengar Kaniyar Narayana Iyengar , Srija Mukhopadhyay , Adnan Qidwai , Shubhankar Singh , Dan Roth , Vivek Gupta

Chart question answering (CQA) is a crucial area of Visual Language Understanding. However, the robustness and consistency of current Visual Language Models (VLMs) in this field remain under-explored. This paper evaluates state-of-the-art…

Computation and Language · Computer Science 2024-10-07 Srija Mukhopadhyay , Adnan Qidwai , Aparna Garimella , Pritika Ramu , Vivek Gupta , Dan Roth

Chart understanding presents a critical test to the reasoning capabilities of Vision-Language Models (VLMs). Prior approaches face critical limitations: some rely on external tools, making them brittle and constrained by a predefined…

Computer Vision and Pattern Recognition · Computer Science 2025-09-12 Bohao Tang , Yan Ma , Fei Zhang , Jiadi Su , Ethan Chern , Zhulin Hu , Zhixin Wang , Pengfei Liu , Ya Zhang