Related papers: VProChart: Answering Chart Question through Visual…

DCQA: Document-Level Chart Question Answering towards Complex Reasoning and Common-Sense Understanding

Visually-situated languages such as charts and plots are omnipresent in real-world documents. These graphical depictions are human-readable and are often analyzed in visually-rich documents to address a variety of questions that necessitate…

Artificial Intelligence · Computer Science 2023-10-31 Anran Wu, Luwei Xiao, Xingjiao Wu, Shuwen Yang, Junjie Xu, Zisong Zhuang, Nian Xie, Cheng Jin, Liang He

DVQA: Understanding Data Visualizations via Question Answering

Bar charts are an effective way to convey numeric information, but today's algorithms cannot parse them. Existing methods fail when faced with even minor variations in appearance. Here, we present DVQA, a dataset that tests many aspects of…

Computer Vision and Pattern Recognition · Computer Science 2018-03-30 Kushal Kafle, Brian Price, Scott Cohen, Christopher Kanan

Classification-Regression for Chart Comprehension

Chart question answering (CQA) is a task used for assessing chart comprehension, which is fundamentally different from understanding natural images. CQA requires analyzing the relationships between the textual and the visual components of a…

Computer Vision and Pattern Recognition · Computer Science 2022-07-12 Matan Levy, Rami Ben-Ari, Dani Lischinski

Answering Questions about Data Visualizations using Efficient Bimodal Fusion

Chart question answering (CQA) is a newly proposed visual question answering (VQA) task where an algorithm must answer questions about data visualizations, e.g. bar charts, pie charts, and line graphs. CQA requires capabilities that…

Computer Vision and Pattern Recognition · Computer Science 2020-07-23 Kushal Kafle, Robik Shrestha, Brian Price, Scott Cohen, Christopher Kanan

Chart Question Answering: State of the Art and Future Directions

Information visualizations such as bar charts and line charts are very common for analyzing data and discovering critical insights. Often people analyze charts to answer questions that they have in mind. Answering such questions can be…

Computation and Language · Computer Science 2022-05-24 Enamul Hoque, Parsa Kavehzadeh, Ahmed Masry

ChartQA: A Benchmark for Question Answering about Charts with Visual and Logical Reasoning

Charts are very popular for analyzing data. When exploring charts, people often ask a variety of complex reasoning questions that involve several logical and arithmetic operations. They also commonly refer to visual features of a chart in…

Computation and Language · Computer Science 2022-03-22 Ahmed Masry, Do Xuan Long, Jia Qing Tan, Shafiq Joty, Enamul Hoque

RefChartQA: Grounding Visual Answer on Chart Images through Instruction Tuning

Recently, Vision Language Models (VLMs) have increasingly emphasized document visual grounding to achieve better human-computer interaction, accessibility, and detailed understanding. However, its application to visualizations such as…

Computer Vision and Pattern Recognition · Computer Science 2025-06-19 Alexander Vogel, Omar Moured, Yufan Chen, Jiaming Zhang, Rainer Stiefelhagen

Charting the Future: Using Chart Question-Answering for Scalable Evaluation of LLM-Driven Data Visualizations

We propose a novel framework that leverages Visual Question Answering (VQA) models to automate the evaluation of LLM-generated data visualizations. Traditional evaluation methods often rely on human judgment, which is costly and unscalable,…

Computer Vision and Pattern Recognition · Computer Science 2024-09-30 James Ford, Xingmeng Zhao, Dan Schumacher, Anthony Rios

mChartQA: A universal benchmark for multimodal Chart Question Answer based on Vision-Language Alignment and Reasoning

In the fields of computer vision and natural language processing, multimodal chart question-answering, especially involving color, structure, and textless charts, poses significant challenges. Traditional methods, which typically involve…

Computer Vision and Pattern Recognition · Computer Science 2024-04-03 Jingxuan Wei, Nan Xu, Guiyong Chang, Yin Luo, BiHui Yu, Ruifeng Guo

ChartAgent: A Multimodal Agent for Visually Grounded Reasoning in Complex Chart Question Answering

Recent multimodal LLMs have shown promise in chart-based visual question answering, but their performance declines sharply on unannotated charts, those requiring precise visual interpretation rather than relying on textual shortcuts. To…

Artificial Intelligence · Computer Science 2026-01-08 Rachneet Kaur, Nishan Srishankar, Zhen Zeng, Sumitra Ganesh, Manuela Veloso

ChartQAPro: A More Diverse and Challenging Benchmark for Chart Question Answering

Charts are ubiquitous, as people often use them to analyze data, answer questions, and discover critical insights. However, performing complex analytical tasks with charts requires significant perceptual and cognitive effort. Chart Question…

Computation and Language · Computer Science 2025-04-11 Ahmed Masry, Mohammed Saidul Islam, Mahir Ahmed, Aayush Bajaj, Firoz Kabir, Aaryaman Kartha, Md Tahmid Rahman Laskar, Mizanur Rahman, Shadikur Rahman, Mehrad Shahmohammadi, Megh Thakkar, Md Rizwan Parvez, Enamul Hoque, Shafiq Joty

UniChart: A Universal Vision-language Pretrained Model for Chart Comprehension and Reasoning

Charts are very popular for analyzing data, visualizing key insights and answering complex reasoning questions about data. To facilitate chart-based data analysis using natural language, several downstream tasks have been introduced…

Computation and Language · Computer Science 2023-10-12 Ahmed Masry, Parsa Kavehzadeh, Xuan Long Do, Enamul Hoque, Shafiq Joty

EncQA: Benchmarking Vision-Language Models on Visual Encodings for Charts

Multimodal vision-language models (VLMs) continue to achieve ever-improving scores on chart understanding benchmarks. Yet, we find that this progress does not fully capture the breadth of visual reasoning capabilities essential for…

Computer Vision and Pattern Recognition · Computer Science 2025-12-09 Kushin Mukherjee, Donghao Ren, Dominik Moritz, Yannick Assogba

GoT-CQA: Graph-of-Thought Guided Compositional Reasoning for Chart Question Answering

Chart Question Answering (CQA) aims at answering questions based on the visual chart content, which plays an important role in chart summarization, business data analysis, and data report generation. CQA is a challenging multi-modal task…

Computer Vision and Pattern Recognition · Computer Science 2024-09-05 Lingling Zhang, Muye Huang, QianYing Wang, Yaxian Wang, Wenjun Wu, Jun Liu

Chart Question Answering from Real-World Analytical Narratives

We present a new dataset for chart question answering (CQA) constructed from visualization notebooks. The dataset features real-world, multi-view charts paired with natural language questions grounded in analytical narratives. Unlike prior…

Computation and Language · Computer Science 2025-07-03 Maeve Hutchinson, Radu Jianu, Aidan Slingsby, Jo Wood, Pranava Madhyastha

Visual Question Answering: A Survey of Methods and Datasets

Visual Question Answering (VQA) is a challenging task that has received increasing attention from both the computer vision and the natural language processing communities. Given an image and a question in natural language, it requires…

Computer Vision and Pattern Recognition · Computer Science 2016-07-21 Qi Wu, Damien Teney, Peng Wang, Chunhua Shen, Anthony Dick, Anton van den Hengel

GQA: A New Dataset for Real-World Visual Reasoning and Compositional Question Answering

We introduce GQA, a new dataset for real-world visual reasoning and compositional question answering, seeking to address key shortcomings of previous VQA datasets. We have developed a strong and robust question engine that leverages scene…

Computation and Language · Computer Science 2019-07-12 Drew A. Hudson, Christopher D. Manning

InterChart: Benchmarking Visual Reasoning Across Decomposed and Distributed Chart Information

We introduce InterChart, a diagnostic benchmark that evaluates how well vision-language models (VLMs) reason across multiple related charts, a task central to real-world applications such as scientific reporting, financial analysis, and…

Computation and Language · Computer Science 2026-05-04 Anirudh Iyengar Kaniyar Narayana Iyengar, Srija Mukhopadhyay, Adnan Qidwai, Shubhankar Singh, Dan Roth, Vivek Gupta

Unraveling the Truth: Do VLMs really Understand Charts? A Deep Dive into Consistency and Robustness

Chart question answering (CQA) is a crucial area of Visual Language Understanding. However, the robustness and consistency of current Visual Language Models (VLMs) in this field remain under-explored. This paper evaluates state-of-the-art…

Computation and Language · Computer Science 2024-10-07 Srija Mukhopadhyay, Adnan Qidwai, Aparna Garimella, Pritika Ramu, Vivek Gupta, Dan Roth

Visual Programmability: A Guide for Code-as-Thought in Chart Understanding

Chart understanding presents a critical test to the reasoning capabilities of Vision-Language Models (VLMs). Prior approaches face critical limitations: some rely on external tools, making them brittle and constrained by a predefined…

Computer Vision and Pattern Recognition · Computer Science 2025-09-12 Bohao Tang, Yan Ma, Fei Zhang, Jiadi Su, Ethan Chern, Zhulin Hu, Zhixin Wang, Pengfei Liu, Ya Zhang