Related papers: Generating Statistical Charts with Validation-Driv…

ChartCards: A Chart-Metadata Generation Framework for Multi-Task Chart Understanding

The emergence of Multi-modal Large Language Models (MLLMs) presents new opportunities for chart understanding. However, due to the fine-grained nature of these tasks, applying MLLMs typically requires large, high-quality datasets for…

Computation and Language · Computer Science 2025-10-08 Yifan Wu , Lutao Yan , Leixian Shen , Yinan Mei , Jiannan Wang , Yuyu Luo

ChartX & ChartVLM: A Versatile Benchmark and Foundation Model for Complicated Chart Reasoning

Recently, many versatile Multi-modal Large Language Models (MLLMs) have emerged continuously. However, their capacity to query information depicted in visual charts and engage in reasoning based on the queried contents remains…

Computer Vision and Pattern Recognition · Computer Science 2025-04-29 Renqiu Xia , Bo Zhang , Hancheng Ye , Xiangchao Yan , Qi Liu , Hongbin Zhou , Zijun Chen , Peng Ye , Min Dou , Botian Shi , Junchi Yan , Yu Qiao

Unmasking Deceptive Visuals: Benchmarking Multimodal Large Language Models on Misleading Chart Question Answering

Misleading visualizations, which manipulate chart representations to support specific claims, can distort perception and lead to incorrect conclusions. Despite decades of research, they remain a widespread issue, posing risks to public…

Computation and Language · Computer Science 2025-09-23 Zixin Chen , Sicheng Song , Kashun Shum , Yanna Lin , Rui Sheng , Weiqi Wang , Huamin Qu

ChartBench: A Benchmark for Complex Visual Reasoning in Charts

Multimodal Large Language Models (MLLMs) have shown impressive capabilities in image understanding and generation. However, current benchmarks fail to accurately evaluate the chart comprehension of MLLMs due to limited chart types and…

Computer Vision and Pattern Recognition · Computer Science 2024-06-21 Zhengzhuo Xu , Sinan Du , Yiyan Qi , Chengjin Xu , Chun Yuan , Jian Guo

ChartLlama: A Multimodal LLM for Chart Understanding and Generation

Multi-modal large language models have demonstrated impressive performances on most vision-language tasks. However, the model generally lacks the understanding capabilities for specific domain data, particularly when it comes to…

Computer Vision and Pattern Recognition · Computer Science 2023-11-29 Yucheng Han , Chi Zhang , Xin Chen , Xu Yang , Zhibin Wang , Gang Yu , Bin Fu , Hanwang Zhang

ChartCoder: Advancing Multimodal Large Language Model for Chart-to-Code Generation

Multimodal Large Language Models (MLLMs) have demonstrated remarkable capabilities in chart understanding tasks. However, interpreting charts with textual descriptions often leads to information loss, as it fails to fully capture the dense…

Artificial Intelligence · Computer Science 2025-07-03 Xuanle Zhao , Xianzhen Luo , Qi Shi , Chi Chen , Shuo Wang , Zhiyuan Liu , Maosong Sun

How Good (Or Bad) Are LLMs at Detecting Misleading Visualizations?

In this study, we address the growing issue of misleading charts, a prevalent problem that undermines the integrity of information dissemination. Misleading charts can distort the viewer's perception of data, leading to misinterpretations…

Human-Computer Interaction · Computer Science 2024-07-25 Leo Yu-Ho Lo , Huamin Qu

ChartMimic: Evaluating LMM's Cross-Modal Reasoning Capability via Chart-to-Code Generation

We introduce a new benchmark, ChartMimic, aimed at assessing the visually-grounded code generation capabilities of large multimodal models (LMMs). ChartMimic utilizes information-intensive visual charts and textual instructions as inputs,…

Software Engineering · Computer Science 2025-03-03 Cheng Yang , Chufan Shi , Yaxin Liu , Bo Shui , Junjie Wang , Mohan Jing , Linran Xu , Xinyu Zhu , Siheng Li , Yuxiang Zhang , Gongye Liu , Xiaomei Nie , Deng Cai , Yujiu Yang

Chart-to-Experience: Benchmarking Multimodal LLMs for Predicting Experiential Impact of Charts

The field of Multimodal Large Language Models (MLLMs) has made remarkable progress in visual understanding tasks, presenting a vast opportunity to predict the perceptual and emotional impact of charts. However, it also raises concerns, as…

Human-Computer Interaction · Computer Science 2025-05-26 Seon Gyeom Kim , Jae Young Choi , Ryan Rossi , Eunyee Koh , Tak Yeon Lee

Advancing Multimodal Large Language Models in Chart Question Answering with Visualization-Referenced Instruction Tuning

Emerging multimodal large language models (MLLMs) exhibit great potential for chart question answering (CQA). Recent efforts primarily focus on scaling up training datasets (i.e., charts, data tables, and question-answer (QA) pairs) through…

Computer Vision and Pattern Recognition · Computer Science 2024-08-13 Xingchen Zeng , Haichuan Lin , Yilin Ye , Wei Zeng

Are LLMs ready to help non-expert users to make charts of official statistics data?

In this time when biased information, deep fakes, and propaganda proliferate, the accessibility of reliable data sources is more important than ever. National statistical institutes provide curated data that contain quantitative information…

Information Retrieval · Computer Science 2025-10-03 Gadir Suleymanli , Alexander Rogiers , Lucas Lageweg , Jefrey Lijffijt

MultiChartQA: Benchmarking Vision-Language Models on Multi-Chart Problems

Multimodal Large Language Models (MLLMs) have demonstrated impressive abilities across various tasks, including visual question answering and chart comprehension, yet existing benchmarks for chart-related tasks fall short in capturing the…

Computation and Language · Computer Science 2025-02-11 Zifeng Zhu , Mengzhao Jia , Zhihan Zhang , Lang Li , Meng Jiang

RealChart2Code: Advancing Chart-to-Code Generation with Real Data and Multi-Task Evaluation

Vision-Language Models (VLMs) have demonstrated impressive capabilities in code generation across various domains. However, their ability to replicate complex, multi-panel visualizations from real-world data remains largely unassessed. To…

Computation and Language · Computer Science 2026-03-30 Jiajun Zhang , Yuying Li , Zhixun Li , Xingyu Guo , Jingzhuo Wu , Leqi Zheng , Yiran Yang , Jianke Zhang , Qingbin Li , Shannan Yan , Zhetong Li , Changguo Jia , Junfei Wu , Zilei Wang , Qiang Liu , Liang Wang

ChartGPT: Leveraging LLMs to Generate Charts from Abstract Natural Language

The use of natural language interfaces (NLIs) to create charts is becoming increasingly popular due to the intuitiveness of natural language interactions. One key challenge in this approach is to accurately capture user intents and…

Human-Computer Interaction · Computer Science 2025-01-22 Yuan Tian , Weiwei Cui , Dazhen Deng , Xinjing Yi , Yurun Yang , Haidong Zhang , Yingcai Wu

Multimodal Information Fusion for Chart Understanding: A Survey of MLLMs -- Evolution, Limitations, and Cognitive Enhancement

Chart understanding is a quintessential information fusion task, requiring the seamless integration of graphical and textual data to extract meaning. The advent of Multimodal Large Language Models (MLLMs) has revolutionized this domain, yet…

Computer Vision and Pattern Recognition · Computer Science 2026-02-12 Zhihang Yi , Jian Zhao , Jiancheng Lv , Tao Wang

ChartComplete: A Taxonomy-based Inclusive Chart Dataset

With advancements in deep learning (DL) and computer vision techniques, the field of chart understanding is evolving rapidly. In particular, multimodal large language models (MLLMs) are proving to be efficient and accurate in understanding…

Artificial Intelligence · Computer Science 2026-01-21 Ahmad Mustapha , Charbel Toumieh , Mariette Awad

ChartInstruct: Instruction Tuning for Chart Comprehension and Reasoning

Charts provide visual representations of data and are widely used for analyzing information, addressing queries, and conveying insights to others. Various chart-related downstream tasks have emerged recently, such as question-answering and…

Computation and Language · Computer Science 2024-03-15 Ahmed Masry , Mehrad Shahmohammadi , Md Rizwan Parvez , Enamul Hoque , Shafiq Joty

MMC: Advancing Multimodal Chart Understanding with Large-scale Instruction Tuning

With the rapid development of large language models (LLMs) and their integration into large multimodal models (LMMs), there has been impressive progress in zero-shot completion of user-oriented vision-language tasks. However, a gap remains…

Computation and Language · Computer Science 2024-04-16 Fuxiao Liu , Xiaoyang Wang , Wenlin Yao , Jianshu Chen , Kaiqiang Song , Sangwoo Cho , Yaser Yacoob , Dong Yu

Text2Chart31: Instruction Tuning for Chart Generation with Automatic Feedback

Large language models (LLMs) have demonstrated strong capabilities across various language tasks, notably through instruction-tuning methods. However, LLMs face challenges in visualizing complex, real-world data through charts and plots.…

Machine Learning · Computer Science 2025-02-18 Fatemeh Pesaran Zadeh , Juyeon Kim , Jin-Hwa Kim , Gunhee Kim

Synthesis by Design: Controlled Data Generation via Structural Guidance

Mathematical reasoning remains challenging for LLMs due to complex logic and the need for precise computation. Existing methods enhance LLM reasoning by synthesizing datasets through problem rephrasing, but face issues with generation…

Computation and Language · Computer Science 2025-06-12 Lei Xu , Sirui Chen , Yuxuan Huang , Chaochao Lu