Related papers: MatPlotAgent: Method and Evaluation for LLM-Based …

PlotGen: Multi-Agent LLM-based Scientific Data Visualization via Multimodal Feedback

Scientific data visualization is pivotal for transforming raw data into comprehensible visual representations, enabling pattern recognition, forecasting, and the presentation of data-driven insights. However, novice users often face…

Computation and Language · Computer Science 2025-02-04 Kanika Goswami, Puneet Mathur, Ryan Rossi, Franck Dernoncourt

SciVisAgentBench: A Benchmark for Evaluating Scientific Data Analysis and Visualization Agents

Recent advances in large language models (LLMs) have enabled agentic systems that translate natural language intent into executable scientific visualization (SciVis) tasks. Despite rapid progress, the community lacks a principled and…

Artificial Intelligence · Computer Science 2026-04-01 Kuangshi Ai, Haichao Miao, Kaiyuan Tang, Nathaniel Gorski, Jianxin Sun, Guoxi Liu, Helgi I. Ingolfsson, David Lenz, Hanqi Guo, Hongfeng Yu, Teja Leburu, Michael Molash, Bei Wang, Tom Peterka, Chaoli Wang, Shusen Liu

Enhancing Agentic Autonomous Scientific Discovery with Vision-Language Model Capabilities

We show that multi-agent systems guided by vision-language models (VLMs) improve end-to-end autonomous scientific discovery. By treating plots as verifiable checkpoints, a VLM-as-a-judge evaluates figures against dynamically generated…

Computation and Language · Computer Science 2025-11-19 Kahaan Gandhi, Boris Bolliet, Inigo Zubeldia

ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery

The advancements of large language models (LLMs) have piqued growing interest in developing LLM-based language agents to automate scientific discovery end-to-end, which has sparked both excitement and skepticism about their true…

Computation and Language · Computer Science 2025-04-01 Ziru Chen, Shijie Chen, Yuting Ning, Qianheng Zhang, Boshi Wang, Botao Yu, Yifei Li, Zeyi Liao, Chen Wei, Zitong Lu, Vishal Dey, Mingyi Xue, Frazier N. Baker, Benjamin Burns, Daniel Adu-Ampratwum, Xuhui Huang, Xia Ning, Song Gao, Yu Su, Huan Sun

Data Interpreter: An LLM Agent For Data Science

Large Language Model (LLM)-based agents have shown effectiveness across many applications. However, their use in data science scenarios requiring solving long-term interconnected tasks, dynamic data adjustments and domain expertise remains…

Artificial Intelligence · Computer Science 2024-10-16 Sirui Hong, Yizhang Lin, Bang Liu, Bangbang Liu, Binhao Wu, Ceyao Zhang, Chenxing Wei, Danyang Li, Jiaqi Chen, Jiayi Zhang, Jinlin Wang, Li Zhang, Lingyao Zhang, Min Yang, Mingchen Zhuge, Taicheng Guo, Tuo Zhou, Wei Tao, Xiangru Tang, Xiangtao Lu, Xiawu Zheng, Xinbing Liang, Yaying Fei, Yuheng Cheng, Zhibin Gou, Zongze Xu, Chenglin Wu

WaitGPT: Monitoring and Steering Conversational LLM Agent in Data Analysis with On-the-Fly Code Visualization

Large language models (LLMs) support data analysis through conversational user interfaces, as exemplified in OpenAI's ChatGPT (formally known as Advanced Data Analysis or Code Interpreter). Essentially, LLMs produce code for accomplishing…

Human-Computer Interaction · Computer Science 2024-08-06 Liwenhan Xie, Chengbo Zheng, Haijun Xia, Huamin Qu, Chen Zhu-Tian

ChartAgent: A Multimodal Agent for Visually Grounded Reasoning in Complex Chart Question Answering

Recent multimodal LLMs have shown promise in chart-based visual question answering, but their performance declines sharply on unannotated charts, those requiring precise visual interpretation rather than relying on textual shortcuts. To…

Artificial Intelligence · Computer Science 2026-01-08 Rachneet Kaur, Nishan Srishankar, Zhen Zeng, Sumitra Ganesh, Manuela Veloso

PUB: Plot Understanding Benchmark and Dataset for Evaluating Large Language Models on Synthetic Visual Data Interpretation

The ability of large language models (LLMs) to interpret visual representations of data is crucial for advancing their application in data analysis and decision-making processes. This paper presents a novel synthetic dataset designed to…

Computation and Language · Computer Science 2024-09-05 Aneta Pawelec, Victoria Sara Wesołowska, Zuzanna Bączek, Piotr Sankowski

An Evaluation-Centric Paradigm for Scientific Visualization Agents

Recent advances in multi-modal large language models (MLLMs) have enabled increasingly sophisticated autonomous visualization agents capable of translating user intentions into data visualizations. However, measuring progress and comparing…

Human-Computer Interaction · Computer Science 2025-09-19 Kuangshi Ai, Haichao Miao, Zhimin Li, Chaoli Wang, Shusen Liu

LLM/Agent-as-Data-Analyst: A Survey

Large language models (LLMs) and agent techniques have brought a fundamental shift in the functionality and development paradigm of data analysis tasks (a.k.a. LLM/Agent-as-Data-Analyst), demonstrating substantial impact across both academia…

Artificial Intelligence · Computer Science 2025-10-28 Zirui Tang, Weizheng Wang, Zihang Zhou, Yang Jiao, Bangrui Xu, Boyu Niu, Dayou Zhou, Xuanhe Zhou, Guoliang Li, Yeye He, Wei Zhou, Yitong Song, Cheng Tan, Xue Yang, Chunwei Liu, Bin Wang, Conghui He, Xiaoyang Wang, Fan Wu

Can Large Language Models Serve as Data Analysts? A Multi-Agent Assisted Approach for Qualitative Data Analysis

Context: Manual qualitative data analysis is time-intensive and can compromise validity and replicability, affecting analysis design, implementation, and reporting. Large Language Models (LLMs) enable human-bot collaboration in Software…

Software Engineering · Computer Science 2025-10-14 Zeeshan Rasheed, Muhammad Waseem, Aakash Ahmad, Kai-Kristian Kemell, Wang Xiaofeng, Anh Nguyen Duc, Pekka Abrahamsson

A Survey on Large Language Model-based Agents for Statistics and Data Science

In recent years, data science agents powered by Large Language Models (LLMs), known as "data agents," have shown significant potential to transform the traditional data analysis paradigm. This survey provides an overview of the evolution,…

Artificial Intelligence · Computer Science 2025-12-01 Maojun Sun, Ruijian Han, Binyan Jiang, Houduo Qi, Defeng Sun, Yancheng Yuan, Jian Huang

Measuring Data Science Automation: A Survey of Evaluation Tools for AI Assistants and Agents

Data science aims to extract insights from data to support decision-making processes. Recently, Large Language Models (LLMs) have been increasingly used as assistants for data science, by suggesting ideas, techniques and small code…

Artificial Intelligence · Computer Science 2025-10-23 Irene Testini, José Hernández-Orallo, Lorenzo Pacchiardi

ChatVis: Large Language Model Agent for Generating Scientific Visualizations

Large language models (LLMs) are rapidly increasing in capability, but they still struggle with highly specialized programming tasks such as scientific visualization. We present an LLM assistant, ChatVis, that aids the LLM to generate…

Human-Computer Interaction · Computer Science 2025-08-01 Tom Peterka, Tanwi Mallick, Orcun Yildiz, David Lenz, Cory Quammen, Berk Geveci

ChatGPT as your Personal Data Scientist

The rise of big data has amplified the need for efficient, user-friendly automated machine learning (AutoML) tools. However, the intricacy of understanding domain-specific data and defining prediction tasks necessitates human intervention…

Computation and Language · Computer Science 2023-05-24 Md Mahadi Hassan, Alex Knipper, Shubhra Kanti Karmaker Santu

AutoMind: Adaptive Knowledgeable Agent for Automated Data Science

Large Language Model (LLM) agents have shown great potential in addressing real-world data science problems. LLM-driven data science agents promise to automate the entire machine learning pipeline, yet their real-world effectiveness remains…

Computation and Language · Computer Science 2025-10-09 Yixin Ou, Yujie Luo, Jingsheng Zheng, Lanning Wei, Zhuoyun Yu, Shuofei Qiao, Jintian Zhang, Da Zheng, Yuren Mao, Yunjun Gao, Huajun Chen, Ningyu Zhang

Explainable Iterative Data Visualisation Refinement via an LLM Agent

Exploratory analysis of high-dimensional data relies on embedding the data into a low-dimensional space (typically 2D or 3D), from which a visualization plot is produced to uncover meaningful structures and to communicate geometric and…

Human-Computer Interaction · Computer Science 2026-04-23 Burak Susam, Tingting Mu

Automating Data-Driven Modeling and Analysis for Engineering Applications using Large Language Model Agents

Modern engineering increasingly relies on vast datasets generated by experiments and simulations, driving a growing demand for efficient, reliable, and broadly applicable modeling strategies. There is also heightened interest in developing…

Artificial Intelligence · Computer Science 2025-10-03 Yang Liu, Zaid Abulawi, Abhiram Garimidi, Doyeong Lim

Automated Visualization Makeovers with LLMs

Making a good graphic that accurately and efficiently conveys the desired message to the audience is both an art and a science, typically not taught in the data science curriculum. Visualisation makeovers are exercises where the community…

Human-Computer Interaction · Computer Science 2025-08-11 Siddharth Gangwar, David A. Selby, Sebastian J. Vollmer

SASAV: Self-Directed Agent for Scientific Analysis and Visualization

With recent advances in frontier multimodal large language models (MLLMs) for data understanding and visual reasoning, the role of LLMs has evolved from passive LLM-as-an-interface to proactive LLM-as-a-judge, enabling deeper integration…

Graphics · Computer Science 2026-04-07 Jianxin Sun, David Lenz, Tom Peterka, Hongfeng Yu