Related papers: InterChat: Enhancing Generative Visual Analytics u…

InsightLens: Augmenting LLM-Powered Data Analysis with Interactive Insight Management and Navigation

The proliferation of large language models (LLMs) has revolutionized the capabilities of natural language interfaces (NLIs) for data analysis. LLMs can perform multi-step and complex reasoning to generate data insights based on users'…

Human-Computer Interaction · Computer Science 2024-12-24 Luoxuan Weng , Xingbo Wang , Junyu Lu , Yingchaojie Feng , Yihan Liu , Haozhe Feng , Danqing Huang , Wei Chen

Designing a Lightweight GenAI Interface for Visual Data Analysis

Recent advances in Generative AI have transformed how users interact with data analysis through natural language interfaces. However, many systems rely too heavily on LLMs, creating risks of hallucination, opaque reasoning, and reduced user…

Human-Computer Interaction · Computer Science 2025-09-04 Ratanond Koonchanok , Alex Kale , Khairi Reda

Integrating Large Language Models into Text Animation: An Intelligent Editing System with Inline and Chat Interaction

Text animation, a foundational element in video creation, enables efficient and cost-effective communication, thriving in advertisements, journalism, and social media. However, traditional animation workflows present significant usability…

Human-Computer Interaction · Computer Science 2025-06-13 Bao Zhang , Zihan Li , Zhenglei Liu , Huanchen Wang , Yuxin Ma

Intelligent Co-Design: An Interactive LLM Framework for Interior Spatial Design via Multi-Modal Agents

In architectural interior design, miscommunication frequently arises as clients lack design knowledge, while designers struggle to explain complex spatial relationships, leading to delayed timelines and financial losses. Recent advancements…

Artificial Intelligence · Computer Science 2026-03-17 Ren Jian Lim , Rushi Dai

Generative Interfaces for Language Models

Large language models (LLMs) are increasingly seen as assistants, copilots, and consultants, capable of supporting a wide range of tasks through natural conversation. However, most systems remain constrained by a linear request-response…

Computation and Language · Computer Science 2026-05-05 Jiaqi Chen , Yanzhe Zhang , Yutong Zhang , Yijia Shao , Diyi Yang

Comparative Analysis of Large Language Models for the Machine-Assisted Resolution of User Intentions

Large Language Models (LLMs) have emerged as transformative tools for natural language understanding and user intent resolution, enabling tasks such as translation, summarization, and, increasingly, the orchestration of complex workflows.…

Software Engineering · Computer Science 2025-11-12 Justus Flerlage , Alexander Acker , Odej Kao

Visualizing Dialogues: Enhancing Image Selection through Dialogue Understanding with Large Language Models

Recent advancements in dialogue systems have highlighted the significance of integrating multimodal responses, which enable conveying ideas through diverse modalities rather than solely relying on text-based interactions. This enrichment…

Computation and Language · Computer Science 2024-07-08 Chang-Sheng Kao , Yun-Nung Chen

DeepSpeed-VisualChat: Multi-Round Multi-Image Interleave Chat via Multi-Modal Causal Attention

Most of the existing multi-modal models, hindered by their incapacity to adeptly manage interleaved image-and-text inputs in multi-image, multi-round dialogues, face substantial constraints in resource allocation for training and data…

Computer Vision and Pattern Recognition · Computer Science 2023-11-30 Zhewei Yao , Xiaoxia Wu , Conglong Li , Minjia Zhang , Heyang Qin , Olatunji Ruwase , Ammar Ahmad Awan , Samyam Rajbhandari , Yuxiong He

ChartGPT: Leveraging LLMs to Generate Charts from Abstract Natural Language

The use of natural language interfaces (NLIs) to create charts is becoming increasingly popular due to the intuitiveness of natural language interactions. One key challenge in this approach is to accurately capture user intents and…

Human-Computer Interaction · Computer Science 2025-01-22 Yuan Tian , Weiwei Cui , Dazhen Deng , Xinjing Yi , Yurun Yang , Haidong Zhang , Yingcai Wu

Multimodal Fusion with LLMs for Engagement Prediction in Natural Conversation

Over the past decade, wearable computing devices (``smart glasses'') have undergone remarkable advancements in sensor technology, design, and processing power, ushering in a new era of opportunity for high-density human behavior data.…

Artificial Intelligence · Computer Science 2025-10-30 Cheng Charles Ma , Kevin Hyekang Joo , Alexandria K. Vail , Sunreeta Bhattacharya , Álvaro Fernández García , Kailana Baker-Matsuoka , Sheryl Mathew , Lori L. Holt , Fernando De la Torre

CloChat: Understanding How People Customize, Interact, and Experience Personas in Large Language Models

Large language models (LLMs) have facilitated significant strides in generating conversational agents, enabling seamless, contextually relevant dialogues across diverse topics. However, the existing LLM-driven conversational agents have…

Human-Computer Interaction · Computer Science 2024-02-26 Juhye Ha , Hyeon Jeon , DaEun Han , Jinwook Seo , Changhoon Oh

Zero-Shot Goal-Directed Dialogue via RL on Imagined Conversations

Large language models (LLMs) have emerged as powerful and general solutions to many natural language tasks. However, many of the most important applications of language generation are interactive, where an agent has to talk to a person to…

Machine Learning · Computer Science 2023-11-10 Joey Hong , Sergey Levine , Anca Dragan

IntellAgent: A Multi-Agent Framework for Evaluating Conversational AI Systems

Large Language Models (LLMs) are transforming artificial intelligence, evolving into task-oriented systems capable of autonomous planning and execution. One of the primary applications of LLMs is conversational AI systems, which must…

Computation and Language · Computer Science 2025-01-22 Elad Levi , Ilan Kadar

Using Large Language Models to Generate, Validate, and Apply User Intent Taxonomies

Log data can reveal valuable information about how users interact with Web search services, what they want, and how satisfied they are. However, analyzing user intents in log data is not easy, especially for emerging forms of Web search…

Information Retrieval · Computer Science 2024-05-13 Chirag Shah , Ryen W. White , Reid Andersen , Georg Buscher , Scott Counts , Sarkar Snigdha Sarathi Das , Ali Montazer , Sathish Manivannan , Jennifer Neville , Xiaochuan Ni , Nagu Rangan , Tara Safavi , Siddharth Suri , Mengting Wan , Leijie Wang , Longqi Yang

WaitGPT: Monitoring and Steering Conversational LLM Agent in Data Analysis with On-the-Fly Code Visualization

Large language models (LLMs) support data analysis through conversational user interfaces, as exemplified in OpenAI's ChatGPT (formally known as Advanced Data Analysis or Code Interpreter). Essentially, LLMs produce code for accomplishing…

Human-Computer Interaction · Computer Science 2024-08-06 Liwenhan Xie , Chengbo Zheng , Haijun Xia , Huamin Qu , Chen Zhu-Tian

Understanding Large Language Model Behaviors through Interactive Counterfactual Generation and Analysis

Understanding the behavior of large language models (LLMs) is crucial for ensuring their safe and reliable use. However, existing explainable AI (XAI) methods for LLMs primarily rely on word-level explanations, which are often…

Computation and Language · Computer Science 2025-08-08 Furui Cheng , Vilém Zouhar , Robin Shing Moon Chan , Daniel Fürst , Hendrik Strobelt , Mennatallah El-Assady

POEM: Interactive Prompt Optimization for Enhancing Multimodal Reasoning of Large Language Models

Large language models (LLMs) have exhibited impressive abilities for multimodal content comprehension and reasoning with proper prompting in zero- or few-shot settings. Despite the proliferation of interactive systems developed to support…

Human-Computer Interaction · Computer Science 2024-10-01 Jianben He , Xingbo Wang , Shiyi Liu , Guande Wu , Claudio Silva , Huamin Qu

State of the Art of LLM-Enabled Interaction with Visualization

We report on a systematic, PRISMA-guided survey of research at the intersection of LLMs and visualization, with a particular focus on visio-verbal interaction -- where verbal and visual modalities converge to support data sense-making. The…

Human-Computer Interaction · Computer Science 2026-02-04 Mathis Brossier , Tobias Isenberg , Konrad Schönborn , Jonas Unger , Mario Romero , Johanna Björklund , Anders Ynnerman , Lonni Besançon

Video-ChatGPT: Towards Detailed Video Understanding via Large Vision and Language Models

Conversation agents fueled by Large Language Models (LLMs) are providing a new way to interact with visual data. While there have been initial attempts for image-based conversation models, this work addresses the under-explored field of…

Computer Vision and Pattern Recognition · Computer Science 2024-06-11 Muhammad Maaz , Hanoona Rasheed , Salman Khan , Fahad Shahbaz Khan

Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models

Large Language Models (LLMs) demonstrate enhanced capabilities and reliability by reasoning more, evolving from Chain-of-Thought prompting to product-level solutions like OpenAI o1. Despite various efforts to improve LLM reasoning,…

Computer Vision and Pattern Recognition · Computer Science 2025-05-05 Yuhao Dong , Zuyan Liu , Hai-Long Sun , Jingkang Yang , Winston Hu , Yongming Rao , Ziwei Liu