Related papers: Evaluation Agent: Efficient and Promptable Evaluat…

VideoGen-Eval: Agent-based System for Video Generation Evaluation

The rapid advancement of video generation has rendered existing evaluation systems inadequate for assessing state-of-the-art models, primarily due to simple prompts that cannot showcase the model's capabilities, fixed evaluation operators…

Computer Vision and Pattern Recognition · Computer Science 2025-04-29 Yuhang Yang, Ke Fan, Shangkun Sun, Hongxiang Li, Ailing Zeng, FeiLin Han, Wei Zhai, Wei Liu, Yang Cao, Zheng-Jun Zha

Dynamic Human Evaluation for Relative Model Comparisons

Collecting human judgements is currently the most reliable evaluation method for natural language generation systems. Automatic metrics have reported flaws when applied to measure quality aspects of generated text and have been shown to…

Computation and Language · Computer Science 2022-04-29 Thórhildur Thorleiksdóttir, Cedric Renggli, Nora Hollenstein, Ce Zhang

DiffusionAgent: Navigating Expert Models for Agentic Image Generation

In the accelerating era of human-instructed visual content creation, diffusion models have demonstrated remarkable generative potential. Yet their deployment is constrained by a dual bottleneck: semantic ambiguity in diverse prompts and the…

Computer Vision and Pattern Recognition · Computer Science 2026-01-21 Jie Qin, Jie Wu, Weifeng Chen, Yueming Lyu

Adaptive Video Understanding Agent: Enhancing efficiency with dynamic frame sampling and feedback-driven reasoning

Understanding long-form video content presents significant challenges due to its temporal complexity and the substantial computational resources required. In this work, we propose an agent-based approach to enhance both the efficiency and…

Computer Vision and Pattern Recognition · Computer Science 2024-10-29 Sullam Jeoung, Goeric Huybrechts, Bhavana Ganesh, Aram Galstyan, Sravan Bodapati

AgentEval: Generative Agents as Reliable Proxies for Human Evaluation of AI-Generated Content

Modern businesses are increasingly challenged by the time and expense required to generate and assess high-quality content. Human writers face time constraints, and extrinsic evaluations can be costly. While Large Language Models (LLMs)…

Artificial Intelligence · Computer Science 2025-12-10 Thanh Vu, Richi Nayak, Thiru Balasubramaniam

Dynamic Evaluation Framework for Personalized and Trustworthy Agents: A Multi-Session Approach to Preference Adaptability

Recent advancements in generative AI have significantly increased interest in personalized agents. With increased personalization, there is also a greater need for being able to trust decision-making and action taking capabilities of these…

Information Retrieval · Computer Science 2025-04-10 Chirag Shah, Hideo Joho, Kirandeep Kaur, Preetam Prabhu Srikar Dammu

GenAgent: Scaling Text-to-Image Generation via Agentic Multimodal Reasoning

We introduce GenAgent, unifying visual understanding and generation through an agentic multimodal model. Unlike unified models that face expensive training costs and understanding-generation trade-offs, GenAgent decouples these capabilities…

Computer Vision and Pattern Recognition · Computer Science 2026-01-29 Kaixun Jiang, Yuzheng Wang, Junjie Zhou, Pandeng Li, Zhihang Liu, Chen-Wei Xie, Zhaoyu Chen, Yun Zheng, Wenqiang Zhang

ReAgent-V: A Reward-Driven Multi-Agent Framework for Video Understanding

Video understanding is fundamental to tasks such as action recognition, video reasoning, and robotic control. Early video understanding methods based on large vision-language models (LVLMs) typically adopt a single-pass reasoning paradigm…

Computer Vision and Pattern Recognition · Computer Science 2025-06-03 Yiyang Zhou, Yangfan He, Yaofeng Su, Siwei Han, Joel Jang, Gedas Bertasius, Mohit Bansal, Huaxiu Yao

Human-Computer Collaboration for Visual Analytics: an Agent-based Framework

The visual analytics community has long aimed to understand users better and assist them in their analytic endeavors. As a result, numerous conceptual models of visual analytics aim to formalize common workflows, techniques, and goals…

Human-Computer Interaction · Computer Science 2023-04-20 Shayan Monadjemi, Mengtian Guo, David Gotz, Roman Garnett, Alvitta Ottley

CoSense3D: an Agent-based Efficient Learning Framework for Collective Perception

Collective Perception has attracted significant attention in recent years due to its advantage for mitigating occlusion and expanding the field-of-view, thereby enhancing reliability, efficiency, and, most crucially, decision-making safety.…

Computer Vision and Pattern Recognition · Computer Science 2024-04-30 Yunshuang Yuan, Monika Sester

Visual Generation in the New Era: An Evolution from Atomic Mapping to Agentic World Modeling

Recent visual generation models have made major progress in photorealism, typography, instruction following, and interactive editing, yet they still struggle with spatial reasoning, persistent state, long-horizon consistency, and causal…

Computer Vision and Pattern Recognition · Computer Science 2026-05-01 Keming Wu, Zuhao Yang, Kaichen Zhang, Shizun Wang, Haowei Zhu, Sicong Leng, Zhongyu Yang, Qijie Wang, Sudong Wang, Ziting Wang, Zili Wang, Hui Zhang, Haonan Wang, Hang Zhou, Yifan Pu, Xingxuan Li, Fangneng Zhan, Bo Li, Lidong Bing, Yuxin Song, Ziwei Liu, Wenhu Chen, Jingdong Wang, Xinchao Wang, Xiaojuan Qi, Shijian Lu, Bin Wang

A Computational Theory for Efficient Mini Agent Evaluation with Causal Guarantees

To reduce the cost of experimental evaluation for agents, we introduce a computational theory of evaluation for mini agents: build an evaluation model to accelerate the evaluation procedures. We prove upper bounds of generalized error…

Artificial Intelligence · Computer Science 2025-05-19 Hedong Yan

Multi-Agent Collaborative Framework For Math Problem Generation

Automatic question generation (AQG) for mathematics education remains an elusive goal for Intelligent Tutoring Systems and educators. While pre-trained transformer-based language models have significantly advanced natural language…

Multiagent Systems · Computer Science 2025-11-07 Kia Karbasi, Kevin Hong, Mohammad Amin Samadi, Gregory Pottie

PresentAgent: Multimodal Agent for Presentation Video Generation

We present PresentAgent, a multimodal agent that transforms long-form documents into narrated presentation videos. While existing approaches are limited to generating static slides or text summaries, our method advances beyond these…

Computer Vision and Pattern Recognition · Computer Science 2025-07-08 Jingwei Shi, Zeyu Zhang, Biao Wu, Yanjie Liang, Meng Fang, Ling Chen, Yang Zhao

GenEvolve: Self-Evolving Image Generation Agents via Tool-Orchestrated Visual Experience Distillation

Open-ended image generation is no longer a simple prompt-to-image problem. High-quality generation often requires an agent to combine a model's internal generative ability with external resources. As requests become more diverse and…

Computer Vision and Pattern Recognition · Computer Science 2026-05-25 Sixiang Chen, Zhaohu Xing, Tian Ye, Xinyu Geng, Yunlong Lin, Jianyu Lai, Xuanhua He, Fuxiang Zhai, Jialin Gao, Lei Zhu

Teacher Agent: A Knowledge Distillation-Free Framework for Rehearsal-based Video Incremental Learning

Rehearsal-based video incremental learning often employs knowledge distillation to mitigate catastrophic forgetting of previously learned data. However, this method faces two major challenges for video tasks: substantial computing resources…

Computer Vision and Pattern Recognition · Computer Science 2023-12-12 Shengqin Jiang, Yaoyu Fang, Haokui Zhang, Qingshan Liu, Yuankai Qi, Yang Yang, Peng Wang

Towards Trustworthy Report Generation: A Deep Research Agent with Progressive Confidence Estimation and Calibration

As agent-based systems continue to evolve, deep research agents are capable of automatically generating research-style reports across diverse domains. While these agents promise to streamline information synthesis and knowledge exploration,…

Artificial Intelligence · Computer Science 2026-04-08 Yi Yuan, Xuhong Wang, Shanzhe Lei

Intelligent Canvas: Enabling Design-Like Exploratory Visual Data Analysis with Generative AI through Rapid Prototyping, Iteration and Curation

Complex data analysis inherently seeks unexpected insights through exploratory visual analysis methods, transcending logical, step-by-step processing. However, existing interfaces such as notebooks and dashboards have limitations in…

Human-Computer Interaction · Computer Science 2024-03-22 Zijian Ding, Joel Chan

Automatable Evaluation Method Oriented toward Behaviour Believability for Video Games

Classic evaluation methods for believable agents are time-consuming because they involve many humans judging the agents. They are well suited to validating work on new believable behaviour models. However, during the implementation, numerous…

Artificial Intelligence · Computer Science 2010-09-03 Fabien Tencé, Cédric Buche

Data Agent: Learning to Select Data via End-to-End Dynamic Optimization

Dynamic data selection aims to accelerate training by prioritizing informative samples during online training. However, existing methods typically rely on task-specific handcrafted metrics or static/snapshot-based criteria to estimate…

Machine Learning · Computer Science 2026-05-14 Suorong Yang, Fangjian Su, Hai Gan, Ziqi Ye, Jie Li, Baile Xu, Furao Shen, Soujanya Poria