Related papers: Evaluation Agent: Efficient and Promptable Evaluat…

200 papers

The rapid advancement of video generation has rendered existing evaluation systems inadequate for assessing state-of-the-art models, primarily due to simple prompts that cannot showcase the model's capabilities, fixed evaluation operators…

Computer Vision and Pattern Recognition · Computer Science 2025-04-29 Yuhang Yang, Ke Fan, Shangkun Sun, Hongxiang Li, Ailing Zeng, FeiLin Han, Wei Zhai, Wei Liu, Yang Cao, Zheng-Jun Zha

Collecting human judgements is currently the most reliable evaluation method for natural language generation systems. Automatic metrics have reported flaws when applied to measure quality aspects of generated text and have been shown to…

Computation and Language · Computer Science 2022-04-29 Thórhildur Thorleiksdóttir, Cedric Renggli, Nora Hollenstein, Ce Zhang

In the accelerating era of human-instructed visual content creation, diffusion models have demonstrated remarkable generative potential. Yet their deployment is constrained by a dual bottleneck: semantic ambiguity in diverse prompts and the…

Computer Vision and Pattern Recognition · Computer Science 2026-01-21 Jie Qin, Jie Wu, Weifeng Chen, Yueming Lyu

Understanding long-form video content presents significant challenges due to its temporal complexity and the substantial computational resources required. In this work, we propose an agent-based approach to enhance both the efficiency and…

Computer Vision and Pattern Recognition · Computer Science 2024-10-29 Sullam Jeoung, Goeric Huybrechts, Bhavana Ganesh, Aram Galstyan, Sravan Bodapati

Modern businesses are increasingly challenged by the time and expense required to generate and assess high-quality content. Human writers face time constraints, and extrinsic evaluations can be costly. While Large Language Models (LLMs)…

Artificial Intelligence · Computer Science 2025-12-10 Thanh Vu, Richi Nayak, Thiru Balasubramaniam

Recent advancements in generative AI have significantly increased interest in personalized agents. With increased personalization, there is also a greater need to be able to trust the decision-making and action-taking capabilities of these…

Information Retrieval · Computer Science 2025-04-10 Chirag Shah, Hideo Joho, Kirandeep Kaur, Preetam Prabhu Srikar Dammu

We introduce GenAgent, unifying visual understanding and generation through an agentic multimodal model. Unlike unified models that face expensive training costs and understanding-generation trade-offs, GenAgent decouples these capabilities…

Computer Vision and Pattern Recognition · Computer Science 2026-01-29 Kaixun Jiang, Yuzheng Wang, Junjie Zhou, Pandeng Li, Zhihang Liu, Chen-Wei Xie, Zhaoyu Chen, Yun Zheng, Wenqiang Zhang

Video understanding is fundamental to tasks such as action recognition, video reasoning, and robotic control. Early video understanding methods based on large vision-language models (LVLMs) typically adopt a single-pass reasoning paradigm…

Computer Vision and Pattern Recognition · Computer Science 2025-06-03 Yiyang Zhou, Yangfan He, Yaofeng Su, Siwei Han, Joel Jang, Gedas Bertasius, Mohit Bansal, Huaxiu Yao

The visual analytics community has long aimed to understand users better and assist them in their analytic endeavors. As a result, numerous conceptual models of visual analytics aim to formalize common workflows, techniques, and goals…

Human-Computer Interaction · Computer Science 2023-04-20 Shayan Monadjemi, Mengtian Guo, David Gotz, Roman Garnett, Alvitta Ottley

Collective Perception has attracted significant attention in recent years due to its advantage for mitigating occlusion and expanding the field-of-view, thereby enhancing reliability, efficiency, and, most crucially, decision-making safety.…

Computer Vision and Pattern Recognition · Computer Science 2024-04-30 Yunshuang Yuan, Monika Sester

Recent visual generation models have made major progress in photorealism, typography, instruction following, and interactive editing, yet they still struggle with spatial reasoning, persistent state, long-horizon consistency, and causal…

To reduce the cost of experimental evaluation for agents, we introduce a computational theory of evaluation for mini agents: building an evaluation model to accelerate the evaluation procedures. We prove upper bounds of generalized error…

Artificial Intelligence · Computer Science 2025-05-19 Hedong Yan

Automatic question generation (AQG) for mathematics education remains an elusive goal for Intelligent Tutoring Systems and educators. While pre-trained transformer-based language models have significantly advanced natural language…

Multiagent Systems · Computer Science 2025-11-07 Kia Karbasi, Kevin Hong, Mohammad Amin Samadi, Gregory Pottie

We present PresentAgent, a multimodal agent that transforms long-form documents into narrated presentation videos. While existing approaches are limited to generating static slides or text summaries, our method advances beyond these…

Computer Vision and Pattern Recognition · Computer Science 2025-07-08 Jingwei Shi, Zeyu Zhang, Biao Wu, Yanjie Liang, Meng Fang, Ling Chen, Yang Zhao

Open-ended image generation is no longer a simple prompt-to-image problem. High-quality generation often requires an agent to combine a model's internal generative ability with external resources. As requests become more diverse and…

Computer Vision and Pattern Recognition · Computer Science 2026-05-25 Sixiang Chen, Zhaohu Xing, Tian Ye, Xinyu Geng, Yunlong Lin, Jianyu Lai, Xuanhua He, Fuxiang Zhai, Jialin Gao, Lei Zhu

Rehearsal-based video incremental learning often employs knowledge distillation to mitigate catastrophic forgetting of previously learned data. However, this method faces two major challenges for video tasks: substantial computing resources…

Computer Vision and Pattern Recognition · Computer Science 2023-12-12 Shengqin Jiang, Yaoyu Fang, Haokui Zhang, Qingshan Liu, Yuankai Qi, Yang Yang, Peng Wang

As agent-based systems continue to evolve, deep research agents are capable of automatically generating research-style reports across diverse domains. While these agents promise to streamline information synthesis and knowledge exploration,…

Artificial Intelligence · Computer Science 2026-04-08 Yi Yuan, Xuhong Wang, Shanzhe Lei

Complex data analysis inherently seeks unexpected insights through exploratory visual analysis methods, transcending logical, step-by-step processing. However, existing interfaces such as notebooks and dashboards have limitations in…

Human-Computer Interaction · Computer Science 2024-03-22 Zijian Ding, Joel Chan

Classic evaluation methods for believable agents are time-consuming because they involve many humans judging agents. They are well suited to validating work on new believable behaviour models. However, during the implementation, numerous…

Artificial Intelligence · Computer Science 2010-09-03 Fabien Tencé, Cédric Buche

Dynamic data selection aims to accelerate training by prioritizing informative samples during online training. However, existing methods typically rely on task-specific handcrafted metrics or static/snapshot-based criteria to estimate…

Machine Learning · Computer Science 2026-05-14 Suorong Yang, Fangjian Su, Hai Gan, Ziqi Ye, Jie Li, Baile Xu, Furao Shen, Soujanya Poria