English
Related papers

Related papers: Distributed Multi-agent Video Fast-forwarding

200 papers

Multi-agent applications have recently gained significant popularity. In many computer vision tasks, a network of agents, such as a team of robots with cameras, could work collaboratively to perceive the environment for efficient and…

Computer Vision and Pattern Recognition · Computer Science 2023-05-30 Shuyue Lan , Zhilu Wang , Ermin Wei , Amit K. Roy-Chowdhury , Qi Zhu

Understanding long-form video content presents significant challenges due to its temporal complexity and the substantial computational resources required. In this work, we propose an agent-based approach to enhance both the efficiency and…

Computer Vision and Pattern Recognition · Computer Science 2024-10-29 Sullam Jeoung , Goeric Huybrechts , Bhavana Ganesh , Aram Galstyan , Sravan Bodapati

Effective environment perception is crucial for enabling downstream robotic applications. Individual robotic agents often face occlusion and limited visibility issues, whereas multi-agent systems can offer a more comprehensive mapping of…

Robotics · Computer Science 2024-10-01 Hongrui Zhao , Boris Ivanovic , Negar Mehr

The behavioral dynamics of multi-agent systems have a rich and orderly structure, which can be leveraged to understand these systems, and to improve how artificial agents learn to operate in them. Here we introduce Relational Forward Models…

Deep reinforcement learning has been applied successfully to solve various real-world problems and the number of its applications in the multi-agent settings has been increasing. Multi-agent learning distinctly poses significant challenges…

Machine Learning · Computer Science 2021-02-24 Ngoc Duy Nguyen , Thanh Thi Nguyen , Doug Creighton , Saeid Nahavandi

Video Recognition has drawn great research interest and great progress has been made. A suitable frame sampling strategy can improve the accuracy and efficiency of recognition. However, mainstream solutions generally adopt hand-crafted…

Computer Vision and Pattern Recognition · Computer Science 2019-08-05 Wenhao Wu , Dongliang He , Xiao Tan , Shifeng Chen , Shilei Wen

We introduce a cutting-edge video compression framework tailored for the age of ubiquitous video data, uniquely designed to serve machine learning applications. Unlike traditional compression methods that prioritize human visual perception,…

Computer Vision and Pattern Recognition · Computer Science 2024-10-25 Huan Cui , Qing Li , Hanling Wang , Yong jiang

For many applications with limited computation, communication, storage and energy resources, there is an imperative need of computer vision methods that could select an informative subset of the input video for efficient processing at or…

Computer Vision and Pattern Recognition · Computer Science 2018-05-09 Shuyue Lan , Rameswar Panda , Qi Zhu , Amit K. Roy-Chowdhury

This paper proposes a novel problem: vision-based perception to learn and predict the collective dynamics of multi-agent systems, specifically focusing on interaction strength and convergence time. Multi-agent systems are defined as…

Multiagent Systems · Computer Science 2024-11-12 Minah Lee , Uday Kamal , Saibal Mukhopadhyay

With the advancement of AIGC, video frame interpolation (VFI) has become a crucial component in existing video generation frameworks, attracting widespread research interest. For the VFI task, the motion estimation between neighboring…

Computer Vision and Pattern Recognition · Computer Science 2024-08-05 Zhilin Huang , Yijie Yu , Ling Yang , Chujun Qin , Bing Zheng , Xiawu Zheng , Zikun Zhou , Yaowei Wang , Wenming Yang

One of the key challenges for multi-agent learning is scalability. In this paper, we introduce a technique for speeding up multi-agent learning by exploiting concurrent and incremental experience sharing. This solution adaptively identifies…

Multiagent Systems · Computer Science 2017-03-07 Dan Garant , Bruno da Silva , Victor Lesser , Chongjie Zhang

Robotic manipulation tasks often rely on static cameras for perception, which can limit flexibility, particularly in scenarios like robotic surgery and cluttered environments where mounting static cameras is impractical. Ideally, robots…

Robotics · Computer Science 2025-09-18 Xiatao Sun , Francis Fan , Yinxing Chen , Daniel Rakita

This paper presents an integrated multi-agents architecture for indexing and retrieving video information.The focus of our work is to elaborate an extensible approach that gathers a priori almost of the mandatory tools which palliate to the…

Information Retrieval · Computer Science 2014-08-01 Yasser El Madani El Alami , El Habib Nfaoui , Omar El Beqqali

Robotic manipulation requires understanding both the 3D spatial structure of the environment and its temporal evolution, yet most existing policies overlook one or both. They typically rely on 2D visual observations and backbones pretrained…

In this paper, a distributed velocity-constrained consensus problem is studied for discrete-time multi-agent systems, where each agent's velocity is constrained to lie in a nonconvex set. A distributed constrained control algorithm is…

Optimization and Control · Mathematics 2020-03-05 Peng Lin , Wei Ren , Huijun Gao

The objective of meta-learning is to exploit the knowledge obtained from observed tasks to improve adaptation to unseen tasks. As such, meta-learners are able to generalize better when they are trained with a larger number of observed tasks…

Machine Learning · Computer Science 2022-10-11 Mert Kayaalp , Stefan Vlaski , Ali H. Sayed

Multi-modal large language models (MLLMs) advance vision language understanding but face inherent limitations in long-video tasks due to bounded perception context budgets. Existing agentic methods mitigate this via rule-based…

Computer Vision and Pattern Recognition · Computer Science 2026-05-04 Kerui Chen , Jinglu Wang , Jianrong Zhang , Ming Li , Yan Lu , Hehe Fan

Long-form video understanding remains challenging for Vision-Language Models (VLMs) due to the inherent tension between computational constraints and the need to capture information distributed across thousands of frames. Existing…

Computer Vision and Pattern Recognition · Computer Science 2026-02-05 Junbo Zou , Ziheng Huang , Shengjie Zhang , Liwen Zhang , Weining Shen

The growing demand on high-quality and low-latency multimedia services has led to much interest in edge caching techniques. Motivated by this, we in this paper consider edge caching at the base stations with unknown content popularity…

Information Theory · Computer Science 2019-05-15 Chen Zhong , M. Cenk Gursoy , Senem Velipasalar

Video question-answering is a fundamental task in the field of video understanding. Although current vision--language models (VLMs) equipped with Video Transformers have enabled temporal modeling and yielded superior results, they are at…

Computer Vision and Pattern Recognition · Computer Science 2024-04-02 Wei Han , Hui Chen , Min-Yen Kan , Soujanya Poria
‹ Prev 1 2 3 10 Next ›