Related papers: Distributed Multi-agent Video Fast-forwarding

Collaborative Multi-Agent Video Fast-Forwarding

Multi-agent applications have recently gained significant popularity. In many computer vision tasks, a network of agents, such as a team of robots with cameras, could work collaboratively to perceive the environment for efficient and…

Computer Vision and Pattern Recognition · Computer Science 2023-05-30 Shuyue Lan , Zhilu Wang , Ermin Wei , Amit K. Roy-Chowdhury , Qi Zhu

Adaptive Video Understanding Agent: Enhancing efficiency with dynamic frame sampling and feedback-driven reasoning

Understanding long-form video content presents significant challenges due to its temporal complexity and the substantial computational resources required. In this work, we propose an agent-based approach to enhance both the efficiency and…

Computer Vision and Pattern Recognition · Computer Science 2024-10-29 Sullam Jeoung , Goeric Huybrechts , Bhavana Ganesh , Aram Galstyan , Sravan Bodapati

Distributed NeRF Learning for Collaborative Multi-Robot Perception

Effective environment perception is crucial for enabling downstream robotic applications. Individual robotic agents often face occlusion and limited visibility issues, whereas multi-agent systems can offer a more comprehensive mapping of…

Robotics · Computer Science 2024-10-01 Hongrui Zhao , Boris Ivanovic , Negar Mehr

Relational Forward Models for Multi-Agent Learning

The behavioral dynamics of multi-agent systems have a rich and orderly structure, which can be leveraged to understand these systems, and to improve how artificial agents learn to operate in them. Here we introduce Relational Forward Models…

Machine Learning · Computer Science 2018-10-01 Andrea Tacchetti , H. Francis Song , Pedro A. M. Mediano , Vinicius Zambaldi , Neil C. Rabinowitz , Thore Graepel , Matthew Botvinick , Peter W. Battaglia

A Visual Communication Map for Multi-Agent Deep Reinforcement Learning

Deep reinforcement learning has been applied successfully to solve various real-world problems and the number of its applications in the multi-agent settings has been increasing. Multi-agent learning distinctly poses significant challenges…

Machine Learning · Computer Science 2021-02-24 Ngoc Duy Nguyen , Thanh Thi Nguyen , Doug Creighton , Saeid Nahavandi

Multi-Agent Reinforcement Learning Based Frame Sampling for Effective Untrimmed Video Recognition

Video Recognition has drawn great research interest and great progress has been made. A suitable frame sampling strategy can improve the accuracy and efficiency of recognition. However, mainstream solutions generally adopt hand-crafted…

Computer Vision and Pattern Recognition · Computer Science 2019-08-05 Wenhao Wu , Dongliang He , Xiao Tan , Shifeng Chen , Shilei Wen

DMVC: Multi-Camera Video Compression Network aimed at Improving Deep Learning Accuracy

We introduce a cutting-edge video compression framework tailored for the age of ubiquitous video data, uniquely designed to serve machine learning applications. Unlike traditional compression methods that prioritize human visual perception,…

Computer Vision and Pattern Recognition · Computer Science 2024-10-25 Huan Cui , Qing Li , Hanling Wang , Yong jiang

FFNet: Video Fast-Forwarding via Reinforcement Learning

For many applications with limited computation, communication, storage and energy resources, there is an imperative need of computer vision methods that could select an informative subset of the input video for efficient processing at or…

Computer Vision and Pattern Recognition · Computer Science 2018-05-09 Shuyue Lan , Rameswar Panda , Qi Zhu , Amit K. Roy-Chowdhury

Learning Collective Dynamics of Multi-Agent Systems using Event-based Vision

This paper proposes a novel problem: vision-based perception to learn and predict the collective dynamics of multi-agent systems, specifically focusing on interaction strength and convergence time. Multi-agent systems are defined as…

Multiagent Systems · Computer Science 2024-11-12 Minah Lee , Uday Kamal , Saibal Mukhopadhyay

Motion-aware Latent Diffusion Models for Video Frame Interpolation

With the advancement of AIGC, video frame interpolation (VFI) has become a crucial component in existing video generation frameworks, attracting widespread research interest. For the VFI task, the motion estimation between neighboring…

Computer Vision and Pattern Recognition · Computer Science 2024-08-05 Zhilin Huang , Yijie Yu , Ling Yang , Chujun Qin , Bing Zheng , Xiawu Zheng , Zikun Zhou , Yaowei Wang , Wenming Yang

Context-Based Concurrent Experience Sharing in Multiagent Systems

One of the key challenges for multi-agent learning is scalability. In this paper, we introduce a technique for speeding up multi-agent learning by exploiting concurrent and incremental experience sharing. This solution adaptively identifies…

Multiagent Systems · Computer Science 2017-03-07 Dan Garant , Bruno da Silva , Victor Lesser , Chongjie Zhang

Optimizing Active Perception for Learning Simultaneous Viewpoint Selection and Manipulation with Diffusion Policy

Robotic manipulation tasks often rely on static cameras for perception, which can limit flexibility, particularly in scenarios like robotic surgery and cluttered environments where mounting static cameras is impractical. Ideally, robots…

Robotics · Computer Science 2025-09-18 Xiatao Sun , Francis Fan , Yinxing Chen , Daniel Rakita

Multi-agents Architecture for Semantic Retrieving Video in Distributed Environment

This paper presents an integrated multi-agents architecture for indexing and retrieving video information.The focus of our work is to elaborate an extensible approach that gathers a priori almost of the mandatory tools which palliate to the…

Information Retrieval · Computer Science 2014-08-01 Yasser El Madani El Alami , El Habib Nfaoui , Omar El Beqqali

Multi-View Video Diffusion Policy: A 3D Spatio-Temporal-Aware Video Action Model

Robotic manipulation requires understanding both the 3D spatial structure of the environment and its temporal evolution, yet most existing policies overlook one or both. They typically rely on 2D visual observations and backbones pretrained…

Robotics · Computer Science 2026-04-06 Peiyan Li , Yixiang Chen , Yuan Xu , Jiabing Yang , Xiangnan Wu , Jun Guo , Nan Sun , Long Qian , Xinghang Li , Xin Xiao , Jing Liu , Nianfeng Liu , Tao Kong , Yan Huang , Liang Wang , Tieniu Tan

Distributed velocity-constrained consensus of discrete-time multi-agent systems with nonconvex constraints, switching topologies, and delays

In this paper, a distributed velocity-constrained consensus problem is studied for discrete-time multi-agent systems, where each agent's velocity is constrained to lie in a nonconvex set. A distributed constrained control algorithm is…

Optimization and Control · Mathematics 2020-03-05 Peng Lin , Wei Ren , Huijun Gao

Dif-MAML: Decentralized Multi-Agent Meta-Learning

The objective of meta-learning is to exploit the knowledge obtained from observed tasks to improve adaptation to unseen tasks. As such, meta-learners are able to generalize better when they are trained with a larger number of observed tasks…

Machine Learning · Computer Science 2022-10-11 Mert Kayaalp , Stefan Vlaski , Ali H. Sayed

Scaling Video Understanding via Compact Latent Multi-Agent Collaboration

Multi-modal large language models (MLLMs) advance vision language understanding but face inherent limitations in long-video tasks due to bounded perception context budgets. Existing agentic methods mitigate this via rule-based…

Computer Vision and Pattern Recognition · Computer Science 2026-05-04 Kerui Chen , Jinglu Wang , Jianrong Zhang , Ming Li , Yan Lu , Hehe Fan

VideoBrain: Learning Adaptive Frame Sampling for Long Video Understanding

Long-form video understanding remains challenging for Vision-Language Models (VLMs) due to the inherent tension between computational constraints and the need to capture information distributed across thousands of frames. Existing…

Computer Vision and Pattern Recognition · Computer Science 2026-02-05 Junbo Zou , Ziheng Huang , Shengjie Zhang , Liwen Zhang , Weining Shen

Deep Multi-Agent Reinforcement Learning Based Cooperative Edge Caching in Wireless Networks

The growing demand on high-quality and low-latency multimedia services has led to much interest in edge caching techniques. Motivated by this, we in this paper consider edge caching at the base stations with unknown content popularity…

Information Theory · Computer Science 2019-05-15 Chen Zhong , M. Cenk Gursoy , Senem Velipasalar

Self-Adaptive Sampling for Efficient Video Question-Answering on Image--Text Models

Video question-answering is a fundamental task in the field of video understanding. Although current vision--language models (VLMs) equipped with Video Transformers have enabled temporal modeling and yielded superior results, they are at…

Computer Vision and Pattern Recognition · Computer Science 2024-04-02 Wei Han , Hui Chen , Min-Yen Kan , Soujanya Poria