Related papers: Learning from One Continuous Video Stream

Streaming LifeLong Learning With Any-Time Inference

Despite rapid advancements in lifelong learning (LLL) research, a large body of research mainly focuses on improving the performance in the existing \textit{static} continual learning (CL) setups. These methods lack the ability to succeed…

Machine Learning · Computer Science 2023-01-30 Soumya Banerjee , Vinay Kumar Verma , Vinay P. Namboodiri

Active Mining of Parallel Video Streams

The practicality of a video surveillance system is adversely limited by the amount of queries that can be placed on human resources and their vigilance in response. To transcend this limitation, a major effort under way is to include…

Computer Vision and Pattern Recognition · Computer Science 2014-05-16 Samaneh Khoshrou , Jaime S. Cardoso , Luis F. Teixeira

Learning from Snapshots of Discrete and Continuous Data Streams

Imagine a smart camera trap selectively clicking pictures to understand animal movement patterns within a particular habitat. These "snapshots", or pieces of data captured from a data stream at adaptively chosen times, provide a glimpse of…

Machine Learning · Computer Science 2024-12-10 Pramith Devulapalli , Steve Hanneke

Learning from Streaming Video with Orthogonal Gradients

We address the challenge of representation learning from a continuous stream of video as input, in a self-supervised manner. This differs from the standard approaches to video learning where videos are chopped and shuffled during training…

Computer Vision and Pattern Recognition · Computer Science 2025-04-03 Tengda Han , Dilara Gokay , Joseph Heyward , Chuhan Zhang , Daniel Zoran , Viorica Pătrăucean , João Carreira , Dima Damen , Andrew Zisserman

Active Learning for Online Recognition of Human Activities from Streaming Videos

Recognising human activities from streaming videos poses unique challenges to learning algorithms: predictive models need to be scalable, incrementally trainable, and must remain bounded in size even when the data stream is arbitrarily…

Machine Learning · Statistics 2016-10-06 Rocco De Rosa , Ilaria Gori , Fabio Cuzzolin , Barbara Caputo , Nicolò Cesa-Bianchi

Lifelong Learning of Video Diffusion Models From a Single Video Stream

This work demonstrates that training autoregressive video diffusion models from a single video stream$\unicode{x2013}$resembling the experience of embodied agents$\unicode{x2013}$is not only possible, but can also be as effective as…

Computer Vision and Pattern Recognition · Computer Science 2025-07-02 Jason Yoo , Yingchen He , Saeid Naderiparizi , Dylan Green , Gido M. van de Ven , Geoff Pleiss , Frank Wood

Stochastic Coherence Over Attention Trajectory For Continuous Learning In Video Streams

Devising intelligent agents able to live in an environment and learn by observing the surroundings is a longstanding goal of Artificial Intelligence. From a bare Machine Learning perspective, challenges arise when the agent is prevented…

Computer Vision and Pattern Recognition · Computer Science 2022-04-27 Matteo Tiezzi , Simone Marullo , Lapo Faggi , Enrico Meloni , Alessandro Betti , Stefano Melacci

Scene-Adaptive Video Frame Interpolation via Meta-Learning

Video frame interpolation is a challenging problem because there are different scenarios for each video depending on the variety of foreground and background motion, frame rate, and occlusion. It is therefore difficult for a single network…

Computer Vision and Pattern Recognition · Computer Science 2020-04-03 Myungsub Choi , Janghoon Choi , Sungyong Baik , Tae Hyun Kim , Kyoung Mu Lee

Fusing Multi-Stream Deep Networks for Video Classification

This paper studies deep network architectures to address the problem of video classification. A multi-stream framework is proposed to fully utilize the rich multimodal information in videos. Specifically, we first train three Convolutional…

Computer Vision and Pattern Recognition · Computer Science 2015-11-12 Zuxuan Wu , Yu-Gang Jiang , Xi Wang , Hao Ye , Xiangyang Xue , Jun Wang

Unsupervised Bi-directional Flow-based Video Generation from one Snapshot

Imagining multiple consecutive frames given one single snapshot is challenging, since it is difficult to simultaneously predict diverse motions from a single image and faithfully generate novel frames without visual distortions. In this…

Computer Vision and Pattern Recognition · Computer Science 2019-03-05 Lu Sheng , Junting Pan , Jiaming Guo , Jing Shao , Xiaogang Wang , Chen Change Loy

Lambda Learner: Fast Incremental Learning on Data Streams

One of the most well-established applications of machine learning is in deciding what content to show website visitors. When observation data comes from high-velocity, user-generated data streams, machine learning methods perform a…

Machine Learning · Computer Science 2021-06-29 Rohan Ramanath , Konstantin Salomatin , Jeffrey D. Gee , Kirill Talanine , Onkar Dalal , Gungor Polatkan , Sara Smoot , Deepak Kumar

Video Scene Parsing with Predictive Feature Learning

In this work, we address the challenging video scene parsing problem by developing effective representation learning methods given limited parsing annotations. In particular, we contribute two novel methods that constitute a unified parsing…

Computer Vision and Pattern Recognition · Computer Science 2016-12-14 Xiaojie Jin , Xin Li , Huaxin Xiao , Xiaohui Shen , Zhe Lin , Jimei Yang , Yunpeng Chen , Jian Dong , Luoqi Liu , Zequn Jie , Jiashi Feng , Shuicheng Yan

Coreset-Based Adaptive Tracking

We propose a method for learning from streaming visual data using a compact, constant size representation of all the data that was seen until a given moment. Specifically, we construct a 'coreset' representation of streaming data using a…

Computer Vision and Pattern Recognition · Computer Science 2015-11-20 Abhimanyu Dubey , Nikhil Naik , Dan Raviv , Rahul Sukthankar , Ramesh Raskar

Learning Streaming Video Representation via Multitask Training

Understanding continuous video streams plays a fundamental role in real-time applications including embodied AI and autonomous driving. Unlike offline video understanding, streaming video understanding requires the ability to process video…

Computer Vision and Pattern Recognition · Computer Science 2025-07-23 Yibin Yan , Jilan Xu , Shangzhe Di , Yikun Liu , Yudi Shi , Qirui Chen , Zeqian Li , Yifei Huang , Weidi Xie

Beyond Short Clips: End-to-End Video-Level Learning with Collaborative Memories

The standard way of training video models entails sampling at each iteration a single clip from a video and optimizing the clip prediction with respect to the video-level label. We argue that a single clip may not have enough temporal…

Computer Vision and Pattern Recognition · Computer Science 2021-04-06 Xitong Yang , Haoqi Fan , Lorenzo Torresani , Larry Davis , Heng Wang

Continual Prototype Evolution: Learning Online from Non-Stationary Data Streams

Attaining prototypical features to represent class distributions is well established in representation learning. However, learning prototypes online from streaming data proves a challenging endeavor as they rapidly become outdated, caused…

Computer Vision and Pattern Recognition · Computer Science 2021-10-20 Matthias De Lange , Tinne Tuytelaars

Multi-model learning by sequential reading of untrimmed videos for action recognition

We propose a new method for learning videos by aggregating multiple models by sequentially extracting video clips from untrimmed video. The proposed method reduces the correlation between clips by feeding clips to multiple models in turn…

Computer Vision and Pattern Recognition · Computer Science 2024-01-29 Kodai Kamiya , Toru Tamaki

A Framework for On-line Learning of Underwater Vehicles Dynamic Models

Learning the dynamics of robots from data can help achieve more accurate tracking controllers, or aid their navigation algorithms. However, when the actual dynamics of the robots change due to external conditions, on-line adaptation of…

Robotics · Computer Science 2019-03-14 Bilal Wehbe , Marc Hildebrandt , Frank Kirchner

Revealing Single Frame Bias for Video-and-Language Learning

Training an effective video-and-language model intuitively requires multiple frames as model inputs. However, it is unclear whether using multiple frames is beneficial to downstream tasks, and if yes, whether the performance gain is worth…

Computer Vision and Pattern Recognition · Computer Science 2022-06-08 Jie Lei , Tamara L. Berg , Mohit Bansal

Continuous Video Process: Modeling Videos as Continuous Multi-Dimensional Processes for Video Prediction

Diffusion models have made significant strides in image generation, mastering tasks such as unconditional image synthesis, text-image translation, and image-to-image conversions. However, their capability falls short in the realm of video…

Computer Vision and Pattern Recognition · Computer Science 2024-12-10 Gaurav Shrivastava , Abhinav Shrivastava