Related papers: Scene Separation & Data Selection: Temporal Segmen…

MCDS-VSS: Moving Camera Dynamic Scene Video Semantic Segmentation by Filtering with Self-Supervised Geometry and Motion

Autonomous systems, such as self-driving cars, rely on reliable semantic environment perception for decision making. Despite great advances in video semantic segmentation, existing approaches ignore important inductive biases and lack…

Computer Vision and Pattern Recognition · Computer Science 2024-09-06 Angel Villar-Corrales , Moritz Austermann , Sven Behnke

Video Classification With CNNs: Using The Codec As A Spatio-Temporal Activity Sensor

We investigate video classification via a two-stream convolutional neural network (CNN) design that directly ingests information extracted from compressed video bitstreams. Our approach begins with the observation that all modern video…

Computer Vision and Pattern Recognition · Computer Science 2017-12-21 Aaron Chadha , Alhabib Abbas , Yiannis Andreopoulos

D3D: Distilled 3D Networks for Video Action Recognition

State-of-the-art methods for video action recognition commonly use an ensemble of two networks: the spatial stream, which takes RGB frames as input, and the temporal stream, which takes optical flow as input. In recent work, both of these…

Computer Vision and Pattern Recognition · Computer Science 2019-02-07 Jonathan C. Stroud , David A. Ross , Chen Sun , Jia Deng , Rahul Sukthankar

Automatic video scene segmentation based on spatial-temporal clues and rhythm

With ever increasing computing power and data storage capacity, the potential for large digital video libraries is growing rapidly.However, the massive use of video for the moment is limited by its opaque characteristics. Indeed, a user who…

Computer Vision and Pattern Recognition · Computer Science 2014-12-16 Walid Mahdi , Liming Chen , Mohsen Ardebilian

Two-stream Multi-dimensional Convolutional Network for Real-time Violence Detection

The increasing number of surveillance cameras and security concerns have made automatic violent activity detection from surveillance footage an active area for research. Modern deep learning methods have achieved good accuracy in violence…

Computer Vision and Pattern Recognition · Computer Science 2022-11-09 Dipon Kumar Ghosh , Amitabha Chakrabarty

Scene Detection Policies and Keyframe Extraction Strategies for Large-Scale Video Analysis

Robust scene segmentation and keyframe extraction are essential preprocessing steps in video understanding pipelines, supporting tasks such as indexing, summarization, and semantic retrieval. However, existing methods often lack…

Computer Vision and Pattern Recognition · Computer Science 2025-06-03 Vasilii Korolkov

Automatic Video Object Segmentation via Motion-Appearance-Stream Fusion and Instance-aware Segmentation

This paper presents a method for automatic video object segmentation based on the fusion of motion stream, appearance stream, and instance-aware segmentation. The proposed scheme consists of a two-stream fusion network and an instance…

Computer Vision and Pattern Recognition · Computer Science 2019-12-04 Sungkwon Choo , Wonkyo Seo , Nam Ik Cho

4DSegStreamer: Streaming 4D Panoptic Segmentation via Dual Threads

4D panoptic segmentation in a streaming setting is critical for highly dynamic environments, such as evacuating dense crowds and autonomous driving in complex scenarios, where real-time, fine-grained perception within a constrained time…

Computer Vision and Pattern Recognition · Computer Science 2025-10-21 Ling Liu , Jun Tian , Li Yi

Coarse-Fine Networks for Temporal Activity Detection in Videos

In this paper, we introduce Coarse-Fine Networks, a two-stream architecture which benefits from different abstractions of temporal resolution to learn better video representations for long-term motion. Traditional Video models process…

Computer Vision and Pattern Recognition · Computer Science 2021-04-02 Kumara Kahatapitiya , Michael S. Ryoo

Temporal Distinct Representation Learning for Action Recognition

Motivated by the previous success of Two-Dimensional Convolutional Neural Network (2D CNN) on image recognition, researchers endeavor to leverage it to characterize videos. However, one limitation of applying 2D CNN to analyze videos is…

Computer Vision and Pattern Recognition · Computer Science 2020-07-16 Junwu Weng , Donghao Luo , Yabiao Wang , Ying Tai , Chengjie Wang , Jilin Li , Feiyue Huang , Xudong Jiang , Junsong Yuan

Raising the ClaSS of Streaming Time Series Segmentation

Ubiquitous sensors today emit high frequency streams of numerical measurements that reflect properties of human, animal, industrial, commercial, and natural processes. Shifts in such processes, e.g. caused by external events or internal…

Machine Learning · Computer Science 2025-04-04 Arik Ermshaus , Patrick Schäfer , Ulf Leser

Capturing Temporal Information in a Single Frame: Channel Sampling Strategies for Action Recognition

We address the problem of capturing temporal information for video classification in 2D networks, without increasing their computational cost. Existing approaches focus on modifying the architecture of 2D networks (e.g. by including filters…

Computer Vision and Pattern Recognition · Computer Science 2022-10-11 Kiyoon Kim , Shreyank N Gowda , Oisin Mac Aodha , Laura Sevilla-Lara

A Deep Siamese Network for Scene Detection in Broadcast Videos

We present a model that automatically divides broadcast videos into coherent scenes by learning a distance measure between shots. Experiments are performed to demonstrate the effectiveness of our approach by comparing our algorithm against…

Computer Vision and Pattern Recognition · Computer Science 2015-11-02 Lorenzo Baraldi , Costantino Grana , Rita Cucchiara

Exploring Temporal Information for Improved Video Understanding

In this dissertation, I present my work towards exploring temporal information for better video understanding. Specifically, I have worked on two problems: action recognition and semantic segmentation. For action recognition, I have…

Computer Vision and Pattern Recognition · Computer Science 2019-05-28 Yi Zhu

Scene Consistency Representation Learning for Video Scene Segmentation

A long-term video, such as a movie or TV show, is composed of various scenes, each of which represents a series of shots sharing the same semantic story. Spotting the correct scene boundary from the long-term video is a challenging task,…

Computer Vision and Pattern Recognition · Computer Science 2022-05-12 Haoqian Wu , Keyu Chen , Yanan Luo , Ruizhi Qiao , Bo Ren , Haozhe Liu , Weicheng Xie , Linlin Shen

StreamMOS: Streaming Moving Object Segmentation with Multi-View Perception and Dual-Span Memory

Moving object segmentation based on LiDAR is a crucial and challenging task for autonomous driving and mobile robotics. Most approaches explore spatio-temporal information from LiDAR sequences to predict moving objects in the current frame.…

Computer Vision and Pattern Recognition · Computer Science 2024-12-12 Zhiheng Li , Yubo Cui , Jiexi Zhong , Zheng Fang

Dynamic Video Segmentation Network

In this paper, we present a detailed design of dynamic video segmentation network (DVSNet) for fast and efficient semantic video segmentation. DVSNet consists of two convolutional neural networks: a segmentation network and a flow network.…

Computer Vision and Pattern Recognition · Computer Science 2018-06-15 Yu-Syuan Xu , Tsu-Jui Fu , Hsuan-Kung Yang , Chun-Yi Lee

A Novel Two-Dimensional Smoothing Algorithm

Smoothing and filtering two-dimensional sequences are fundamental tasks in fields such as computer vision. Conventional filtering algorithms often rely on the selection of the filtering window, limiting their applicability in certain…

Information Theory · Computer Science 2025-07-22 Xufeng Chen , Liang Yan , Xiaoshan Gao

Value of Temporal Dynamics Information in Driving Scene Segmentation

Semantic scene segmentation has primarily been addressed by forming representations of single images both with supervised and unsupervised methods. The problem of semantic segmentation in dynamic scenes has begun to recently receive…

Computer Vision and Pattern Recognition · Computer Science 2019-04-02 Li Ding , Jack Terwilliger , Rini Sherony , Bryan Reimer , Lex Fridman

Unsupervised Change Detection in Satellite Images Using Convolutional Neural Networks

This paper proposes an efficient unsupervised method for detecting relevant changes between two temporally different images of the same scene. A convolutional neural network (CNN) for semantic segmentation is implemented to extract…

Neural and Evolutionary Computing · Computer Science 2019-03-22 Kevin Louis de Jong , Anna Sergeevna Bosman