Related papers: Shift-Memory Network for Temporal Scene Segmentati…

Continual Adaptation of Semantic Segmentation using Complementary 2D-3D Data Representations

Semantic segmentation networks are usually pre-trained once and not updated during deployment. As a consequence, misclassifications commonly occur if the distribution of the training data deviates from the one encountered during the robot's…

Robotics · Computer Science 2023-02-15 Jonas Frey , Hermann Blum , Francesco Milano , Roland Siegwart , Cesar Cadena

Fast-SCNN: Fast Semantic Segmentation Network

The encoder-decoder framework is state-of-the-art for offline semantic image segmentation. Since the rise in autonomous systems, real-time computation is increasingly desirable. In this paper, we introduce fast segmentation convolutional…

Computer Vision and Pattern Recognition · Computer Science 2019-02-13 Rudra P K Poudel , Stephan Liwicki , Roberto Cipolla

Spatial Sampling Network for Fast Scene Understanding

We propose a network architecture to perform efficient scene understanding. This work presents three main novelties: the first is an Improved Guided Upsampling Module that can replace in toto the decoder part in common semantic segmentation…

Computer Vision and Pattern Recognition · Computer Science 2019-05-23 Davide Mazzini , Raimondo Schettini

Enhanced Neuromorphic Semantic Segmentation Latency through Stream Event

Achieving optimal semantic segmentation with frame-based vision sensors poses significant challenges for real-time systems like UAVs and self-driving cars, which require rapid and precise processing. Traditional frame-based methods often…

Computer Vision and Pattern Recognition · Computer Science 2025-02-27 D. Hareb , J. Martinet , B. Miramond

Towards Accurate Scene Text Recognition with Semantic Reasoning Networks

Scene text image contains two levels of contents: visual texture and semantic information. Although the previous scene text recognition methods have made great progress over the past few years, the research on mining semantic information to…

Computer Vision and Pattern Recognition · Computer Science 2020-03-30 Deli Yu , Xuan Li , Chengquan Zhang , Junyu Han , Jingtuo Liu , Errui Ding

A Deep Semantic Segmentation Network with Semantic and Contextual Refinements

Semantic segmentation is a fundamental task in multimedia processing, which can be used for analyzing, understanding, editing contents of images and videos, among others. To accelerate the analysis of multimedia data, existing segmentation…

Computer Vision and Pattern Recognition · Computer Science 2024-12-13 Zhiyan Wang , Deyin Liu , Lin Yuanbo Wu , Song Wang , Xin Guo , Lin Qi

Real-Time Semantic Segmentation via Multiply Spatial Fusion Network

Real-time semantic segmentation plays a significant role in industry applications, such as autonomous driving, robotics and so on. It is a challenging task as both efficiency and performance need to be considered simultaneously. To address…

Computer Vision and Pattern Recognition · Computer Science 2019-11-19 Haiyang Si , Zhiqiang Zhang , Feifan Lv , Gang Yu , Feng Lu

Temporal Memory Attention for Video Semantic Segmentation

Video semantic segmentation requires to utilize the complex temporal relations between frames of the video sequence. Previous works usually exploit accurate optical flow to leverage the temporal relations, which suffer much from heavy…

Computer Vision and Pattern Recognition · Computer Science 2021-09-14 Hao Wang , Weining Wang , Jing Liu

Unsupervised Continual Semantic Adaptation through Neural Rendering

An increasing amount of applications rely on data-driven models that are deployed for perception tasks across a sequence of scenes. Due to the mismatch between training and deployment data, adapting the model on the new scenes is often…

Computer Vision and Pattern Recognition · Computer Science 2025-10-15 Zhizheng Liu , Francesco Milano , Jonas Frey , Roland Siegwart , Hermann Blum , Cesar Cadena

Dilated Temporal Fully-Convolutional Network for Semantic Segmentation of Motion Capture Data

Semantic segmentation of motion capture sequences plays a key part in many data-driven motion synthesis frameworks. It is a preprocessing step in which long recordings of motion capture sequences are partitioned into smaller segments.…

Computer Vision and Pattern Recognition · Computer Science 2018-07-17 Noshaba Cheema , Somayeh Hosseini , Janis Sprenger , Erik Herrmann , Han Du , Klaus Fischer , Philipp Slusallek

Video-SwinUNet: Spatio-temporal Deep Learning Framework for VFSS Instance Segmentation

This paper presents a deep learning framework for medical video segmentation. Convolution neural network (CNN) and transformer-based methods have achieved great milestones in medical image segmentation tasks due to their incredible semantic…

Computer Vision and Pattern Recognition · Computer Science 2024-02-13 Chengxi Zeng , Xinyu Yang , David Smithard , Majid Mirmehdi , Alberto M Gambaruto , Tilo Burghardt

Temporally Distributed Networks for Fast Video Semantic Segmentation

We present TDNet, a temporally distributed network designed for fast and accurate video semantic segmentation. We observe that features extracted from a certain high-level layer of a deep CNN can be approximated by composing features…

Computer Vision and Pattern Recognition · Computer Science 2020-04-08 Ping Hu , Fabian Caba Heilbron , Oliver Wang , Zhe Lin , Stan Sclaroff , Federico Perazzi

Spatial Memory for Context Reasoning in Object Detection

Modeling instance-level context and object-object relationships is extremely challenging. It requires reasoning about bounding boxes of different classes, locations \etc. Above all, instance-level spatial reasoning inherently requires…

Computer Vision and Pattern Recognition · Computer Science 2017-04-14 Xinlei Chen , Abhinav Gupta

STFCN: Spatio-Temporal FCN for Semantic Video Segmentation

This paper presents a novel method to involve both spatial and temporal features for semantic video segmentation. Current work on convolutional neural networks(CNNs) has shown that CNNs provide advanced spatial features supporting a very…

Computer Vision and Pattern Recognition · Computer Science 2016-09-05 Mohsen Fayyaz , Mohammad Hajizadeh Saffar , Mohammad Sabokrou , Mahmood Fathy , Reinhard Klette , Fay Huang

Exploring Frame Segmentation Networks for Temporal Action Localization

Temporal action localization is an important task of computer vision. Though many methods have been proposed, it still remains an open question how to predict the temporal location of action segments precisely. Most state-of-the-art works…

Computer Vision and Pattern Recognition · Computer Science 2019-02-15 Ke Yang , Xiaolong Shen , Peng Qiao , Shijie Li , Dongsheng Li , Yong Dou

TSM: Temporal Shift Module for Efficient and Scalable Video Understanding on Edge Device

The explosive growth in video streaming requires video understanding at high accuracy and low computation cost. Conventional 2D CNNs are computationally cheap but cannot capture temporal relationships; 3D CNN-based methods can achieve good…

Computer Vision and Pattern Recognition · Computer Science 2021-09-28 Ji Lin , Chuang Gan , Kuan Wang , Song Han

Local Memory Attention for Fast Video Semantic Segmentation

We propose a novel neural network module that transforms an existing single-frame semantic segmentation model into a video semantic segmentation pipeline. In contrast to prior works, we strive towards a simple, fast, and general module that…

Computer Vision and Pattern Recognition · Computer Science 2021-09-28 Matthieu Paul , Martin Danelljan , Luc Van Gool , Radu Timofte

Efficient Global-Local Memory for Real-time Instrument Segmentation of Robotic Surgical Video

Performing a real-time and accurate instrument segmentation from videos is of great significance for improving the performance of robotic-assisted surgery. We identify two important clues for surgical instrument perception, including local…

Computer Vision and Pattern Recognition · Computer Science 2021-09-29 Jiacheng Wang , Yueming Jin , Liansheng Wang , Shuntian Cai , Pheng-Ann Heng , Jing Qin

TSM: Temporal Shift Module for Efficient Video Understanding

The explosive growth in video streaming gives rise to challenges on performing video understanding at high accuracy and low computation cost. Conventional 2D CNNs are computationally cheap but cannot capture temporal relationships; 3D CNN…

Computer Vision and Pattern Recognition · Computer Science 2019-08-23 Ji Lin , Chuang Gan , Song Han

Efficient Semantic Video Segmentation with Per-frame Inference

For semantic segmentation, most existing real-time deep models trained with each frame independently may produce inconsistent results for a video sequence. Advanced methods take into considerations the correlations in the video sequence,…

Computer Vision and Pattern Recognition · Computer Science 2020-07-20 Yifan Liu , Chunhua Shen , Changqian Yu , Jingdong Wang