Related papers: MOSES: A Streaming Algorithm for Linear Dimensiona…

StreamMOS: Streaming Moving Object Segmentation with Multi-View Perception and Dual-Span Memory

Moving object segmentation based on LiDAR is a crucial and challenging task for autonomous driving and mobile robotics. Most approaches explore spatio-temporal information from LiDAR sequences to predict moving objects in the current frame.…

Computer Vision and Pattern Recognition · Computer Science 2024-12-12 Zhiheng Li, Yubo Cui, Jiexi Zhong, Zheng Fang

Segment Anything Across Shots: A Method and Benchmark

This work focuses on multi-shot semi-supervised video object segmentation (MVOS), which aims at segmenting the target object indicated by an initial mask throughout a video with multiple shots. The existing VOS methods mainly focus on…

Computer Vision and Pattern Recognition · Computer Science 2025-11-18 Hengrui Hu, Kaining Ying, Henghui Ding

Streamed Learning: One-Pass SVMs

We present a streaming model for large-scale classification (in the context of $\ell_2$-SVM) by leveraging connections between learning and computational geometry. The streaming model imposes the constraint that only a single pass over the…

Machine Learning · Computer Science 2009-08-06 Piyush Rai, Hal Daumé, Suresh Venkatasubramanian

MOSE: A New Dataset for Video Object Segmentation in Complex Scenes

Video object segmentation (VOS) aims at segmenting a particular object throughout the entire video clip sequence. The state-of-the-art VOS methods have achieved excellent performance (e.g., 90+% J&F) on existing datasets. However, since the…

Computer Vision and Pattern Recognition · Computer Science 2023-10-24 Henghui Ding, Chang Liu, Shuting He, Xudong Jiang, Philip H. S. Torr, Song Bai

MOOSE: Pay Attention to Temporal Dynamics for Video Understanding via Optical Flows

Many motion-centric video analysis tasks, such as atomic actions, detecting atypical motor behavior in individuals with autism, or analyzing articulatory motion in real-time MRI of human speech, require efficient and interpretable temporal…

Computer Vision and Pattern Recognition · Computer Science 2025-06-03 Hong Nguyen, Dung Tran, Hieu Hoang, Phong Nguyen, Shrikanth Narayanan

Video Object Segmentation with Dynamic Query Modulation

Storing intermediate frame segmentations as memory for long-range context modeling, spatial-temporal memory-based methods have recently showcased impressive results in semi-supervised video object segmentation (SVOS). However, these methods…

Computer Vision and Pattern Recognition · Computer Science 2024-03-19 Hantao Zhou, Runze Hu, Xiu Li

Stochastic Multidimensional Scaling

Multidimensional scaling (MDS) is a popular dimensionality reduction technique that has been widely used for network visualization and cooperative localization. However, the traditional stress minimization formulation of MDS necessitates…

Optimization and Control · Mathematics 2016-12-22 Ketan Rajawat, Sandeep Kumar

SpVOS: Efficient Video Object Segmentation with Triple Sparse Convolution

Semi-supervised video object segmentation (Semi-VOS), which requires only annotating the first frame of a video to segment future frames, has received increased attention recently. Among existing pipelines, the memory-matching-based one is…

Computer Vision and Pattern Recognition · Computer Science 2023-10-24 Weihao Lin, Tao Chen, Chong Yu

READMem: Robust Embedding Association for a Diverse Memory in Unconstrained Video Object Segmentation

We present READMem (Robust Embedding Association for a Diverse Memory), a modular framework for semi-automatic video object segmentation (sVOS) methods designed to handle unconstrained videos. Contemporary sVOS works typically aggregate…

Computer Vision and Pattern Recognition · Computer Science 2023-12-05 Stéphane Vujasinović, Sebastian Bullinger, Stefan Becker, Norbert Scherer-Negenborn, Michael Arens, Rainer Stiefelhagen

TransVOS: Video Object Segmentation with Transformers

Recently, Space-Time Memory Network (STM) based methods have achieved state-of-the-art performance in semi-supervised video object segmentation (VOS). A crucial problem in this task is how to model the dependency both among different frames…

Computer Vision and Pattern Recognition · Computer Science 2021-09-21 Jianbiao Mei, Mengmeng Wang, Yeneng Lin, Yi Yuan, Yong Liu

Exploring High-Order Self-Similarity for Video Understanding

Space-time self-similarity (STSS), which captures visual correspondences across frames, provides an effective way to represent temporal dynamics for video understanding. In this work, we explore higher-order STSS and demonstrate how STSSs…

Computer Vision and Pattern Recognition · Computer Science 2026-04-23 Manjin Kim, Heeseung Kwon, Karteek Alahari, Minsu Cho

Self-Supervised Deep Visual Odometry with Online Adaptation

Self-supervised VO methods have shown great success in jointly estimating camera pose and depth from videos. However, like most data-driven methods, existing VO networks suffer from a notable decrease in performance when confronted with…

Computer Vision and Pattern Recognition · Computer Science 2020-05-14 Shunkai Li, Xin Wang, Yingdian Cao, Fei Xue, Zike Yan, Hongbin Zha

Sparse Coresets for SVD on Infinite Streams

In streaming Singular Value Decomposition (SVD), $d$-dimensional rows of a possibly infinite matrix arrive sequentially as points in $\mathbb{R}^d$. An $\epsilon$-coreset is a (much smaller) matrix whose sum of square distances of the rows…

Data Structures and Algorithms · Computer Science 2020-11-30 Vladimir Braverman, Dan Feldman, Harry Lang, Daniela Rus, Adiel Statman

Discriminative Spatial-Semantic VOS Solution: 1st Place Solution for 6th LSVOS

Video object segmentation (VOS) is a crucial task in computer vision, but current VOS methods struggle with complex scenes and prolonged object motions. To address these challenges, the MOSE dataset aims to enhance object recognition and…

Computer Vision and Pattern Recognition · Computer Science 2024-08-30 Deshui Miao, Yameng Gu, Xin Li, Zhenyu He, Yaowei Wang, Ming-Hsuan Yang

SWEM: Towards Real-Time Video Object Segmentation with Sequential Weighted Expectation-Maximization

Matching-based methods, especially those based on space-time memory, are significantly ahead of other solutions in semi-supervised video object segmentation (VOS). However, continuously growing and redundant template features lead to an…

Computer Vision and Pattern Recognition · Computer Science 2022-08-23 Zhihui Lin, Tianyu Yang, Maomao Li, Ziyu Wang, Chun Yuan, Wenhao Jiang, Wei Liu

Dynamic Structure Embedded Online Multiple-Output Regression for Stream Data

Online multiple-output regression is an important machine learning technique for modeling, predicting, and compressing multi-dimensional correlated data streams. In this paper, we propose a novel online multiple-output regression method,…

Machine Learning · Computer Science 2015-09-09 Changsheng Li, Fan Wei, Weishan Dong, Qingshan Liu, Xiangfeng Wang, Xin Zhang

Novel Factorization Strategies for Higher Order Tensors: Implications for Compression and Recovery of Multi-linear Data

In this paper we propose novel methods for compression and recovery of multilinear data under limited sampling. We exploit the recently proposed tensor-Singular Value Decomposition (t-SVD) [1], which is a group theoretic framework for…

Information Theory · Computer Science 2013-11-01 Zemin Zhang, Gregory Ely, Shuchin Aeron, Ning Hao, Misha Kilmer

MOVE: Motion-Guided Few-Shot Video Object Segmentation

This work addresses motion-guided few-shot video object segmentation (FSVOS), which aims to segment dynamic objects in videos based on a few annotated examples with the same motion patterns. Existing FSVOS datasets and methods typically…

Computer Vision and Pattern Recognition · Computer Science 2025-07-30 Kaining Ying, Hengrui Hu, Henghui Ding

Accurate Streaming Support Vector Machines

A widely-used tool for binary classification is the Support Vector Machine (SVM), a supervised learning technique that finds the "maximum margin" linear separator between the two classes. While SVMs have been well studied in the batch…

Machine Learning · Computer Science 2014-12-09 Vikram Nathan, Sharath Raghvendra

Efficient Principal Subspace Projection of Streaming Data Through Fast Similarity Matching

Big data problems frequently require processing datasets in a streaming fashion, either because all data are available at once but collectively are larger than available memory or because the data intrinsically arrive one data point at a…

Computation · Statistics 2018-08-08 Andrea Giovannucci, Victor Minden, Cengiz Pehlevan, Dmitri B. Chklovskii