Related papers: Refining Pre-Trained Motion Models

Motion-Augmented Self-Training for Video Recognition at Smaller Scale

The goal of this paper is to self-train a 3D convolutional neural network on an unlabeled video collection for deployment on small-scale video collections. As smaller video datasets benefit more from motion than appearance, we strive to…

Computer Vision and Pattern Recognition · Computer Science 2021-05-05 Kirill Gavrilyuk , Mihir Jain , Ilia Karmanov , Cees G. M. Snoek

Enhancing Self-Training Methods

Semi-supervised learning approaches train on small sets of labeled data along with large sets of unlabeled data. Self-training is a semi-supervised teacher-student approach that often suffers from the problem of "confirmation bias" that…

Machine Learning · Computer Science 2023-01-19 Aswathnarayan Radhakrishnan , Jim Davis , Zachary Rabin , Benjamin Lewis , Matthew Scherreik , Roman Ilin

A Large-Scale Analysis on Self-Supervised Video Representation Learning

Self-supervised learning is an effective way for label-free model pre-training, especially in the video domain where labeling is expensive. Existing self-supervised works in the video domain use varying experimental setups to demonstrate…

Computer Vision and Pattern Recognition · Computer Science 2023-11-22 Akash Kumar , Ashlesha Kumar , Vibhav Vineet , Yogesh Singh Rawat

Real-World Point Tracking with Verifier-Guided Pseudo-Labeling

Models for long-term point tracking are typically trained on large synthetic datasets. The performance of these models degrades in real-world videos due to different characteristics and the absence of dense ground-truth annotations.…

Computer Vision and Pattern Recognition · Computer Science 2026-03-13 Görkay Aydemir , Fatma Güney , Weidi Xie

Improving Semantic Segmentation via Self-Training

Deep learning usually achieves the best results with complete supervision. In the case of semantic segmentation, this means that large amounts of pixelwise annotations are required to learn accurate models. In this paper, we show that we…

Computer Vision and Pattern Recognition · Computer Science 2020-05-07 Yi Zhu , Zhongyue Zhang , Chongruo Wu , Zhi Zhang , Tong He , Hang Zhang , R. Manmatha , Mu Li , Alexander Smola

Learning Velocity and Acceleration: Self-Supervised Motion Consistency for Pedestrian Trajectory Prediction

Understanding human motion is crucial for accurate pedestrian trajectory prediction. Conventional methods typically rely on supervised learning, where ground-truth labels are directly optimized against predicted trajectories. This amplifies…

Computer Vision and Pattern Recognition · Computer Science 2025-04-01 Yizhou Huang , Yihua Cheng , Kezhi Wang

Semi-Supervised Class-Agnostic Motion Prediction with Pseudo Label Regeneration and BEVMix

Class-agnostic motion prediction methods aim to comprehend motion within open-world scenarios, holding significance for autonomous driving systems. However, training a high-performance model in a fully-supervised manner always requires…

Computer Vision and Pattern Recognition · Computer Science 2023-12-15 Kewei Wang , Yizheng Wu , Zhiyu Pan , Xingyi Li , Ke Xian , Zhe Wang , Zhiguo Cao , Guosheng Lin

Track, Check, Repeat: An EM Approach to Unsupervised Tracking

We propose an unsupervised method for detecting and tracking moving objects in 3D, in unlabelled RGB-D videos. The method begins with classic handcrafted techniques for segmenting objects using motion cues: we estimate optical flow and…

Computer Vision and Pattern Recognition · Computer Science 2021-04-09 Adam W. Harley , Yiming Zuo , Jing Wen , Ayush Mangal , Shubhankar Potdar , Ritwick Chaudhry , Katerina Fragkiadaki

Self-Supervised Class-Agnostic Motion Prediction with Spatial and Temporal Consistency Regularizations

The perception of motion behavior in a dynamic environment holds significant importance for autonomous driving systems, wherein class-agnostic motion prediction methods directly predict the motion of the entire point cloud. While most…

Computer Vision and Pattern Recognition · Computer Science 2024-03-22 Kewei Wang , Yizheng Wu , Jun Cen , Zhiyu Pan , Xingyi Li , Zhe Wang , Zhiguo Cao , Guosheng Lin

Aligned Unsupervised Pretraining of Object Detectors with Self-training

The unsupervised pretraining of object detectors has recently become a key component of object detector training, as it leads to improved performance and faster convergence during the supervised fine-tuning stage. Existing unsupervised…

Computer Vision and Pattern Recognition · Computer Science 2024-07-09 Ioannis Maniadis Metaxas , Adrian Bulat , Ioannis Patras , Brais Martinez , Georgios Tzimiropoulos

Bootstrapped Self-Supervised Training with Monocular Video for Semantic Segmentation and Depth Estimation

For a robot deployed in the world, it is desirable to have the ability of autonomous learning to improve its initial pre-set knowledge. We formalize this as a bootstrapped self-supervised learning problem where a system is initially…

Computer Vision and Pattern Recognition · Computer Science 2021-08-03 Yihao Zhang , John J. Leonard

Self-Supervised Multi-Object Tracking with Cross-Input Consistency

In this paper, we propose a self-supervised learning procedure for training a robust multi-object tracking (MOT) model given only unlabeled video. While several self-supervisory learning signals have been proposed in prior work on…

Computer Vision and Pattern Recognition · Computer Science 2021-11-12 Favyen Bastani , Songtao He , Sam Madden

Meta Co-Training: Two Views are Better than One

In many critical computer vision scenarios unlabeled data is plentiful, but labels are scarce and difficult to obtain. As a result, semi-supervised learning which leverages unlabeled data to boost the performance of supervised classifiers…

Computer Vision and Pattern Recognition · Computer Science 2025-05-29 Jay C. Rothenberger , Dimitrios I. Diochnos

TrajSSL: Trajectory-Enhanced Semi-Supervised 3D Object Detection

Semi-supervised 3D object detection is a common strategy employed to circumvent the challenge of manually labeling large-scale autonomous driving perception datasets. Pseudo-labeling approaches to semi-supervised learning adopt a…

Computer Vision and Pattern Recognition · Computer Science 2024-09-18 Philip Jacobson , Yichen Xie , Mingyu Ding , Chenfeng Xu , Masayoshi Tomizuka , Wei Zhan , Ming C. Wu

Turning to a Teacher for Timestamp Supervised Temporal Action Segmentation

Temporal action segmentation in videos has drawn much attention recently. Timestamp supervision is a cost-effective way for this task. To obtain more information to optimize the model, the existing method generated pseudo frame-wise labels…

Computer Vision and Pattern Recognition · Computer Science 2022-12-14 Yang Zhao , Yan Song

Self-Loop Uncertainty: A Novel Pseudo-Label for Semi-Supervised Medical Image Segmentation

Witnessing the success of deep learning neural networks in natural image processing, an increasing number of studies have been proposed to develop deep-learning-based frameworks for medical image segmentation. However, since the pixel-wise…

Image and Video Processing · Electrical Eng. & Systems 2020-07-21 Yuexiang Li , Jiawei Chen , Xinpeng Xie , Kai Ma , Yefeng Zheng

Multiview Pseudo-Labeling for Semi-supervised Learning from Video

We present a multiview pseudo-labeling approach to video learning, a novel framework that uses complementary views in the form of appearance and motion information for semi-supervised learning in video. The complementary views help obtain…

Computer Vision and Pattern Recognition · Computer Science 2021-04-02 Bo Xiong , Haoqi Fan , Kristen Grauman , Christoph Feichtenhofer

Mining Supervision for Dynamic Regions in Self-Supervised Monocular Depth Estimation

This paper focuses on self-supervised monocular depth estimation in dynamic scenes trained on monocular videos. Existing methods jointly estimate pixel-wise depth and motion, relying mainly on an image reconstruction loss. Dynamic regions1…

Computer Vision and Pattern Recognition · Computer Science 2024-04-24 Hoang Chuong Nguyen , Tianyu Wang , Jose M. Alvarez , Miaomiao Liu

Boosting Supervised Learning Performance with Co-training

Deep learning perception models require a massive amount of labeled training data to achieve good performance. While unlabeled data is easy to acquire, the cost of labeling is prohibitive and could create a tremendous burden on companies or…

Computer Vision and Pattern Recognition · Computer Science 2021-11-19 Xinnan Du , William Zhang , Jose M. Alvarez

Self-Training: A Survey

Semi-supervised algorithms aim to learn prediction functions from a small set of labeled observations and a large set of unlabeled observations. Because this framework is relevant in many applications, they have received a lot of interest…

Machine Learning · Computer Science 2025-02-17 Massih-Reza Amini , Vasilii Feofanov , Loic Pauletto , Lies Hadjadj , Emilie Devijver , Yury Maximov