Related papers: Representation Alignment Contrastive Regularizatio…

Multi-Task Self-Supervised Time-Series Representation Learning

Time-series representation learning can extract representations from data with temporal dynamics and sparse labels. When labeled data are sparse but unlabeled data are abundant, contrastive learning, i.e., a framework to learn a latent…

Machine Learning · Computer Science 2023-03-03 Heejeong Choi, Pilsung Kang

Structure-preserving contrastive learning for spatial time series

The effectiveness of neural network models largely relies on learning meaningful latent patterns from data, where self-supervised learning of informative representations can enhance model performance and generalisability. However,…

Machine Learning · Computer Science 2025-10-28 Yiru Jiao, Sander van Cranenburgh, Simeon Calvert, Hans van Lint

Representation Learning via Global Temporal Alignment and Cycle-Consistency

We introduce a weakly supervised method for representation learning based on aligning temporal sequences (e.g., videos) of the same process (e.g., human action). The main idea is to use the global temporal ordering of latent correspondences…

Computer Vision and Pattern Recognition · Computer Science 2021-05-12 Isma Hadji, Konstantinos G. Derpanis, Allan D. Jepson

Towards Discriminative Representation: Multi-view Trajectory Contrastive Learning for Online Multi-object Tracking

Discriminative representation is crucial for the association step in multi-object tracking. Recent work mainly utilizes features in single or neighboring frames for constructing metric loss and empowering networks to extract representation…

Computer Vision and Pattern Recognition · Computer Science 2022-04-06 En Yu, Zhuoling Li, Shoudong Han

Metric Learning Driven Multi-Task Structured Output Optimization for Robust Keypoint Tracking

As an important and challenging problem in computer vision and graphics, keypoint-based object tracking is typically formulated in a spatio-temporal statistical learning framework. However, most existing keypoint trackers are incapable of…

Computer Vision and Pattern Recognition · Computer Science 2014-12-05 Liming Zhao, Xi Li, Jun Xiao, Fei Wu, Yueting Zhuang

Contrastive Training of Complex-Valued Autoencoders for Object Discovery

Current state-of-the-art object-centric models use slots and attention-based routing for binding. However, this class of models has several conceptual limitations: the number of slots is hardwired; all slots have equal capacity; training…

Machine Learning · Computer Science 2023-11-10 Aleksandar Stanić, Anand Gopalakrishnan, Kazuki Irie, Jürgen Schmidhuber

Self-Supervised Learning for Interventional Image Analytics: Towards Robust Device Trackers

An accurate detection and tracking of devices such as guiding catheters in live X-ray image acquisitions is an essential prerequisite for endovascular cardiac interventions. This information is leveraged for procedural guidance, e.g.,…

Computer Vision and Pattern Recognition · Computer Science 2024-05-03 Saahil Islam, Venkatesh N. Murthy, Dominik Neumann, Badhan Kumar Das, Puneet Sharma, Andreas Maier, Dorin Comaniciu, Florin C. Ghesu

Joint Spatial-Temporal and Appearance Modeling with Transformer for Multiple Object Tracking

The recent trend in multiple object tracking (MOT) is heading towards leveraging deep learning to boost the tracking performance. In this paper, we propose a novel solution named TransSTAM, which leverages Transformer to effectively model…

Computer Vision and Pattern Recognition · Computer Science 2022-06-01 Peng Dai, Yiqiang Feng, Renliang Weng, Changshui Zhang

A Simple Framework for Multi-mode Spatial-Temporal Data Modeling

Spatial-temporal data modeling aims to mine the underlying spatial relationships and temporal dependencies of objects in a system. However, most existing methods focus on the modeling of spatial-temporal data in a single mode, lacking the…

Machine Learning · Computer Science 2023-08-23 Zihang Liu, Le Yu, Tongyu Zhu, Leilei Sun

Spatial-Temporal Relation Networks for Multi-Object Tracking

Recent progress in multiple object tracking (MOT) has shown that a robust similarity score is key to the success of trackers. A good similarity score is expected to reflect multiple cues, e.g. appearance, location, and topology, over a long…

Computer Vision and Pattern Recognition · Computer Science 2019-04-26 Jiarui Xu, Yue Cao, Zheng Zhang, Han Hu

Temporal Consistency Objectives Regularize the Learning of Disentangled Representations

There has been an increasing focus in learning interpretable feature representations, particularly in applications such as medical image analysis that require explainability, whilst relying less on annotated data (since annotations can be…

Computer Vision and Pattern Recognition · Computer Science 2019-11-19 Gabriele Valvano, Agisilaos Chartsias, Andrea Leo, Sotirios A. Tsaftaris

Semi-TCL: Semi-Supervised Track Contrastive Representation Learning

Online tracking of multiple objects in videos requires strong capacity of modeling and matching object appearances. Previous methods for learning appearance embedding mostly rely on instance-level matching without considering the temporal…

Computer Vision and Pattern Recognition · Computer Science 2021-07-07 Wei Li, Yuanjun Xiong, Shuo Yang, Mingze Xu, Yongxin Wang, Wei Xia

Robust Estimation of Similarity Transformation for Visual Object Tracking

Most existing correlation filter-based tracking approaches only estimate simple axis-aligned bounding boxes, and very few of them are capable of recovering the underlying similarity transformation. To tackle this challenging problem, in…

Computer Vision and Pattern Recognition · Computer Science 2018-11-08 Yang Li, Jianke Zhu, Steven C. H. Hoi, Wenjie Song, Zhefeng Wang, Hantang Liu

Learning Less-Overlapping Representations

In representation learning (RL), how to make the learned representations easy to interpret and less overfitted to training data are two important but challenging issues. To address these problems, we study a new type of regularization…

Machine Learning · Computer Science 2017-11-28 Pengtao Xie, Hongbao Zhang, Eric P. Xing

Enforcing Template Representability and Temporal Consistency for Adaptive Sparse Tracking

Sparse representation has been widely studied in visual tracking, which has shown promising tracking performance. Despite a lot of progress, the visual tracking problem is still a challenging task due to appearance variations over time. In…

Computer Vision and Pattern Recognition · Computer Science 2016-05-03 Xue Yang, Fei Han, Hua Wang, Hao Zhang

Understanding Contrastive Representation Learning through Alignment and Uniformity on the Hypersphere

Contrastive representation learning has been outstandingly successful in practice. In this work, we identify two key properties related to the contrastive loss: (1) alignment (closeness) of features from positive pairs, and (2) uniformity…

Machine Learning · Computer Science 2022-08-17 Tongzhou Wang, Phillip Isola

ST-ReP: Learning Predictive Representations Efficiently for Spatial-Temporal Forecasting

Spatial-temporal forecasting is crucial and widely applicable in various domains such as traffic, energy, and climate. Benefiting from the abundance of unlabeled spatial-temporal data, self-supervised methods are increasingly adapted to…

Machine Learning · Computer Science 2024-12-20 Qi Zheng, Zihao Yao, Yaying Zhang

Structure-Regularized Attention for Deformable Object Representation

Capturing contextual dependencies has proven useful to improve the representational power of deep neural networks. Recent approaches that focus on modeling global context, such as self-attention and non-local operation, achieve this goal by…

Computer Vision and Pattern Recognition · Computer Science 2021-06-15 Shenao Zhang, Li Shen, Zhifeng Li, Wei Liu

Adaptive Feature Representation for Visual Tracking

Robust feature representation plays a significant role in visual tracking. However, it remains a challenging issue, since many factors may affect the experimental performance. The existing methods which combine different features by setting…

Computer Vision and Pattern Recognition · Computer Science 2017-05-15 Yuqi Han, Chenwei Deng, Zengshuo Zhang, Jiatong Li, Baojun Zhao

Learning by Aligning Videos in Time

We present a self-supervised approach for learning video representations using temporal video alignment as a pretext task, while exploiting both frame-level and video-level information. We leverage a novel combination of temporal alignment…

Computer Vision and Pattern Recognition · Computer Science 2023-08-21 Sanjay Haresh, Sateesh Kumar, Huseyin Coskun, Shahram Najam Syed, Andrey Konin, Muhammad Zeeshan Zia, Quoc-Huy Tran