Related papers: Modeling sequential data using higher-order relati…

Learning to encode motion using spatio-temporal synchrony

We consider the task of learning to extract motion from videos. To this end, we show that the detection of spatial transformations can be viewed as the detection of synchrony between the image sequence and a sequence of features undergoing…

Computer Vision and Pattern Recognition · Computer Science 2014-02-11 Kishore Reddy Konda, Roland Memisevic, Vincent Michalski

Enhancing Spatiotemporal Prediction Model using Modular Design and Beyond

Predictive learning uses a known state to generate a future state over a period of time. It is a challenging task to predict spatiotemporal sequence because the spatiotemporal sequence varies both in time and space. The mainstream method is…

Computer Vision and Pattern Recognition · Computer Science 2022-10-05 Haoyu Pan, Hao Wu, Tan Yang

Self-Supervised Representation Learning from Temporal Ordering of Automated Driving Sequences

Self-supervised feature learning enables perception systems to benefit from the vast raw data recorded by vehicle fleets worldwide. While video-level self-supervised learning approaches have shown strong generalizability on classification…

Computer Vision and Pattern Recognition · Computer Science 2023-11-09 Christopher Lang, Alexander Braun, Lars Schillingmann, Karsten Haug, Abhinav Valada

Learning Knowledge-Rich Sequential Model for Planar Homography Estimation in Aerial Video

This paper presents an unsupervised approach that leverages raw aerial videos to learn to estimate planar homographic transformation between consecutive video frames. Previous learning-based estimators work on pairs of images to estimate…

Computer Vision and Pattern Recognition · Computer Science 2023-04-07 Pu Li, Xiaobai Liu

Overparameterized Neural Networks Implement Associative Memory

Identifying computational mechanisms for memorization and retrieval of data is a long-standing problem at the intersection of machine learning and neuroscience. Our main finding is that standard overparameterized deep neural networks…

Machine Learning · Computer Science 2022-05-25 Adityanarayanan Radhakrishnan, Mikhail Belkin, Caroline Uhler

Continual Learning of Predictive Models in Video Sequences via Variational Autoencoders

This paper proposes a method for performing continual learning of predictive models that facilitate the inference of future frames in video sequences. For a first given experience, an initial Variational Autoencoder, together with a set of…

Computer Vision and Pattern Recognition · Computer Science 2020-06-04 Damian Campo, Giulia Slavic, Mohamad Baydoun, Lucio Marcenaro, Carlo Regazzoni

Encoding-based Memory Modules for Recurrent Neural Networks

Learning to solve sequential tasks with recurrent models requires the ability to memorize long sequences and to extract task-relevant features from them. In this paper, we study the memorization subtask from the point of view of the design…

Machine Learning · Computer Science 2020-02-03 Antonio Carta, Alessandro Sperduti, Davide Bacciu

Predictive Learning: Using Future Representation Learning Variantial Autoencoder for Human Action Prediction

The unsupervised pretraining method has been widely used in aiding human action recognition. However, existing methods focus on reconstructing the already present frames rather than generating frames which happen in future. In this paper, we…

Computer Vision and Pattern Recognition · Computer Science 2017-12-13 Yu Runsheng, Shi Zhenyu, Ma Qiongxiong, Qing Laiyun

Multimodal Prediction based on Graph Representations

This paper proposes a learning model, based on rank-fusion graphs, for general applicability in multimodal prediction tasks, such as multimodal regression and image classification. Rank-fusion graphs encode information from multiple…

Computer Vision and Pattern Recognition · Computer Science 2020-07-06 Icaro Cavalcante Dourado, Salvatore Tabbone, Ricardo da Silva Torres

A Deep Learning Model for Structured Outputs with High-order Interaction

Many real-world applications are associated with structured data, where not only input but also output has interplay. However, typical classification and regression models often lack the ability of simultaneously exploring high-order…

Machine Learning · Computer Science 2015-05-01 Hongyu Guo, Xiaodan Zhu, Martin Renqiang Min

Learning to Linearize Under Uncertainty

Training deep feature hierarchies to solve supervised learning tasks has achieved state of the art performance on many problems in computer vision. However, a principled way in which to train such hierarchies in the unsupervised setting has…

Computer Vision and Pattern Recognition · Computer Science 2015-09-11 Ross Goroshin, Michael Mathieu, Yann LeCun

Learnable Fourier Features for Multi-Dimensional Spatial Positional Encoding

Attentional mechanisms are order-invariant. Positional encoding is a crucial component to allow attention-based deep model architectures such as Transformer to address sequences or images where the position of information matters. In this…

Machine Learning · Computer Science 2021-11-10 Yang Li, Si Si, Gang Li, Cho-Jui Hsieh, Samy Bengio

Rank Ordered Autoencoders

A new method for the unsupervised learning of sparse representations using autoencoders is proposed and implemented by ordering the output of the hidden units by their activation value and progressively reconstructing the input in this…

Machine Learning · Computer Science 2016-05-09 Paul Bertens

Conditional Mutual information-based Contrastive Loss for Financial Time Series Forecasting

We present a representation learning framework for financial time series forecasting. One challenge of using deep learning models for finance forecasting is the shortage of available training data when using small datasets. Direct trend…

Machine Learning · Computer Science 2021-05-10 Hanwei Wu, Ather Gattami, Markus Flierl

Higher-Order Function Networks for Learning Composable 3D Object Representations

We present a new approach to 3D object representation where a neural network encodes the geometry of an object directly into the weights and biases of a second 'mapping' network. This mapping network can be used to reconstruct an object by…

Machine Learning · Computer Science 2020-04-07 Eric Mitchell, Selim Engin, Volkan Isler, Daniel D Lee

Image Reconstruction as a Tool for Feature Analysis

Vision encoders are increasingly used in modern applications, from vision-only models to multimodal systems such as vision-language models. Despite their remarkable success, it remains unclear how these architectures represent features…

Computer Vision and Pattern Recognition · Computer Science 2025-06-10 Eduard Allakhverdov, Dmitrii Tarasov, Elizaveta Goncharova, Andrey Kuznetsov

Learning a Generative Motion Model from Image Sequences based on a Latent Motion Matrix

We propose to learn a probabilistic motion model from a sequence of images for spatio-temporal registration. Our model encodes motion in a low-dimensional probabilistic space - the motion matrix - which enables various motion analysis tasks…

Computer Vision and Pattern Recognition · Computer Science 2021-02-02 Julian Krebs, Hervé Delingette, Nicholas Ayache, Tommaso Mansi

Semi-Structured Object Sequence Encoders

In this paper we explore the task of modeling semi-structured object sequences; in particular, we focus our attention on the problem of developing a structure-aware input representation for such sequences. Examples of such data include user…

Computer Vision and Pattern Recognition · Computer Science 2023-05-24 Rudra Murthy, Riyaz Bhat, Chulaka Gunasekara, Siva Sankalp Patel, Hui Wan, Tejas Indulal Dhamecha, Danish Contractor, Marina Danilevsky

Fresh in memory: Training-order recency is linearly encoded in language model activations

We show that language models' activations linearly encode when information was learned during training. Our setup involves creating a model with a known training order by sequentially fine-tuning Llama-3.2-1B on six disjoint but otherwise…

Machine Learning · Computer Science 2025-09-23 Dmitrii Krasheninnikov, Richard E. Turner, David Krueger

Self-supervised learning of class embeddings from video

This work explores how to use self-supervised learning on videos to learn a class-specific image embedding that encodes pose and shape information. At train time, two frames of the same video of an object class (e.g. human upper body) are…

Computer Vision and Pattern Recognition · Computer Science 2019-10-29 Olivia Wiles, A. Sophia Koepke, Andrew Zisserman