Related papers: Accelerating Representation Learning with View-Con…

Value-Consistent Representation Learning for Data-Efficient Reinforcement Learning

Deep reinforcement learning (RL) algorithms suffer severe performance degradation when the interaction data is scarce, which limits their real-world application. Recently, visual representation learning has been shown to be effective and…

Machine Learning · Computer Science 2022-08-17 Yang Yue , Bingyi Kang , Zhongwen Xu , Gao Huang , Shuicheng Yan

Learning Markov State Abstractions for Deep Reinforcement Learning

A fundamental assumption of reinforcement learning in Markov decision processes (MDPs) is that the relevant decision process is, in fact, Markov. However, when MDPs have rich observations, agents typically learn by way of an abstract state…

Machine Learning · Computer Science 2024-03-18 Cameron Allen , Neev Parikh , Omer Gottesman , George Konidaris

Intrinsic Dynamics-Driven Generalizable Scene Representations for Vision-Oriented Decision-Making Applications

How to improve the ability of scene representation is a key issue in vision-oriented decision-making applications, and current approaches usually learn task-relevant state representations within visual reinforcement learning to address this…

Artificial Intelligence · Computer Science 2024-10-24 Dayang Liang , Jinyang Lai , Yunlong Liu

Learning Generalizable Representations for Reinforcement Learning via Adaptive Meta-learner of Behavioral Similarities

How to learn an effective reinforcement learning-based model for control tasks from high-level visual observations is a practical and challenging problem. A key to solving this problem is to learn low-dimensional state representations from…

Machine Learning · Computer Science 2022-12-27 Jianda Chen , Sinno Jialin Pan

Pre-trained Visual Dynamics Representations for Efficient Policy Learning

Pre-training for Reinforcement Learning (RL) with purely video data is a valuable yet challenging problem. Although in-the-wild videos are readily available and inhere a vast amount of prior world knowledge, the absence of action…

Computer Vision and Pattern Recognition · Computer Science 2024-11-06 Hao Luo , Bohan Zhou , Zongqing Lu

Integrating Contrastive Learning with Dynamic Models for Reinforcement Learning from Images

Recent methods for reinforcement learning from images use auxiliary tasks to learn image features that are used by the agent's policy or Q-function. In particular, methods based on contrastive learning that induce linearity of the latent…

Machine Learning · Computer Science 2022-03-04 Bang You , Oleg Arenz , Youping Chen , Jan Peters

Dynamics-aware Embeddings

In this paper we consider self-supervised representation learning to improve sample efficiency in reinforcement learning (RL). We propose a forward prediction objective for simultaneously learning embeddings of states and action sequences.…

Machine Learning · Computer Science 2020-01-15 William Whitney , Rajat Agarwal , Kyunghyun Cho , Abhinav Gupta

DMC-VB: A Benchmark for Representation Learning for Control with Visual Distractors

Learning from previously collected data via behavioral cloning or offline reinforcement learning (RL) is a powerful recipe for scaling generalist agents by avoiding the need for expensive online learning. Despite strong generalization in…

Machine Learning · Computer Science 2024-09-30 Joseph Ortiz , Antoine Dedieu , Wolfgang Lehrach , Swaroop Guntupalli , Carter Wendelken , Ahmad Humayun , Guangyao Zhou , Sivaramakrishnan Swaminathan , Miguel Lázaro-Gredilla , Kevin Murphy

Towards Principled Representation Learning from Videos for Reinforcement Learning

We study pre-training representations for decision-making using video data, which is abundantly available for tasks such as game agents and software testing. Even though significant empirical advances have been made on this problem, a…

Machine Learning · Computer Science 2024-03-21 Dipendra Misra , Akanksha Saran , Tengyang Xie , Alex Lamb , John Langford

Time-Aware and View-Aware Video Rendering for Unsupervised Representation Learning

The recent success in deep learning has lead to various effective representation learning methods for videos. However, the current approaches for video representation require large amount of human labeled datasets for effective learning. We…

Computer Vision and Pattern Recognition · Computer Science 2018-11-30 Shruti Vyas , Yogesh S Rawat , Mubarak Shah

Low-Dimensional State and Action Representation Learning with MDP Homomorphism Metrics

Deep Reinforcement Learning has shown its ability in solving complicated problems directly from high-dimensional observations. However, in end-to-end settings, Reinforcement Learning algorithms are not sample-efficient and requires long…

Machine Learning · Computer Science 2021-07-06 Nicolò Botteghi , Mannes Poel , Beril Sirmacek , Christoph Brune

Learning Fused State Representations for Control from Multi-View Observations

Multi-View Reinforcement Learning (MVRL) seeks to provide agents with multi-view observations, enabling them to perceive environment with greater effectiveness and precision. Recent advancements in MVRL focus on extracting latent…

Machine Learning · Computer Science 2025-09-23 Zeyu Wang , Yao-Hui Li , Xin Li , Hongyu Zang , Romain Laroche , Riashat Islam

Keep it Simple: Data-efficient Learning for Controlling Complex Systems with Simple Models

When manipulating a novel object with complex dynamics, a state representation is not always available, for example for deformable objects. Learning both a representation and dynamics from observations requires large amounts of data. We…

Robotics · Computer Science 2021-02-18 Thomas Power , Dmitry Berenson

Towards Efficient and Effective Self-Supervised Learning of Visual Representations

Self-supervision has emerged as a propitious method for visual representation learning after the recent paradigm shift from handcrafted pretext tasks to instance-similarity based approaches. Most state-of-the-art methods enforce similarity…

Computer Vision and Pattern Recognition · Computer Science 2022-10-19 Sravanti Addepalli , Kaushal Bhogale , Priyam Dey , R. Venkatesh Babu

State Representation Learning for Goal-Conditioned Reinforcement Learning

This paper presents a novel state representation for reward-free Markov decision processes. The idea is to learn, in a self-supervised manner, an embedding space where distances between pairs of embedded states correspond to the minimum…

Machine Learning · Computer Science 2022-05-05 Lorenzo Steccanella , Anders Jonsson

Memory-augmented Dense Predictive Coding for Video Representation Learning

The objective of this paper is self-supervised learning from video, in particular for representations for action recognition. We make the following contributions: (i) We propose a new architecture and learning framework Memory-augmented…

Computer Vision and Pattern Recognition · Computer Science 2020-08-04 Tengda Han , Weidi Xie , Andrew Zisserman

Seeking Visual Discomfort: Curiosity-driven Representations for Reinforcement Learning

Vision-based reinforcement learning (RL) is a promising approach to solve control tasks involving images as the main observation. State-of-the-art RL algorithms still struggle in terms of sample efficiency, especially when using image…

Machine Learning · Computer Science 2021-10-05 Elie Aljalbout , Maximilian Ulmer , Rudolph Triebel

Bridging State and History Representations: Understanding Self-Predictive RL

Representations are at the core of all deep reinforcement learning (RL) methods for both Markov decision processes (MDPs) and partially observable Markov decision processes (POMDPs). Many representation learning methods and theoretical…

Machine Learning · Computer Science 2024-04-23 Tianwei Ni , Benjamin Eysenbach , Erfan Seyedsalehi , Michel Ma , Clement Gehring , Aditya Mahajan , Pierre-Luc Bacon

ImplicitRDP: An End-to-End Visual-Force Diffusion Policy with Structural Slow-Fast Learning

Human-level contact-rich manipulation relies on the distinct roles of two key modalities: vision provides spatially rich but temporally slow global context, while force sensing captures rapid, high-frequency local contact dynamics.…

Robotics · Computer Science 2025-12-12 Wendi Chen , Han Xue , Yi Wang , Fangyuan Zhou , Jun Lv , Yang Jin , Shirun Tang , Chuan Wen , Cewu Lu

Visualizing Dynamics: from t-SNE to SEMI-MDPs

Deep Reinforcement Learning (DRL) is a trending field of research, showing great promise in many challenging problems such as playing Atari, solving Go and controlling robots. While DRL agents perform well in practice we are still missing…

Machine Learning · Statistics 2016-06-24 Nir Ben Zrihem , Tom Zahavy , Shie Mannor