Related papers: Visual Reinforcement Learning with Self-Supervised…

Unsupervised Visuomotor Control through Distributional Planning Networks

While reinforcement learning (RL) has the potential to enable robots to autonomously acquire a wide range of skills, in practice, RL usually requires manual, per-task engineering of reward functions, especially in real world settings where…

Robotics · Computer Science 2019-02-15 Tianhe Yu , Gleb Shevchuk , Dorsa Sadigh , Chelsea Finn

End-to-end Reinforcement Learning of Robotic Manipulation with Robust Keypoints Representation

We present an end-to-end Reinforcement Learning(RL) framework for robotic manipulation tasks, using a robust and efficient keypoints representation. The proposed method learns keypoints from camera images as the state representation,…

Robotics · Computer Science 2022-02-15 Tianying Wang , En Yen Puang , Marcus Lee , Yan Wu , Wei Jing

Visual Foresight: Model-Based Deep Reinforcement Learning for Vision-Based Robotic Control

Deep reinforcement learning (RL) algorithms can learn complex robotic skills from raw sensory inputs, but have yet to achieve the kind of broad generalization and applicability demonstrated by deep learning methods in supervised domains. We…

Robotics · Computer Science 2018-12-04 Frederik Ebert , Chelsea Finn , Sudeep Dasari , Annie Xie , Alex Lee , Sergey Levine

ManiVID-3D: Generalizable View-Invariant Reinforcement Learning for Robotic Manipulation via Disentangled 3D Representations

Deploying visual reinforcement learning (RL) policies in real-world manipulation is often hindered by camera viewpoint changes. A policy trained from a fixed front-facing camera may fail when the camera is shifted -- an unavoidable…

Robotics · Computer Science 2026-03-13 Zheng Li , Pei Qu , Yufei Jia , Shihui Zhou , Haizhou Ge , Jiahang Cao , Jinni Zhou , Guyue Zhou , Jun Ma

Masked World Models for Visual Control

Visual model-based reinforcement learning (RL) has the potential to enable sample-efficient robot learning from visual observations. Yet the current approaches typically train a single model end-to-end for learning both visual…

Robotics · Computer Science 2023-05-30 Younggyo Seo , Danijar Hafner , Hao Liu , Fangchen Liu , Stephen James , Kimin Lee , Pieter Abbeel

Integrating Reinforcement Learning with Visual Generative Models: Foundations and Advances

Generative models have made significant progress in synthesizing visual content, including images, videos, and 3D/4D structures. However, they are typically trained with surrogate objectives such as likelihood or reconstruction loss, which…

Computer Vision and Pattern Recognition · Computer Science 2026-01-21 Yuanzhi Liang , Yijie Fang , Ke Hao , Rui Li , Ziqi Ni , Ruijie Su , Chi Zhang

Enhancing 2D Representation Learning with a 3D Prior

Learning robust and effective representations of visual data is a fundamental task in computer vision. Traditionally, this is achieved by training models with labeled data which can be expensive to obtain. Self-supervised learning attempts…

Computer Vision and Pattern Recognition · Computer Science 2024-06-05 Mehmet Aygün , Prithviraj Dhar , Zhicheng Yan , Oisin Mac Aodha , Rakesh Ranjan

Light-weight probing of unsupervised representations for Reinforcement Learning

Unsupervised visual representation learning offers the opportunity to leverage large corpora of unlabeled trajectories to form useful visual representations, which can benefit the training of reinforcement learning (RL) algorithms. However,…

Machine Learning · Computer Science 2024-06-04 Wancong Zhang , Anthony GX-Chen , Vlad Sobal , Yann LeCun , Nicolas Carion

Reinforcement Learning for Motor Control: A Comprehensive Review

Electric motors are crucial in many applications, but traditional control methods struggle with nonlinearities, parameter uncertainties, and external disturbances. Reinforcement Learning (RL) offers a promising solution as a data-driven…

Systems and Control · Electrical Eng. & Systems 2024-12-25 Danial Kazemikia

R3L: Relative Representations for Reinforcement Learning

Visual Reinforcement Learning is a popular and powerful framework that takes full advantage of the Deep Learning breakthrough. It is known that variations in input domains (e.g., different panorama colors due to seasonal changes) or task…

Machine Learning · Computer Science 2025-02-19 Antonio Pio Ricciardi , Valentino Maiorca , Luca Moschella , Riccardo Marin , Emanuele Rodolà

Seeking Visual Discomfort: Curiosity-driven Representations for Reinforcement Learning

Vision-based reinforcement learning (RL) is a promising approach to solve control tasks involving images as the main observation. State-of-the-art RL algorithms still struggle in terms of sample efficiency, especially when using image…

Machine Learning · Computer Science 2021-10-05 Elie Aljalbout , Maximilian Ulmer , Rudolph Triebel

Reinforcement Learning with Action-Free Pre-Training from Videos

Recent unsupervised pre-training methods have shown to be effective on language and vision domains by learning useful representations for multiple downstream tasks. In this paper, we investigate if such unsupervised pre-training methods can…

Computer Vision and Pattern Recognition · Computer Science 2022-06-20 Younggyo Seo , Kimin Lee , Stephen James , Pieter Abbeel

State Representation Learning from Demonstration

Robots could learn their own state and world representation from perception and experience without supervision. This desirable goal is the main focus of our field of interest, state representation learning (SRL). Indeed, a compact…

Machine Learning · Computer Science 2021-09-28 Astrid Merckling , Alexandre Coninx , Loic Cressot , Stéphane Doncieux , Nicolas Perrin-Gilbert

Boosting Reinforcement Learning in 3D Visuospatial Tasks Through Human-Informed Curriculum Design

Reinforcement Learning is a mature technology, often suggested as a potential route towards Artificial General Intelligence, with the ambitious goal of replicating the wide range of abilities found in natural and artificial intelligence,…

Machine Learning · Computer Science 2025-11-25 Markus D. Solbach , John K. Tsotsos

SOLAR: Deep Structured Representations for Model-Based Reinforcement Learning

Model-based reinforcement learning (RL) has proven to be a data efficient approach for learning control tasks but is difficult to utilize in domains with complex observations such as images. In this paper, we present a method for learning…

Machine Learning · Computer Science 2019-06-25 Marvin Zhang , Sharad Vikram , Laura Smith , Pieter Abbeel , Matthew J. Johnson , Sergey Levine

Reinforcement Learning with Prototypical Representations

Learning effective representations in image-based environments is crucial for sample efficient Reinforcement Learning (RL). Unfortunately, in RL, representation learning is confounded with the exploratory experience of the agent -- learning…

Machine Learning · Computer Science 2021-07-21 Denis Yarats , Rob Fergus , Alessandro Lazaric , Lerrel Pinto

Stabilizing Contrastive RL: Techniques for Robotic Goal Reaching from Offline Data

Robotic systems that rely primarily on self-supervised learning have the potential to decrease the amount of human annotation and engineering effort required to learn control strategies. In the same way that prior robotic systems have…

Machine Learning · Computer Science 2025-06-11 Chongyi Zheng , Benjamin Eysenbach , Homer Walke , Patrick Yin , Kuan Fang , Ruslan Salakhutdinov , Sergey Levine

Robot Perception enables Complex Navigation Behavior via Self-Supervised Learning

Learning visuomotor control policies in robotic systems is a fundamental problem when aiming for long-term behavioral autonomy. Recent supervised-learning-based vision and motion perception systems, however, are often separately built with…

Robotics · Computer Science 2020-06-17 Marvin Chancán , Michael Milford

ROLL: Visual Self-Supervised Reinforcement Learning with Object Reasoning

Current image-based reinforcement learning (RL) algorithms typically operate on the whole image without performing object-level reasoning. This leads to inefficient goal sampling and ineffective reward functions. In this paper, we improve…

Machine Learning · Computer Science 2020-11-16 Yufei Wang , Gautham Narayan Narasimhan , Xingyu Lin , Brian Okorn , David Held

Towards Efficient and Effective Self-Supervised Learning of Visual Representations

Self-supervision has emerged as a propitious method for visual representation learning after the recent paradigm shift from handcrafted pretext tasks to instance-similarity based approaches. Most state-of-the-art methods enforce similarity…

Computer Vision and Pattern Recognition · Computer Science 2022-10-19 Sravanti Addepalli , Kaushal Bhogale , Priyam Dey , R. Venkatesh Babu