Sarah Bechtle — Scifaro

Genie: Generative Interactive Environments

We introduce Genie, the first generative interactive environment trained in an unsupervised manner from unlabelled Internet videos. The model can be prompted to generate an endless variety of action-controllable virtual worlds described…

Machine Learning · Computer Science · 2024-02-26 · Jake Bruce, Michael Dennis, Ashley Edwards, Jack Parker-Holder, Yuge Shi, Edward Hughes, Matthew Lai, Aditi Mavalankar, Richie Steigerwald, Chris Apps, Yusuf Aytar, Sarah Bechtle, Feryal Behbahani, Stephanie Chan, Nicolas Heess, Lucy Gonzalez, Simon Osindero, Sherjil Ozair, Scott Reed, Jingwei Zhang, Konrad Zolna, Jeff Clune, Nando de Freitas, Satinder Singh, Tim Rocktäschel

Offline Actor-Critic Reinforcement Learning Scales to Large Models

We show that offline actor-critic reinforcement learning can scale to large models - such as transformers - and follows similar scaling laws as supervised learning. We find that offline actor-critic algorithms can outperform strong,…

Machine Learning · Computer Science · 2024-02-09 · Jost Tobias Springenberg, Abbas Abdolmaleki, Jingwei Zhang, Oliver Groth, Michael Bloesch, Thomas Lampe, Philemon Brakel, Sarah Bechtle, Steven Kapturowski, Roland Hafner, Nicolas Heess, Martin Riedmiller

Mastering Stacking of Diverse Shapes with Large-Scale Iterative Reinforcement Learning on Real Robots

Reinforcement learning solely from an agent's self-generated data is often believed to be infeasible for learning on real robots, due to the amount of data needed. However, if done right, agents learning from real data can be surprisingly…

Robotics · Computer Science · 2023-12-19 · Thomas Lampe, Abbas Abdolmaleki, Sarah Bechtle, Sandy H. Huang, Jost Tobias Springenberg, Michael Bloesch, Oliver Groth, Roland Hafner, Tim Hertweck, Michael Neunert, Markus Wulfmeier, Jingwei Zhang, Francesco Nori, Nicolas Heess, Martin Riedmiller

Foundations for Transfer in Reinforcement Learning: A Taxonomy of Knowledge Modalities

Contemporary artificial intelligence systems exhibit rapidly growing abilities accompanied by the growth of required resources, expansive datasets and corresponding investments into computing infrastructure. Although earlier successes…

Machine Learning · Computer Science · 2023-12-05 · Markus Wulfmeier, Arunkumar Byravan, Sarah Bechtle, Karol Hausman, Nicolas Heess

A Generalist Dynamics Model for Control

We investigate the use of transformer sequence models as dynamics models (TDMs) for control. We find that TDMs exhibit strong generalization capabilities to unseen environments, both in a few-shot setting, where a generalist TDM is…

Artificial Intelligence · Computer Science · 2023-09-26 · Ingmar Schubert, Jingwei Zhang, Jake Bruce, Sarah Bechtle, Emilio Parisotto, Martin Riedmiller, Jost Tobias Springenberg, Arunkumar Byravan, Leonard Hasenclever, Nicolas Heess

Equivariant Data Augmentation for Generalization in Offline Reinforcement Learning

We present a novel approach to address the challenge of generalization in offline reinforcement learning (RL), where the agent learns from a fixed dataset without any additional interaction with the environment. Specifically, we aim to…

Machine Learning · Computer Science · 2023-09-15 · Cristina Pinneri, Sarah Bechtle, Markus Wulfmeier, Arunkumar Byravan, Jingwei Zhang, William F. Whitney, Martin Riedmiller

Model-Based Inverse Reinforcement Learning from Visual Demonstrations

Scaling model-based inverse reinforcement learning (IRL) to real robotic manipulation tasks with unknown dynamics remains an open problem. The key challenges lie in learning good dynamics models, developing algorithms that scale to…

Robotics · Computer Science · 2023-03-08 · Neha Das, Sarah Bechtle, Todor Davchev, Dinesh Jayaraman, Akshara Rai, Franziska Meier

Model Based Meta Learning of Critics for Policy Gradients

Being able to seamlessly generalize across different tasks is fundamental for robots to act in our world. However, learning representations that generalize quickly to new scenarios is still an open research problem in reinforcement…

Machine Learning · Computer Science · 2022-04-06 · Sarah Bechtle, Ludovic Righetti, Franziska Meier

Learning Time-Invariant Reward Functions through Model-Based Inverse Reinforcement Learning

Inverse reinforcement learning is a paradigm motivated by the goal of learning general reward functions from demonstrated behaviours. Yet the notion of generality for learnt costs is often evaluated in terms of robustness to various spatial…

Robotics · Computer Science · 2021-09-15 · Todor Davchev, Sarah Bechtle, Subramanian Ramamoorthy, Franziska Meier

Multi-Modal Learning of Keypoint Predictive Models for Visual Object Manipulation

Humans have impressive generalization capabilities when it comes to manipulating objects and tools in completely novel environments. These capabilities are, at least partially, a result of humans having internal models of their bodies and…

Robotics · Computer Science · 2021-06-28 · Sarah Bechtle, Neha Das, Franziska Meier

Meta-Learning via Learned Loss

Typically, loss functions, regularization mechanisms and other important aspects of training parametric models are chosen heuristically from a limited set of options. In this paper, we take the first step towards automating this process,…

Machine Learning · Computer Science · 2021-01-20 · Sarah Bechtle, Artem Molchanov, Yevgen Chebotar, Edward Grefenstette, Ludovic Righetti, Gaurav Sukhatme, Franziska Meier

Leveraging Forward Model Prediction Error for Learning Control

Learning for model based control can be sample-efficient and generalize well, however successfully learning models and controllers that represent the problem at hand can be challenging for complex tasks. Using inaccurate models for learning…

Robotics · Computer Science · 2020-11-10 · Sarah Bechtle, Bilal Hammoud, Akshara Rai, Franziska Meier, Ludovic Righetti

Curious iLQR: Resolving Uncertainty in Model-based RL

Curiosity as a means to explore during reinforcement learning problems has recently become very popular. However, very little progress has been made in utilizing curiosity for learning control. In this work, we propose a model-based…

Robotics · Computer Science · 2019-10-09 · Sarah Bechtle, Yixin Lin, Akshara Rai, Ludovic Righetti, Franziska Meier