Max Simchowitz — Scifaro

Nano World Models: A Minimalist Implementation of Future Video Prediction

World models have become a central paradigm for learning predictive simulators that support generation, planning, and decision-making. Yet, despite rapid progress in industry-scale interactive video generation, the broader research…

Computer Vision and Pattern Recognition · Computer Science · 2026-05-29 · Siqiao Huang, Partha Kaushik, Michael Chen, Hengkai Pan, Kaiwen Geng, Omar Chehab, Fernando Moreno-Pino, Max Simchowitz

Turning Video Models into Generalist Robot Policies

Video generative models have emerged as a promising robotics backbone, capable of generating videos that depict the completion of complex tasks across embodiments and environments. Recent work proposes robot foundation models that jointly…

Robotics · Computer Science · 2026-05-28 · Sizhe Lester Li, Evan Kim, Xingjian Bai, Tong Zhao, Tao Pang, Max Simchowitz, Vincent Sitzmann

MLS-Bench: A Holistic and Rigorous Assessment of AI Systems on Building Better AI

Modern AI progress has been driven by ML methods that are generalizable across settings and scalable to larger regimes. As large language models demonstrate advanced capabilities in reasoning, coding, and engineering tasks, it is…

Machine Learning · Computer Science · 2026-05-28 · Bohan Lyu, Yucheng Yang, Siqiao Huang, Jiaru Zhang, Qixin Xu, Xinghan Li, Xinyang Han, Yicheng Zhang, Huaqing Zhang, Runhan Huang, Kaicheng Yang, Zitao Chen, Wentao Guo, Junlin Yang, Xinyue Ai, Wenhao Chai, Yadi Cao, Ziran Yang, Kun Wang, Dapeng Jiang, Huan-ang Gao, Shange Tang, Chengshuai Shi, Simon S. Du, Max Simchowitz, Jiantao Jiao, Dawn Song, Chi Jin

Diamond Maps: Efficient Reward Alignment via Stochastic Flow Maps

Flow and diffusion models produce high-quality samples, but adapting them to user preferences or constraints post-training remains costly and brittle, a challenge commonly called reward alignment. We argue that efficient reward alignment…

Machine Learning · Computer Science · 2026-05-19 · Peter Holderrieth, Douglas Chen, Luca Eyring, Ishin Shah, Giri Anantharaman, Yutong He, Zeynep Akata, Tommi Jaakkola, Nicholas Matthew Boffi, Max Simchowitz

OGPO: Sample Efficient Full-Finetuning of Generative Control Policies

Generative control policies (GCPs), such as diffusion- and flow-based control policies, have emerged as effective parameterizations for robot learning. This work introduces Off-policy Generative Policy Optimization (OGPO), a…

Machine Learning · Computer Science · 2026-05-06 · Sarvesh Patil, Mitsuhiko Nakamoto, Manan Agarwal, Shashwat Saxena, Jesse Zhang, Giri Anantharaman, Cleah Winston, Chaoyi Pan, Douglas Chen, Nai-Chieh Huang, Zeynep Temel, Oliver Kroemer, Sergey Levine, Abhishek Gupta, Hongkai Dai, Paarth Shah, Max Simchowitz

Joint Distillation for Fast Likelihood Evaluation and Sampling in Flow-based Models

Log-likelihood evaluation enables important capabilities in generative models, including model comparison, certain fine-tuning objectives, and many downstream applications. Yet paradoxically, some of today's best generative models --…

Machine Learning · Computer Science · 2026-04-21 · Xinyue Ai, Yutong He, Albert Gu, Ruslan Salakhutdinov, J Zico Kolter, Nicholas Matthew Boffi, Max Simchowitz

Much Ado About Noising: Dispelling the Myths of Generative Robotic Control

Generative models, like flows and diffusions, have recently emerged as popular and efficacious policy parameterizations in robotics. There has been much speculation as to the factors underlying their successes, ranging from capturing…

Robotics · Computer Science · 2026-02-24 · Chaoyi Pan, Giri Anantharaman, Nai-Chieh Huang, Claire Jin, Daniel Pfrommer, Chenyang Yuan, Frank Permenter, Guannan Qu, Nicholas Boffi, Guanya Shi, Max Simchowitz

Is Your Conditional Diffusion Model Actually Denoising?

We study the inductive biases of diffusion models with a conditioning variable, which have seen widespread application as both text-conditioned generative image models and observation-conditioned continuous control policies. We observe that…

Machine Learning · Computer Science · 2025-12-23 · Daniel Pfrommer, Zehao Dou, Christopher Scarvelis, Max Simchowitz, Ali Jadbabaie

Action Chunking and Exploratory Data Collection Yield Exponential Improvements in Behavior Cloning for Continuous Control

This paper presents a theoretical analysis of two of the most impactful interventions in modern learning from demonstration in robotics and continuous control: the practice of action-chunking (predicting sequences of actions in open-loop)…

Machine Learning · Computer Science · 2025-11-27 · Thomas T. Zhang, Daniel Pfrommer, Chaoyi Pan, Nikolai Matni, Max Simchowitz

Is Linear Feedback on Smoothed Dynamics Sufficient for Stabilizing Contact-Rich Plans?

Designing planners and controllers for contact-rich manipulation is extremely challenging as contact violates the smoothness conditions that many gradient-based controller synthesis tools assume. Contact smoothing approximates a non-smooth…

Robotics · Computer Science · 2025-10-06 · Yuki Shirai, Tong Zhao, H. J. Terry Suh, Huaijiang Zhu, Xinpei Ni, Jiuguang Wang, Max Simchowitz, Tao Pang

A Test-Function Approach to Incremental Stability

This paper presents a novel framework for analyzing Incremental-Input-to-State Stability ($\delta$ISS) based on the idea of using rewards as "test functions." Whereas control theory traditionally deals with Lyapunov functions that satisfy a…

Machine Learning · Computer Science · 2025-09-19 · Daniel Pfrommer, Max Simchowitz, Ali Jadbabaie

The Pitfalls of Imitation Learning when Actions are Continuous

We study the problem of imitating an expert demonstrator in a discrete-time, continuous state-and-action control system. We show that, even if the dynamics satisfy a control-theoretic property called exponential stability (i.e. the effects…

Machine Learning · Computer Science · 2025-07-29 · Max Simchowitz, Daniel Pfrommer, Ali Jadbabaie

History-Guided Video Diffusion

Classifier-free guidance (CFG) is a key technique for improving conditional generation in diffusion models, enabling more accurate control while enhancing sample quality. It is natural to extend this technique to video diffusion, which…

Machine Learning · Computer Science · 2025-07-25 · Kiwhan Song, Boyuan Chen, Max Simchowitz, Yilun Du, Russ Tedrake, Vincent Sitzmann

e3: Learning to Explore Enables Extrapolation of Test-Time Compute for LLMs

Test-time scaling offers a promising path to improve LLM reasoning by utilizing more compute at inference time; however, the true promise of this paradigm lies in extrapolation (i.e., improvement in performance on hard problems as LLMs keep…

Machine Learning · Computer Science · 2025-06-16 · Amrith Setlur, Matthew Y. R. Yang, Charlie Snell, Jeremy Greer, Ian Wu, Virginia Smith, Max Simchowitz, Aviral Kumar

Diffusion Policy Policy Optimization

We introduce Diffusion Policy Policy Optimization, DPPO, an algorithmic framework including best practices for fine-tuning diffusion-based policies (e.g. Diffusion Policy) in continuous control and robot learning tasks using the policy…

Robotics · Computer Science · 2024-12-11 · Allen Z. Ren, Justin Lidard, Lars L. Ankile, Anthony Simeonov, Pulkit Agrawal, Anirudha Majumdar, Benjamin Burchfiel, Hongkai Dai, Max Simchowitz

Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion

This paper presents Diffusion Forcing, a new training paradigm where a diffusion model is trained to denoise a set of tokens with independent per-token noise levels. We apply Diffusion Forcing to sequence generative modeling by training a…

Machine Learning · Computer Science · 2024-12-11 · Boyuan Chen, Diego Marti Monso, Yilun Du, Max Simchowitz, Russ Tedrake, Vincent Sitzmann

Self-Improvement in Language Models: The Sharpening Mechanism

Recent work in language modeling has raised the possibility of self-improvement, where a language model evaluates and refines its own generations to achieve higher performance without external feedback. It is impossible for this…

Artificial Intelligence · Computer Science · 2024-12-05 · Audrey Huang, Adam Block, Dylan J. Foster, Dhruv Rohatgi, Cyril Zhang, Max Simchowitz, Jordan T. Ash, Akshay Krishnamurthy

Faster Algorithms for Growing Collision-Free Convex Polytopes in Robot Configuration Space

We propose two novel algorithms for constructing convex collision-free polytopes in robot configuration space. Finding these polytopes enables the application of stronger motion-planning frameworks such as trajectory optimization with…

Robotics · Computer Science · 2024-11-15 · Peter Werner, Thomas Cohn, Rebecca H. Jiang, Tim Seyde, Max Simchowitz, Russ Tedrake, Daniela Rus

Oracle-Efficient Smoothed Online Learning for Piecewise Continuous Decision Making

Smoothed online learning has emerged as a popular framework to mitigate the substantial loss in statistical and computational complexity that arises when one moves from classical to adversarial learning. Unfortunately, for some spaces, it…

Machine Learning · Statistics · 2024-03-20 · Adam Block, Alexander Rakhlin, Max Simchowitz

Smoothed Online Learning for Prediction in Piecewise Affine Systems

The problem of piecewise affine (PWA) regression and planning is of foundational importance to the study of online learning, control, and robotics, where it provides a theoretically and empirically tractable setting to study systems…

Machine Learning · Statistics · 2024-03-20 · Adam Block, Max Simchowitz, Russ Tedrake