Equivariant Diffusion Policy

Dian Wang; Stephen Hart; David Surovik; Tarik Kelestemur; Haojie Huang; Haibo Zhao; Mark Yeatman; Jiuguang Wang; Robin Walters; Robert Platt

Equivariant Diffusion Policy

Robotics 2024-10-16 v3 Machine Learning

Authors: Dian Wang , Stephen Hart , David Surovik , Tarik Kelestemur , Haojie Huang , Haibo Zhao , Mark Yeatman , Jiuguang Wang , Robin Walters , Robert Platt

View on arXiv ↗ PDF ↗

Abstract

Recent work has shown diffusion models are an effective approach to learning the multimodal distributions arising from demonstration data in behavior cloning. However, a drawback of this approach is the need to learn a denoising function, which is significantly more complex than learning an explicit policy. In this work, we propose Equivariant Diffusion Policy, a novel diffusion policy learning method that leverages domain symmetries to obtain better sample efficiency and generalization in the denoising function. We theoretically analyze the $\mathrm{SO}(2)$ symmetry of full 6-DoF control and characterize when a diffusion model is $\mathrm{SO}(2)$ -equivariant. We furthermore evaluate the method empirically on a set of 12 simulation tasks in MimicGen, and show that it obtains a success rate that is, on average, 21.9% higher than the baseline Diffusion Policy. We also evaluate the method on a real-world system to show that effective policies can be learned with relatively few training samples, whereas the baseline Diffusion Policy cannot.

Keywords

diffusion model hyperparameter optimization machine learning

Cite

@article{arxiv.2407.01812,
  title  = {Equivariant Diffusion Policy},
  author = {Dian Wang and Stephen Hart and David Surovik and Tarik Kelestemur and Haojie Huang and Haibo Zhao and Mark Yeatman and Jiuguang Wang and Robin Walters and Robert Platt},
  journal= {arXiv preprint arXiv:2407.01812},
  year   = {2024}
}

Comments

Conference on Robot Learning 2024, Oral Presentation

Equivariant Diffusion Policy

Abstract

Keywords

Cite

Comments

Related papers