Related papers: Bidirectional Decoding: Improving Action Chunking …

Self-Guided Action Diffusion

Recent works have shown the promise of inference-time search over action samples for improving generative robot policies. In particular, optimizing cross-chunk coherence via bidirectional decoding has proven effective in boosting the…

Robotics · Computer Science 2025-08-19 Rhea Malhotra , Yuejiang Liu , Chelsea Finn

Temporal Action Selection for Action Chunking

Action chunking is a widely adopted approach in Learning from Demonstration (LfD). By modeling multi-step action chunks rather than single-step actions, action chunking significantly enhances modeling capabilities for human expert policies.…

Robotics · Computer Science 2025-11-07 Yueyang Weng , Xiaopeng Zhang , Yongjin Mu , Yingcong Zhu , Yanjie Li , Qi Liu

Learning Action-based Representations Using Invariance

Robust reinforcement learning agents using high-dimensional observations must be able to identify relevant state features amidst many exogeneous distractors. A representation that captures controllability identifies these state elements by…

Machine Learning · Computer Science 2024-06-25 Max Rudolph , Caleb Chuck , Kevin Black , Misha Lvovsky , Scott Niekum , Amy Zhang

BID: Boundary-Interior Decoding for Unsupervised Temporal Action Localization Pre-Trainin

Skeleton-based motion representations are robust for action localization and understanding for their invariance to perspective, lighting, and occlusion, compared with images. Yet, they are often ambiguous and incomplete when taken out of…

Computer Vision and Pattern Recognition · Computer Science 2024-03-13 Qihang Fang , Chengcheng Tang , Shugao Ma , Yanchao Yang

Concurrent Training Improves the Performance of Behavioral Cloning from Observation

Learning from demonstration is widely used as an efficient way for robots to acquire new skills. However, it typically requires that demonstrations provide full access to the state and action sequences. In contrast, learning from…

Machine Learning · Computer Science 2020-08-05 Zachary W. Robertson , Matthew R. Walter

Action Chunking and Exploratory Data Collection Yield Exponential Improvements in Behavior Cloning for Continuous Control

This paper presents a theoretical analysis of two of the most impactful interventions in modern learning from demonstration in robotics and continuous control: the practice of action-chunking (predicting sequences of actions in open-loop)…

Machine Learning · Computer Science 2025-11-27 Thomas T. Zhang , Daniel Pfrommer , Chaoyi Pan , Nikolai Matni , Max Simchowitz

Bidirectional Action Sequence Learning for Long-term Action Anticipation with Large Language Models

Video-based long-term action anticipation is crucial for early risk detection in areas such as automated driving and robotics. Conventional approaches extract features from past actions using encoders and predict future events with…

Computer Vision and Pattern Recognition · Computer Science 2025-08-04 Yuji Sato , Yasunori Ishii , Takayoshi Yamashita

Decoupled Q-Chunking

Temporal-difference (TD) methods learn state and action values efficiently by bootstrapping from their own future value predictions, but such a self-bootstrapping mechanism is prone to bootstrapping bias, where the errors in the value…

Machine Learning · Computer Science 2025-12-15 Qiyang Li , Seohong Park , Sergey Levine

DiffTAD: Temporal Action Detection with Proposal Denoising Diffusion

We propose a new formulation of temporal action detection (TAD) with denoising diffusion, DiffTAD in short. Taking as input random temporal proposals, it can yield action proposals accurately given an untrimmed long video. This presents a…

Computer Vision and Pattern Recognition · Computer Science 2023-07-17 Sauradip Nag , Xiatian Zhu , Jiankang Deng , Yi-Zhe Song , Tao Xiang

Adaptive Action Chunking via Multi-Chunk Q Value Estimation

Action chunking emerged as a pivotal technique in imitation learning, enabling policies to predict cohesive action sequences rather than single actions. Recently, this approach has expanded to reinforcement learning (RL), enhancing…

Machine Learning · Computer Science 2026-05-12 Yongjae Shin , Jongseong Chae , Seongmin Kim , Jongeui Park , Youngchul Sung

From Knowing to Doing: Learning Diverse Motor Skills through Instruction Learning

Recent years have witnessed many successful trials in the robot learning field. For contact-rich robotic tasks, it is challenging to learn coordinated motor skills by reinforcement learning. Imitation learning solves this problem by using a…

Robotics · Computer Science 2023-11-02 Linqi Ye , Jiayi Li , Yi Cheng , Xianhao Wang , Bin Liang , Yan Peng

CCDP: Composition of Conditional Diffusion Policies with Guided Sampling

Imitation Learning offers a promising approach to learn directly from data without requiring explicit models, simulations, or detailed task definitions. During inference, actions are sampled from the learned distribution and executed on the…

Robotics · Computer Science 2025-10-28 Amirreza Razmjoo , Sylvain Calinon , Michael Gienger , Fan Zhang

Improving Generative Behavior Cloning via Self-Guidance and Adaptive Chunking

Generative Behavior Cloning (GBC) is a simple yet effective framework for robot learning, particularly in multi-task settings. Recent GBC methods often employ diffusion policies with open-loop (OL) control, where actions are generated via a…

Robotics · Computer Science 2025-10-15 Junhyuk So , Chiwoong Lee , Shinyoung Lee , Jungseul Ok , Eunhyeok Park

HiPolicy: Hierarchical Multi-Frequency Action Chunking for Policy Learning

Robotic imitation learning faces a fundamental trade-off between modeling long-horizon dependencies and enabling fine-grained closed-loop control. Existing fixed-frequency action chunking approaches struggle to achieve both. Building on…

Robotics · Computer Science 2026-04-08 Jiyao Zhang , Zimu Han , Junhan Wang , Xionghao Wu , Shihong Lin , Jinzhou Li , Hongwei Fan , Ruihai Wu , Dongjiang Li , Hao Dong

BIRD: Behavior Induction via Representation-structure Distillation

Human-aligned deep learning models exhibit behaviors consistent with human values, such as robustness, fairness, and honesty. Transferring these behavioral properties to models trained on different tasks or data distributions remains…

Machine Learning · Computer Science 2025-06-02 Galen Pogoncheff , Michael Beyeler

An Empirical Study of End-to-End Temporal Action Detection

Temporal action detection (TAD) is an important yet challenging task in video understanding. It aims to simultaneously predict the semantic label and the temporal interval of every action instance in an untrimmed video. Rather than…

Computer Vision and Pattern Recognition · Computer Science 2022-04-07 Xiaolong Liu , Song Bai , Xiang Bai

LiPo: A Lightweight Post-optimization Framework for Smoothing Action Chunks Generated by Learned Policies

Recent advances in imitation learning have enabled robots to perform increasingly complex manipulation tasks in unstructured environments. However, most learned policies rely on discrete action chunking, which introduces discontinuities at…

Robotics · Computer Science 2025-06-06 Dongwoo Son , Suhan Park

PID-Inspired Inductive Biases for Deep Reinforcement Learning in Partially Observable Control Tasks

Deep reinforcement learning (RL) has shown immense potential for learning to control systems through data alone. However, one challenge deep RL faces is that the full state of the system is often not observable. When this is the case, the…

Machine Learning · Computer Science 2023-10-27 Ian Char , Jeff Schneider

Bayesian Action Decoder for Deep Multi-Agent Reinforcement Learning

When observing the actions of others, humans make inferences about why they acted as they did, and what this implies about the world; humans also use the fact that their actions will be interpreted in this manner, allowing them to act…

Multiagent Systems · Computer Science 2019-09-12 Jakob N. Foerster , Francis Song , Edward Hughes , Neil Burch , Iain Dunning , Shimon Whiteson , Matthew Botvinick , Michael Bowling

Sequence Compression Speeds Up Credit Assignment in Reinforcement Learning

Temporal credit assignment in reinforcement learning is challenging due to delayed and stochastic outcomes. Monte Carlo targets can bridge long delays between action and consequence but lead to high-variance targets due to stochasticity.…

Machine Learning · Computer Science 2024-06-05 Aditya A. Ramesh , Kenny Young , Louis Kirsch , Jürgen Schmidhuber