Related papers: Faster Diffusion Action Segmentation

Temporal Action Segmentation: An Analysis of Modern Techniques

Temporal action segmentation (TAS) in videos aims at densely identifying video frames in minutes-long videos with multiple action classes. As a long-range video understanding task, researchers have developed an extended collection of…

Computer Vision and Pattern Recognition · Computer Science 2023-10-24 Guodong Ding , Fadime Sener , Angela Yao

Learning Action Hierarchies via Hybrid Geometric Diffusion

Temporal action segmentation is a critical task in video understanding, where the goal is to assign action labels to each frame in a video. While recent advances leverage iterative refinement-based strategies, they fail to explicitly…

Computer Vision and Pattern Recognition · Computer Science 2026-01-06 Arjun Ramesh Kaushik , Nalini K. Ratha , Venu Govindaraju

Efficient Temporal Action Segmentation via Boundary-aware Query Voting

Although the performance of Temporal Action Segmentation (TAS) has improved in recent years, achieving promising results often comes with a high computational cost due to dense inputs, complex model structures, and resource-intensive…

Computer Vision and Pattern Recognition · Computer Science 2024-05-28 Peiyao Wang , Yuewei Lin , Erik Blasch , Jie Wei , Haibin Ling

DiffTAD: Temporal Action Detection with Proposal Denoising Diffusion

We propose a new formulation of temporal action detection (TAD) with denoising diffusion, DiffTAD in short. Taking as input random temporal proposals, it can yield action proposals accurately given an untrimmed long video. This presents a…

Computer Vision and Pattern Recognition · Computer Science 2023-07-17 Sauradip Nag , Xiatian Zhu , Jiankang Deng , Yi-Zhe Song , Tao Xiang

Temporal Action Selection for Action Chunking

Action chunking is a widely adopted approach in Learning from Demonstration (LfD). By modeling multi-step action chunks rather than single-step actions, action chunking significantly enhances modeling capabilities for human expert policies.…

Robotics · Computer Science 2025-11-07 Yueyang Weng , Xiaopeng Zhang , Yongjin Mu , Yingcong Zhu , Yanjie Li , Qi Liu

An Efficient Framework for Few-shot Skeleton-based Temporal Action Segmentation

Temporal action segmentation (TAS) aims to classify and locate actions in the long untrimmed action sequence. With the success of deep learning, many deep models for action segmentation have emerged. However, few-shot TAS is still a…

Computer Vision and Pattern Recognition · Computer Science 2022-07-21 Leiyang Xu , Qiang Wang , Xiaotian Lin , Lin Yuan

Streaming Video Temporal Action Segmentation In Real Time

Temporal action segmentation (TAS) is a critical step toward long-term video understanding. Recent studies follow a pattern that builds models based on features instead of raw video picture information. However, we claim those models are…

Computer Vision and Pattern Recognition · Computer Science 2023-10-11 Wujun Wen , Yunheng Li , Zhuben Dong , Lin Feng , Wanxiao Yang , Shenlan Liu

Boundary-Centric Active Learning for Temporal Action Segmentation

Temporal action segmentation (TAS) demands dense temporal supervision, yet most of the annotation cost in untrimmed videos is spent identifying and refining action transitions, where segmentation errors concentrate and small temporal shifts…

Computer Vision and Pattern Recognition · Computer Science 2026-04-17 Halil Ismail Helvaci , Sen-ching Samson Cheung

Improving Temporal Action Segmentation via Constraint-Aware Decoding

Temporal action segmentation (TAS) divides untrimmed videos into labeled action segments. While fully supervised methods have advanced the field, challenges such as action variability, ambiguous boundaries, and high annotation costs remain,…

Computer Vision and Pattern Recognition · Computer Science 2026-05-12 Yeo Keat Ee , Debaditya Roy , Chen Li , Hao Zhang , Basura Fernando

Introducing Gating and Context into Temporal Action Detection

Temporal Action Detection (TAD), the task of localizing and classifying actions in untrimmed video, remains challenging due to action overlaps and variable action durations. Recent findings suggest that TAD performance is dependent on the…

Computer Vision and Pattern Recognition · Computer Science 2024-09-09 Aglind Reka , Diana Laura Borza , Dominick Reilly , Michal Balazia , Francois Bremond

VIFSS: View-Invariant and Figure Skating-Specific Pose Representation Learning for Temporal Action Segmentation

Understanding human actions from videos plays a critical role across various domains, including sports analytics. In figure skating, accurately recognizing the type and timing of jumps a skater performs is essential for objective…

Computer Vision and Pattern Recognition · Computer Science 2025-08-15 Ryota Tanaka , Tomohiro Suzuki , Keisuke Fujii

SegTAD: Precise Temporal Action Detection via Semantic Segmentation

Temporal action detection (TAD) is an important yet challenging task in video analysis. Most existing works draw inspiration from image object detection and tend to reformulate it as a proposal generation - classification problem. However,…

Computer Vision and Pattern Recognition · Computer Science 2022-03-04 Chen Zhao , Merey Ramazanova , Mengmeng Xu , Bernard Ghanem

Towards High-Quality Temporal Action Detection with Sparse Proposals

Temporal Action Detection (TAD) is an essential and challenging topic in video understanding, aiming to localize the temporal segments containing human action instances and predict the action categories. The previous works greatly rely upon…

Computer Vision and Pattern Recognition · Computer Science 2021-09-21 Jiannan Wu , Peize Sun , Shoufa Chen , Jiewen Yang , Zihao Qi , Lan Ma , Ping Luo

ActFusion: a Unified Diffusion Model for Action Segmentation and Anticipation

Temporal action segmentation and long-term action anticipation are two popular vision tasks for the temporal analysis of actions in videos. Despite apparent relevance and potential complementarity, these two problems have been investigated…

Computer Vision and Pattern Recognition · Computer Science 2024-12-06 Dayoung Gong , Suha Kwak , Minsu Cho

End-to-End Streaming Video Temporal Action Segmentation with Reinforce Learning

The streaming temporal action segmentation (STAS) task, a supplementary task of temporal action segmentation (TAS), has not received adequate attention in the field of video understanding. Existing TAS methods are constrained to offline…

Computer Vision and Pattern Recognition · Computer Science 2024-05-24 Jinrong Zhang , Wujun Wen , Shenglan Liu , Yunheng Li , Qifeng Li , Lin Feng

Diffusion Action Segmentation

Temporal action segmentation is crucial for understanding long-form videos. Previous works on this task commonly adopt an iterative refinement paradigm by using multi-stage models. We propose a novel framework via denoising diffusion…

Computer Vision and Pattern Recognition · Computer Science 2023-08-15 Daochang Liu , Qiyue Li , AnhDung Dinh , Tingting Jiang , Mubarak Shah , Chang Xu

Temporal Sequence Distillation: Towards Few-Frame Action Recognition in Videos

Video Analytics Software as a Service (VA SaaS) has been rapidly growing in recent years. VA SaaS is typically accessed by users using a lightweight client. Because the transmission bandwidth between the client and cloud is usually limited…

Computer Vision and Pattern Recognition · Computer Science 2018-08-16 Zhaoyang Zhang , Zhanghui Kuang , Ping Luo , Litong Feng , Wei Zhang

3D Pose-Based Temporal Action Segmentation for Figure Skating: A Fine-Grained and Jump Procedure-Aware Annotation Approach

Understanding human actions from videos is essential in many domains, including sports. In figure skating, technical judgments are performed by watching skaters' 3D movements, and its part of the judging procedure can be regarded as a…

Computer Vision and Pattern Recognition · Computer Science 2024-08-30 Ryota Tanaka , Tomohiro Suzuki , Keisuke Fujii

Region-Adaptive Sampling for Diffusion Transformers

Diffusion models (DMs) have become the leading choice for generative tasks across diverse domains. However, their reliance on multiple sequential forward passes significantly limits real-time performance. Previous acceleration methods have…

Computer Vision and Pattern Recognition · Computer Science 2025-02-17 Ziming Liu , Yifan Yang , Chengruidong Zhang , Yiqi Zhang , Lili Qiu , Yang You , Yuqing Yang

Equivariant Sampling for Improving Diffusion Model-based Image Restoration

Recent advances in generative models, especially diffusion models, have significantly improved image restoration (IR) performance. However, existing problem-agnostic diffusion model-based image restoration (DMIR) methods face challenges in…

Computer Vision and Pattern Recognition · Computer Science 2025-11-14 Chenxu Wu , Qingpeng Kong , Peiang Zhao , Wendi Yang , Wenxin Ma , Fenghe Tang , Zihang Jiang , S. Kevin Zhou