Related papers: Temporal Action Segmentation with High-level Compl…

SCT: Set Constrained Temporal Transformer for Set Supervised Action Segmentation

Temporal action segmentation is a topic of increasing interest, however, annotating each frame in a video is cumbersome and costly. Weakly supervised approaches therefore aim at learning temporal action segmentation from videos that are…

Computer Vision and Pattern Recognition · Computer Science 2020-04-01 Mohsen Fayyaz , Juergen Gall

Hierarchical Attention Network for Action Segmentation

The temporal segmentation of events is an essential task and a precursor for the automatic recognition of human actions in the video. Several attempts have been made to capture frame-level salient aspects through attention but they lack the…

Computer Vision and Pattern Recognition · Computer Science 2020-05-08 Harshala Gammulle , Simon Denman , Sridha Sridharan , Clinton Fookes

SegTAD: Precise Temporal Action Detection via Semantic Segmentation

Temporal action detection (TAD) is an important yet challenging task in video analysis. Most existing works draw inspiration from image object detection and tend to reformulate it as a proposal generation - classification problem. However,…

Computer Vision and Pattern Recognition · Computer Science 2022-03-04 Chen Zhao , Merey Ramazanova , Mengmeng Xu , Bernard Ghanem

Temporal Action Detection with Structured Segment Networks

Detecting actions in untrimmed videos is an important yet challenging task. In this paper, we present the structured segment network (SSN), a novel framework which models the temporal structure of each action instance via a structured…

Computer Vision and Pattern Recognition · Computer Science 2017-09-19 Yue Zhao , Yuanjun Xiong , Limin Wang , Zhirong Wu , Xiaoou Tang , Dahua Lin

Untrimmed Video Classification for Activity Detection: submission to ActivityNet Challenge

Current state-of-the-art human activity recognition is focused on the classification of temporally trimmed videos in which only one action occurs per frame. We propose a simple, yet effective, method for the temporal detection of activities…

Computer Vision and Pattern Recognition · Computer Science 2016-07-14 Gurkirt Singh , Fabio Cuzzolin

A Hybrid Graph Network for Complex Activity Detection in Video

Interpretation and understanding of video presents a challenging computer vision task in numerous fields - e.g. autonomous driving and sports analytics. Existing approaches to interpreting the actions taking place within a video clip are…

Computer Vision and Pattern Recognition · Computer Science 2023-10-31 Salman Khan , Izzeddin Teeti , Andrew Bradley , Mohamed Elhoseiny , Fabio Cuzzolin

Iterative Contrast-Classify For Semi-supervised Temporal Action Segmentation

Temporal action segmentation classifies the action of each frame in (long) video sequences. Due to the high cost of frame-wise labeling, we propose the first semi-supervised method for temporal action segmentation. Our method hinges on…

Computer Vision and Pattern Recognition · Computer Science 2021-12-09 Dipika Singhania , Rahul Rahaman , Angela Yao

Temporal Action Segmentation: An Analysis of Modern Techniques

Temporal action segmentation (TAS) in videos aims at densely identifying video frames in minutes-long videos with multiple action classes. As a long-range video understanding task, researchers have developed an extended collection of…

Computer Vision and Pattern Recognition · Computer Science 2023-10-24 Guodong Ding , Fadime Sener , Angela Yao

TAEC: Unsupervised Action Segmentation with Temporal-Aware Embedding and Clustering

Temporal action segmentation in untrimmed videos has gained increased attention recently. However, annotating action classes and frame-wise boundaries is extremely time consuming and cost intensive, especially on large-scale datasets. To…

Computer Vision and Pattern Recognition · Computer Science 2023-03-10 Wei Lin , Anna Kukleva , Horst Possegger , Hilde Kuehne , Horst Bischof

SSCAP: Self-supervised Co-occurrence Action Parsing for Unsupervised Temporal Action Segmentation

Temporal action segmentation is a task to classify each frame in the video with an action label. However, it is quite expensive to annotate every frame in a large corpus of videos to construct a comprehensive supervised training dataset.…

Computer Vision and Pattern Recognition · Computer Science 2021-10-26 Zhe Wang , Hao Chen , Xinyu Li , Chunhui Liu , Yuanjun Xiong , Joseph Tighe , Charless Fowlkes

Unsupervised learning of action classes with continuous temporal embedding

The task of temporally detecting and segmenting actions in untrimmed videos has seen an increased attention recently. One problem in this context arises from the need to define and label action boundaries to create annotations for training…

Computer Vision and Pattern Recognition · Computer Science 2019-04-09 Anna Kukleva , Hilde Kuehne , Fadime Sener , Juergen Gall

Action Sets: Weakly Supervised Action Segmentation without Ordering Constraints

Action detection and temporal segmentation of actions in videos are topics of increasing interest. While fully supervised systems have gained much attention lately, full annotation of each action within the video is costly and impractical…

Computer Vision and Pattern Recognition · Computer Science 2018-05-18 Alexander Richard , Hilde Kuehne , Juergen Gall

C2F-TCN: A Framework for Semi and Fully Supervised Temporal Action Segmentation

Temporal action segmentation tags action labels for every frame in an input untrimmed video containing multiple actions in a sequence. For the task of temporal action segmentation, we propose an encoder-decoder-style architecture named…

Computer Vision and Pattern Recognition · Computer Science 2022-12-22 Dipika Singhania , Rahul Rahaman , Angela Yao

Temporal Action Detection by Joint Identification-Verification

Temporal action detection aims at not only recognizing action category but also detecting start time and end time for each action instance in an untrimmed video. The key challenge of this task is to accurately classify the action and…

Computer Vision and Pattern Recognition · Computer Science 2018-10-22 Wen Wang , Yongjian Wu , Haijun Liu , Shiguang Wang , Jian Cheng

MS-TCN: Multi-Stage Temporal Convolutional Network for Action Segmentation

Temporally locating and classifying action segments in long untrimmed videos is of particular interest to many applications like surveillance and robotics. While traditional approaches follow a two-step pipeline, by generating frame-wise…

Computer Vision and Pattern Recognition · Computer Science 2019-04-03 Yazan Abu Farha , Juergen Gall

Temporally-Weighted Hierarchical Clustering for Unsupervised Action Segmentation

Action segmentation refers to inferring boundaries of semantically consistent visual concepts in videos and is an important requirement for many video understanding tasks. For this and other video understanding tasks, supervised approaches…

Computer Vision and Pattern Recognition · Computer Science 2021-03-30 M. Saquib Sarfraz , Naila Murray , Vivek Sharma , Ali Diba , Luc Van Gool , Rainer Stiefelhagen

Efficient Action Detection in Untrimmed Videos via Multi-Task Learning

This paper studies the joint learning of action recognition and temporal localization in long, untrimmed videos. We employ a multi-task learning framework that performs the three highly related steps of action proposal, action recognition,…

Computer Vision and Pattern Recognition · Computer Science 2017-04-05 Yi Zhu , Shawn Newsam

Temporal Action Localization with Enhanced Instant Discriminability

Temporal action detection (TAD) aims to detect all action boundaries and their corresponding categories in an untrimmed video. The unclear boundaries of actions in videos often result in imprecise predictions of action boundaries by…

Computer Vision and Pattern Recognition · Computer Science 2023-09-12 Dingfeng Shi , Qiong Cao , Yujie Zhong , Shan An , Jian Cheng , Haogang Zhu , Dacheng Tao

ContextDet: Temporal Action Detection with Adaptive Context Aggregation

Temporal action detection (TAD), which locates and recognizes action segments, remains a challenging task in video understanding due to variable segment lengths and ambiguous boundaries. Existing methods treat neighboring contexts of an…

Computer Vision and Pattern Recognition · Computer Science 2024-10-22 Ning Wang , Yun Xiao , Xiaopeng Peng , Xiaojun Chang , Xuanhong Wang , Dingyi Fang

Action parsing using context features

We propose an action parsing algorithm to parse a video sequence containing an unknown number of actions into its action segments. We argue that context information, particularly the temporal information about other actions in the video…

Computer Vision and Pattern Recognition · Computer Science 2022-05-23 Nagita Mehrseresht