Related papers: Action Sets: Weakly Supervised Action Segmentation…

Spatio-Temporal Action Localization in a Weakly Supervised Setting

Enabling computational systems with the ability to localize actions in video-based content has manifold applications. Traditionally, such a problem is approached in a fully-supervised setting where video-clips with complete frame-by-frame…

Computer Vision and Pattern Recognition · Computer Science · 2019-05-07 · Kurt Degiorgio, Fabio Cuzzolin

Temporal Action Segmentation from Timestamp Supervision

Temporal action segmentation approaches have been very successful recently. However, annotating videos with frame-wise labels to train such models is very expensive and time consuming. While weakly supervised methods trained using only…

Computer Vision and Pattern Recognition · Computer Science · 2021-03-29 · Zhe Li, Yazan Abu Farha, Juergen Gall

SCT: Set Constrained Temporal Transformer for Set Supervised Action Segmentation

Temporal action segmentation is a topic of increasing interest, however, annotating each frame in a video is cumbersome and costly. Weakly supervised approaches therefore aim at learning temporal action segmentation from videos that are…

Computer Vision and Pattern Recognition · Computer Science · 2020-04-01 · Mohsen Fayyaz, Juergen Gall

Robust Action Segmentation from Timestamp Supervision

Action segmentation is the task of predicting an action label for each frame of an untrimmed video. As obtaining annotations to train an approach for action segmentation in a fully supervised way is expensive, various approaches have been…

Computer Vision and Pattern Recognition · Computer Science · 2022-10-14 · Yaser Souri, Yazan Abu Farha, Emad Bahrami, Gianpiero Francesca, Juergen Gall

Distill and Collect for Semi-Supervised Temporal Action Segmentation

Recent temporal action segmentation approaches need frame annotations during training to be effective. These annotations are very expensive and time-consuming to obtain. This limits their performances when only limited annotated data is…

Computer Vision and Pattern Recognition · Computer Science · 2022-11-04 · Sovan Biswas, Anthony Rhodes, Ramesh Manuvinakurike, Giuseppe Raffa, Richard Beckwith

Weakly Supervised Action Labeling in Videos Under Ordering Constraints

We are given a set of video clips, each one annotated with an ordered list of actions, such as "walk" then "sit" then "answer phone" extracted from, for example, the associated text script. We seek to temporally localize the…

Computer Vision and Pattern Recognition · Computer Science · 2014-07-07 · Piotr Bojanowski, Rémi Lajugie, Francis Bach, Ivan Laptev, Jean Ponce, Cordelia Schmid, Josef Sivic

Pose-Aware Weakly-Supervised Action Segmentation

Understanding human behavior is an important problem in the pursuit of visual intelligence. A challenge in this endeavor is the extensive and costly effort required to accurately label action segments. To address this issue, we consider…

Computer Vision and Pattern Recognition · Computer Science · 2025-04-09 · Seth Z. Zhao, Reza Ghoddoosian, Isht Dwivedi, Nakul Agarwal, Behzad Dariush

Step-by-step Erasion, One-by-one Collection: A Weakly Supervised Temporal Action Detector

Weakly supervised temporal action detection is a Herculean task in understanding untrimmed videos, since no supervisory signal except the video-level category label is available on training data. Under the supervision of category labels,…

Computer Vision and Pattern Recognition · Computer Science · 2018-07-19 · Jia-Xing Zhong, Nannan Li, Weijie Kong, Tao Zhang, Thomas H. Li, Ge Li

Weakly Supervised Temporal Action Localization with Segment-Level Labels

Temporal action localization presents a trade-off between test performance and annotation-time cost. Fully supervised methods achieve good performance with time-consuming boundary annotations. Weakly supervised methods with cheaper…

Computer Vision and Pattern Recognition · Computer Science · 2020-07-06 · Xinpeng Ding, Nannan Wang, Xinbo Gao, Jie Li, Xiaoyu Wang, Tongliang Liu

Weakly supervised learning of actions from transcripts

We present an approach for weakly supervised learning of human actions from video transcriptions. Our system is based on the idea that, given a sequence of input data and a transcript, i.e. a list of the order the actions occur in the…

Computer Vision and Pattern Recognition · Computer Science · 2017-06-20 · Hilde Kuehne, Alexander Richard, Juergen Gall

Weakly Supervised Semantic Segmentation using Web-Crawled Videos

We propose a novel algorithm for weakly supervised semantic segmentation based on image-level class labels only. In weakly supervised setting, it is commonly observed that trained model overly focuses on discriminative parts rather than the…

Computer Vision and Pattern Recognition · Computer Science · 2018-01-09 · Seunghoon Hong, Donghun Yeo, Suha Kwak, Honglak Lee, Bohyung Han

Unsupervised Action Segmentation for Instructional Videos

In this paper we address the problem of automatically discovering atomic actions in unsupervised manner from instructional videos, which are rarely annotated with atomic actions. We present an unsupervised approach to learn atomic actions…

Computer Vision and Pattern Recognition · Computer Science · 2021-06-08 · AJ Piergiovanni, Anelia Angelova, Michael S. Ryoo, Irfan Essa

Transformers in Action: Weakly Supervised Action Segmentation

The video action segmentation task is regularly explored under weaker forms of supervision, such as transcript supervision, where a list of actions is easier to obtain than dense frame-wise labels. In this formulation, the task presents…

Computer Vision and Pattern Recognition · Computer Science · 2022-01-24 · John Ridley, Huseyin Coskun, David Joseph Tan, Nassir Navab, Federico Tombari

Action Recognition from Single Timestamp Supervision in Untrimmed Videos

Recognising actions in videos relies on labelled supervision during training, typically the start and end times of each action instance. This supervision is not only subjective, but also expensive to acquire. Weak video-level supervision…

Computer Vision and Pattern Recognition · Computer Science · 2019-04-10 · Davide Moltisanti, Sanja Fidler, Dima Damen

On Evaluating Weakly Supervised Action Segmentation Methods

Action segmentation is the task of temporally segmenting every frame of an untrimmed video. Weakly supervised approaches to action segmentation, especially from transcripts have been of considerable interest to the computer vision…

Computer Vision and Pattern Recognition · Computer Science · 2021-10-22 · Yaser Souri, Alexander Richard, Luca Minciullo, Juergen Gall

A Hybrid RNN-HMM Approach for Weakly Supervised Temporal Action Segmentation

Action recognition has become a rapidly developing research field within the last decade. But with the increasing demand for large scale data, the need of hand annotated data for the training becomes more and more impractical. One way to…

Computer Vision and Pattern Recognition · Computer Science · 2019-06-05 · Hilde Kuehne, Alexander Richard, Juergen Gall

Weakly-Supervised Temporal Action Detection for Fine-Grained Videos with Hierarchical Atomic Actions

Action understanding has evolved into the era of fine granularity, as most human behaviors in real life have only minor differences. To detect these fine-grained actions accurately in a label-efficient way, we tackle the problem of…

Computer Vision and Pattern Recognition · Computer Science · 2022-07-26 · Zhi Li, Lu He, Huijuan Xu

Temporally-Weighted Hierarchical Clustering for Unsupervised Action Segmentation

Action segmentation refers to inferring boundaries of semantically consistent visual concepts in videos and is an important requirement for many video understanding tasks. For this and other video understanding tasks, supervised approaches…

Computer Vision and Pattern Recognition · Computer Science · 2021-03-30 · M. Saquib Sarfraz, Naila Murray, Vivek Sharma, Ali Diba, Luc Van Gool, Rainer Stiefelhagen

Learning to Segment Actions from Observation and Narration

We apply a generative segmental model of task structure, guided by narration, to action segmentation in video. We focus on unsupervised and weakly-supervised settings where no action labels are known during training. Despite its simplicity,…

Computation and Language · Computer Science · 2020-08-13 · Daniel Fried, Jean-Baptiste Alayrac, Phil Blunsom, Chris Dyer, Stephen Clark, Aida Nematzadeh

Weakly Supervised Temporal Action Localization Using Deep Metric Learning

Temporal action localization is an important step towards video understanding. Most current action localization methods depend on untrimmed videos with full temporal annotations of action instances. However, it is expensive and…

Computer Vision and Pattern Recognition · Computer Science · 2020-01-23 · Ashraful Islam, Richard J. Radke