Related papers: Asynchronous Interaction Aggregation for Action De…

Action Unit Detection with Joint Adaptive Attention and Graph Relation

This paper describes an approach to the facial action unit (AU) detection. In this work, we present our submission to the Field Affective Behavior Analysis (ABAW) 2021 competition. The proposed method uses the pre-trained JAA model as the…

Computer Vision and Pattern Recognition · Computer Science 2021-07-12 Chenggong Zhang , Juan Song , Qingyang Zhang , Weilong Dong , Ruomeng Ding , Zhilei Liu

Video action detection by learning graph-based spatio-temporal interactions

Action Detection is a complex task that aims to detect and classify human actions in video clips. Typically, it has been addressed by processing fine-grained features extracted from a video classification backbone. Recently, thanks to the…

Computer Vision and Pattern Recognition · Computer Science 2021-03-02 Matteo Tomei , Lorenzo Baraldi , Simone Calderara , Simone Bronzin , Rita Cucchiara

Identity-aware Graph Memory Network for Action Detection

Action detection plays an important role in high-level video understanding and media interpretation. Many existing studies fulfill this spatio-temporal localization by modeling the context, capturing the relationship of actors, objects, and…

Computer Vision and Pattern Recognition · Computer Science 2021-08-27 Jingcheng Ni , Jie Qin , Di Huang

Action Recognition with Coarse-to-Fine Deep Feature Integration and Asynchronous Fusion

Action recognition is an important yet challenging task in computer vision. In this paper, we propose a novel deep-based framework for action recognition, which improves the recognition accuracy by: 1) deriving more precise features for…

Computer Vision and Pattern Recognition · Computer Science 2017-11-21 Weiyao Lin , Yang Mi , Jianxin Wu , Ke Lu , Hongkai Xiong

Learning Adaptive Node Selection with External Attention for Human Interaction Recognition

Most GCN-based methods model interacting individuals as independent graphs, neglecting their inherent inter-dependencies. Although recent approaches utilize predefined interaction adjacency matrices to integrate participants, these matrices…

Computer Vision and Pattern Recognition · Computer Science 2025-08-14 Chen Pang , Xuequan Lu , Qianyu Zhou , Lei Lyu

Action Units Recognition Using Improved Pairwise Deep Architecture

Facial Action Units (AUs) represent a set of facial muscular activities and various combinations of AUs can represent a wide range of emotions. AU recognition is often used in many applications, including marketing, healthcare, education,…

Computer Vision and Pattern Recognition · Computer Science 2021-07-09 Junya Saito , Xiaoyu Mi , Akiyoshi Uchida , Sachihiro Youoku , Takahisa Yamamoto , Kentaro Murase , Osafumi Nakayama

Relation Learning and Aggregate-attention for Multi-person Motion Prediction

Multi-person motion prediction is an emerging and intricate task with broad real-world applications. Unlike single person motion prediction, it considers not just the skeleton structures or human trajectories but also the interactions…

Computer Vision and Pattern Recognition · Computer Science 2024-11-07 Kehua Qu , Rui Ding , Jin Tang

Inductive Attention for Video Action Anticipation

Anticipating future actions based on spatiotemporal observations is essential in video understanding and predictive computer vision. Moreover, a model capable of anticipating the future has important applications, it can benefit…

Computer Vision and Pattern Recognition · Computer Science 2023-03-21 Tsung-Ming Tai , Giuseppe Fiameni , Cheng-Kuang Lee , Simon See , Oswald Lanz

A new interval-based aggregation approach based on bagging and Interval Agreement Approach (IAA) in ensemble learning

The main aim in ensemble learning is using multiple individual classifiers outputs rather than one classifier output to aggregate them for more accurate classification. Generating an ensemble classifier generally is composed of three steps:…

Machine Learning · Computer Science 2021-01-26 Mansoureh Maadia , Uwe Aickelin , Hadi Akbarzadeh Khorshidi

On the Overhead of Interference Alignment: Training, Feedback, and Cooperation

Interference alignment (IA) is a cooperative transmission strategy that, under some conditions, achieves the interference channel's maximum number of degrees of freedom. Realizing IA gains, however, is contingent upon providing transmitters…

Information Theory · Computer Science 2013-04-15 Omar El Ayach , Angel Lozano , Robert W. Heath

Learning Asynchronous and Sparse Human-Object Interaction in Videos

Human activities can be learned from video. With effective modeling it is possible to discover not only the action labels but also the temporal structures of the activities such as the progression of the sub-activities. Automatically…

Computer Vision and Pattern Recognition · Computer Science 2021-03-05 Romero Morais , Vuong Le , Svetha Venkatesh , Truyen Tran

Action Anticipation at a Glimpse: To What Extent Can Multimodal Cues Replace Video?

Anticipating actions before they occur is a core challenge in action understanding research. While conventional methods rely on extracting and aggregating temporal information from videos, as humans we can often predict upcoming actions by…

Computer Vision and Pattern Recognition · Computer Science 2025-12-03 Manuel Benavent-Lledo , Konstantinos Bacharidis , Victoria Manousaki , Konstantinos Papoutsakis , Antonis Argyros , Jose Garcia-Rodriguez

Holistic Interaction Transformer Network for Action Detection

Actions are about how we interact with the environment, including other people, objects, and ourselves. In this paper, we propose a novel multi-modal Holistic Interaction Transformer Network (HIT) that leverages the largely ignored, but…

Computer Vision and Pattern Recognition · Computer Science 2022-11-21 Gueter Josmy Faure , Min-Hung Chen , Shang-Hong Lai

Motion Guided Attention Fusion to Recognize Interactions from Videos

We present a dual-pathway approach for recognizing fine-grained interactions from videos. We build on the success of prior dual-stream approaches, but make a distinction between the static and dynamic representations of objects and their…

Computer Vision and Pattern Recognition · Computer Science 2021-04-02 Tae Soo Kim , Jonathan Jones , Gregory D. Hager

Spatial-Temporal Alignment Network for Action Recognition and Detection

This paper studies how to introduce viewpoint-invariant feature representations that can help action recognition and detection. Although we have witnessed great progress of action recognition in the past decade, it remains challenging yet…

Computer Vision and Pattern Recognition · Computer Science 2020-12-07 Junwei Liang , Liangliang Cao , Xuehan Xiong , Ting Yu , Alexander Hauptmann

NUTA: Non-uniform Temporal Aggregation for Action Recognition

In the world of action recognition research, one primary focus has been on how to construct and train networks to model the spatial-temporal volume of an input video. These methods typically uniformly sample a segment of an input clip…

Computer Vision and Pattern Recognition · Computer Science 2020-12-16 Xinyu Li , Chunhui Liu , Bing Shuai , Yi Zhu , Hao Chen , Joseph Tighe

Action Unit Memory Network for Weakly Supervised Temporal Action Localization

Weakly supervised temporal action localization aims to detect and localize actions in untrimmed videos with only video-level labels during training. However, without frame-level annotations, it is challenging to achieve localization…

Computer Vision and Pattern Recognition · Computer Science 2021-04-30 Wang Luo , Tianzhu Zhang , Wenfei Yang , Jingen Liu , Tao Mei , Feng Wu , Yongdong Zhang

Learning to Discriminate Information for Online Action Detection: Analysis and Application

Online action detection, which aims to identify an ongoing action from a streaming video, is an important subject in real-world applications. For this task, previous methods use recurrent neural networks for modeling temporal relations in…

Computer Vision and Pattern Recognition · Computer Science 2022-11-21 Sumin Lee , Hyunjun Eun , Jinyoung Moon , Seokeon Choi , Yoonhyung Kim , Chanho Jung , Changick Kim

Adaptive Ensemble Aggregation for Actor-Critics

Ensembles are ubiquitous in off-policy actor-critic learning, yet their efficacy depends critically on how they are aggregated. Current methods typically rely on static rules or task-specific hyperparameters to balance overestimation bias…

Machine Learning · Computer Science 2026-05-07 Nicklas Werge , Yi-Shan Wu , Manuel Haussmann , Bahareh Tasdighi , Melih Kandemir

Interaction-and-Aggregation Network for Person Re-identification

Person re-identification (reID) benefits greatly from deep convolutional neural networks (CNNs) which learn robust feature embeddings. However, CNNs are inherently limited in modeling the large variations in person pose and scale due to…

Computer Vision and Pattern Recognition · Computer Science 2019-07-22 Ruibing Hou , Bingpeng Ma , Hong Chang , Xinqian Gu , Shiguang Shan , Xilin Chen