Related papers: Visual Recognition by Counting Instances: A Multi-…

Visual Semantic Role Labeling

In this paper we introduce the problem of Visual Semantic Role Labeling: given an image we want to detect people doing actions and localize the objects of interaction. Classical approaches to action recognition either study the task of…

Computer Vision and Pattern Recognition · Computer Science 2015-05-19 Saurabh Gupta, Jitendra Malik

Multi-Granularity Reasoning for Social Relation Recognition from Images

Discovering social relations in images can make machines better interpret the behavior of human beings. However, automatically recognizing social relations in images is a challenging task due to the significant gap between the domains of…

Computer Vision and Pattern Recognition · Computer Science 2019-01-11 Meng Zhang, Xinchen Liu, Wu Liu, Anfu Zhou, Huadong Ma, Tao Mei

Curriculum Learning of Visual Attribute Clusters for Multi-Task Classification

Visual attributes, from simple objects (e.g., backpacks, hats) to soft-biometrics (e.g., gender, height, clothing) have proven to be a powerful representational approach for many applications such as image description and human…

Computer Vision and Pattern Recognition · Computer Science 2018-07-11 Nikolaos Sarafianos, Theodore Giannakopoulos, Christophoros Nikou, Ioannis A. Kakadiaris

Context-aware Video Anomaly Detection in Long-Term Datasets

Video anomaly detection research is generally evaluated on short, isolated benchmark videos only a few minutes long. However, in real-world environments, security cameras observe the same scene for months or years at a time, and the notion…

Computer Vision and Pattern Recognition · Computer Science 2024-04-12 Zhengye Yang, Richard Radke

From Recognition to Cognition: Visual Commonsense Reasoning

Visual understanding goes well beyond object recognition. With one glance at an image, we can effortlessly imagine the world beyond the pixels: for instance, we can infer people's actions, goals, and mental states. While this task is easy…

Computer Vision and Pattern Recognition · Computer Science 2019-03-27 Rowan Zellers, Yonatan Bisk, Ali Farhadi, Yejin Choi

Counting Everyday Objects in Everyday Scenes

We are interested in counting the number of instances of object classes in natural, everyday images. Previous counting approaches tackle the problem in restricted domains such as counting pedestrians in surveillance videos. Counts can also…

Computer Vision and Pattern Recognition · Computer Science 2017-05-10 Prithvijit Chattopadhyay, Ramakrishna Vedantam, Ramprasaath R. Selvaraju, Dhruv Batra, Devi Parikh

Fast Low-parameter Video Activity Localization in Collaborative Learning Environments

Research on video activity detection has primarily focused on identifying well-defined human activities in short video segments. The majority of the research on video activity recognition is focused on the development of large parameter…

Computer Vision and Pattern Recognition · Computer Science 2024-03-19 Venkatesh Jatla, Sravani Teeparthi, Ugesh Egala, Sylvia Celedon Pattichis, Marios S. Pattichis

Recognizing Video Events with Varying Rhythms

Recognizing video events in long, complex videos with multiple sub-activities has received persistent attention recently. This task is more challenging than traditional action recognition with short, relatively homogeneous video clips. In…

Computer Vision and Pattern Recognition · Computer Science 2020-01-16 Yikang Li, Tianshu Yu, Baoxin Li

Learning to Forecast Videos of Human Activity with Multi-granularity Models and Adaptive Rendering

We propose an approach for forecasting video of complex human activity involving multiple people. Direct pixel-level prediction is too simple to handle the appearance variability in complex activities. Hence, we develop novel intermediate…

Computer Vision and Pattern Recognition · Computer Science 2017-12-07 Mengyao Zhai, Jiacheng Chen, Ruizhi Deng, Lei Chen, Ligeng Zhu, Greg Mori

Improved Actor Relation Graph based Group Activity Recognition

Video understanding is to recognize and classify different actions or activities appearing in the video. A lot of previous work, such as video captioning, has shown promising performance in producing general video understanding. However, it…

Computer Vision and Pattern Recognition · Computer Science 2023-11-23 Zijian Kuang, Xinran Tie

Reliable Shot Identification for Complex Event Detection via Visual-Semantic Embedding

Multimedia event detection is the task of detecting a specific event of interest in a user-generated video on websites. The most fundamental challenge facing this task lies in the enormously varying quality of the video as well as the…

Computer Vision and Pattern Recognition · Computer Science 2021-10-18 Minnan Luo, Xiaojun Chang, Chen Gong

Human-like Relational Models for Activity Recognition in Video

Video activity recognition by deep neural networks is impressive for many classes. However, it falls short of human performance, especially for activities that are challenging to discriminate. Humans differentiate these complex activities by…

Computer Vision and Pattern Recognition · Computer Science 2022-01-12 Joseph Chrol-Cannon, Andrew Gilbert, Ranko Lazic, Adithya Madhusoodanan, Frank Guerin

Future Aspects in Human Action Recognition: Exploring Emerging Techniques and Ethical Influences

Visual-based human action recognition can be found in various application fields, e.g., surveillance systems, sports analytics, medical assistive technologies, or human-robot interaction frameworks, and it concerns the identification and…

Computer Vision and Pattern Recognition · Computer Science 2024-12-23 Antonios Gasteratos, Stavros N. Moutsis, Konstantinos A. Tsintotas, Yiannis Aloimonos

Activity Recognition Using A Combination of Category Components And Local Models for Video Surveillance

This paper presents a novel approach for automatic recognition of human activities for video surveillance applications. We propose to represent an activity by a combination of category components, and demonstrate that this approach offers…

Computer Vision and Pattern Recognition · Computer Science 2015-03-03 Weiyao Lin, Ming-Ting Sun, Radha Poovendran, Zhengyou Zhang

Automatic Interaction and Activity Recognition from Videos of Human Manual Demonstrations with Application to Anomaly Detection

This paper presents a new method to describe spatio-temporal relations between objects and hands, to recognize both interactions and activities within video demonstrations of manual tasks. The approach exploits Scene Graphs to extract key…

Computer Vision and Pattern Recognition · Computer Science 2023-07-10 Elena Merlo, Marta Lagomarsino, Edoardo Lamon, Arash Ajoudani

Active Learning for Online Recognition of Human Activities from Streaming Videos

Recognising human activities from streaming videos poses unique challenges to learning algorithms: predictive models need to be scalable, incrementally trainable, and must remain bounded in size even when the data stream is arbitrarily…

Machine Learning · Statistics 2016-10-06 Rocco De Rosa, Ilaria Gori, Fabio Cuzzolin, Barbara Caputo, Nicolò Cesa-Bianchi

Taskology: Utilizing Task Relations at Scale

Many computer vision tasks address the problem of scene understanding and are naturally interrelated, e.g., object classification, detection, scene segmentation, depth estimation, etc. We show that we can leverage the inherent relationships…

Computer Vision and Pattern Recognition · Computer Science 2021-03-18 Yao Lu, Sören Pirk, Jan Dlabal, Anthony Brohan, Ankita Pasad, Zhao Chen, Vincent Casser, Anelia Angelova, Ariel Gordon

Attend and Interact: Higher-Order Object Interactions for Video Understanding

Human actions often involve complex interactions across several inter-related objects in the scene. However, existing approaches to fine-grained video understanding or visual relationship detection often rely on single object representation…

Computer Vision and Pattern Recognition · Computer Science 2018-03-22 Chih-Yao Ma, Asim Kadav, Iain Melvin, Zsolt Kira, Ghassan AlRegib, Hans Peter Graf

Utilizing Dynamic Properties of Sharing Bits and Registers to Estimate User Cardinalities over Time

Online monitoring of user cardinalities (or degrees) in graph streams is fundamental for many applications. For example, in a bipartite graph representing user-website visiting activities, user cardinalities (the number of distinct visited…

Data Structures and Algorithms · Computer Science 2018-11-27 Pinghui Wang, Peng Jia, Xiangliang Zhang, Jing Tao, Xiaohong Guan, Don Towsley

Discourse Parsing in Videos: A Multi-modal Approach

Text-level discourse parsing aims to unmask how two sentences in the text are related to each other. We propose the task of Visual Discourse Parsing, which requires understanding discourse relations among scenes in a video. Here we use the…

Computer Vision and Pattern Recognition · Computer Science 2022-01-25 Arjun R. Akula, Song-Chun Zhu