Related papers: Multiresolution Match Kernels for Gesture Video Cl…

Robust Photo-Realistic Hand Gesture Generation: from Single View to Multiple View

High-fidelity hand gesture generation represents a significant challenge in human-centric generation tasks. Existing methods typically employ a single-view mesh-rendered image prior to enhancing gesture generation quality. However, the…

Graphics · Computer Science 2025-08-07 Qifan Fu , Xu Chen , Muhammad Asad , Shanxin Yuan , Changjae Oh , Gregory Slabaugh

Motion Fused Frames: Data Level Fusion Strategy for Hand Gesture Recognition

Acquiring spatio-temporal states of an action is the most crucial step for action classification. In this paper, we propose a data level fusion strategy, Motion Fused Frames (MFFs), designed to fuse motion information into static images as…

Computer Vision and Pattern Recognition · Computer Science 2018-04-27 Okan Köpüklü , Neslihan Köse , Gerhard Rigoll

A novel shape matching descriptor for real-time hand gesture recognition

The current state-of-the-art hand gesture recognition methodologies heavily rely in the use of machine learning. However there are scenarios that machine learning cannot be applied successfully, for example in situations where data is…

Computer Vision and Pattern Recognition · Computer Science 2021-03-12 Michalis Lazarou , Bo Li , Tania Stathaki

Multiple Riemannian Manifold-valued Descriptors based Image Set Classification with Multi-Kernel Metric Learning

The importance of wild video based image set recognition is becoming monotonically increasing. However, the contents of these collected videos are often complicated, and how to efficiently perform set modeling and feature extraction is a…

Computer Vision and Pattern Recognition · Computer Science 2019-08-07 Rui Wang , XiaoJun Wu , Josef Kittler

Feature and Region Selection for Visual Learning

Visual learning problems such as object classification and action recognition are typically approached using extensions of the popular bag-of-words (BoW) model. Despite its great success, it is unclear what visual features the BoW model is…

Computer Vision and Pattern Recognition · Computer Science 2016-01-20 Ji Zhao , Liantao Wang , Ricardo Cabral , Fernando De la Torre

Synthetic Video Generation for Robust Hand Gesture Recognition in Augmented Reality Applications

Hand gestures are a natural means of interaction in Augmented Reality and Virtual Reality (AR/VR) applications. Recently, there has been an increased focus on removing the dependence of accurate hand gesture recognition on complex sensor…

Computer Vision and Pattern Recognition · Computer Science 2019-12-09 Varun Jain , Shivam Aggarwal , Suril Mehta , Ramya Hebbalaguppe

Kernel Selection using Multiple Kernel Learning and Domain Adaptation in Reproducing Kernel Hilbert Space, for Face Recognition under Surveillance Scenario

Face Recognition (FR) has been the interest to several researchers over the past few decades due to its passive nature of biometric authentication. Despite high accuracy achieved by face recognition algorithms under controlled conditions,…

Computer Vision and Pattern Recognition · Computer Science 2016-10-05 Samik Banerjee , Sukhendu Das

MVTN: A Multiscale Video Transformer Network for Hand Gesture Recognition

In this paper, we introduce a novel Multiscale Video Transformer Network (MVTN) for dynamic hand gesture recognition, since multiscale features can extract features with variable size, pose, and shape of hand which is a challenge in hand…

Computer Vision and Pattern Recognition · Computer Science 2024-12-24 Mallika Garg , Debashis Ghosh , Pyari Mohan Pradhan

Real-Time Hand Gesture Recognition: Integrating Skeleton-Based Data Fusion and Multi-Stream CNN

Hand Gesture Recognition (HGR) enables intuitive human-computer interactions in various real-world contexts. However, existing frameworks often struggle to meet the real-time requirements essential for practical HGR applications. This study…

Computer Vision and Pattern Recognition · Computer Science 2026-04-17 Oluwaleke Yusuf , Maki Habib , Mohamed Moustafa

Macroblock Classification Method for Video Applications Involving Motions

In this paper, a macroblock classification method is proposed for various video processing applications involving motions. Based on the analysis of the Motion Vector field in the compressed video, we propose to classify Macroblocks of each…

Multimedia · Computer Science 2016-11-17 Weiyao Lin , Ming-Ting Sun , Hongxiang Li , Zhenzhong Chen , Wei Li , Bing Zhou

Multi-Modality Fusion based on Consensus-Voting and 3D Convolution for Isolated Gesture Recognition

Recently, the popularity of depth-sensors such as Kinect has made depth videos easily available while its advantages have not been fully exploited. This paper investigates, for gesture recognition, to explore the spatial and temporal…

Computer Vision and Pattern Recognition · Computer Science 2016-11-29 Jiali Duan , Shuai Zhou , Jun Wan , Xiaoyuan Guo , Stan Z. Li

Multiresolution Kernels

We present in this work a new methodology to design kernels on data which is structured with smaller components, such as text, images or sequences. This methodology is a template procedure which can be applied on most kernels on measures…

Machine Learning · Computer Science 2007-05-23 Marco Cuturi , Kenji Fukumizu

Multi-modal Fusion for Single-Stage Continuous Gesture Recognition

Gesture recognition is a much studied research area which has myriad real-world applications including robotics and human-machine interaction. Current gesture recognition methods have focused on recognising isolated gestures, and existing…

Computer Vision and Pattern Recognition · Computer Science 2021-09-22 Harshala Gammulle , Simon Denman , Sridha Sridharan , Clinton Fookes

MotionBench: Benchmarking and Improving Fine-grained Video Motion Understanding for Vision Language Models

In recent years, vision language models (VLMs) have made significant advancements in video understanding. However, a crucial capability - fine-grained motion comprehension - remains under-explored in current benchmarks. To address this gap,…

Computer Vision and Pattern Recognition · Computer Science 2026-05-13 Wenyi Hong , Yean Cheng , Zhuoyi Yang , Weihan Wang , Lefan Wang , Xiaotao Gu , Shiyu Huang , Yuxiao Dong , Jie Tang

A Deep Learning-based Multimodal Depth-Aware Dynamic Hand Gesture Recognition System

The dynamic hand gesture recognition task has seen studies on various unimodal and multimodal methods. Previously, researchers have explored depth and 2D-skeleton-based multimodal fusion CRNNs (Convolutional Recurrent Neural Networks) but…

Computer Vision and Pattern Recognition · Computer Science 2025-12-23 Hasan Mahmud , Mashrur M. Morshed , Md. Kamrul Hasan

Video-Bench: Human-Aligned Video Generation Benchmark

Video generation assessment is essential for ensuring that generative models produce visually realistic, high-quality videos while aligning with human expectations. Current video generation benchmarks fall into two main categories:…

Computer Vision and Pattern Recognition · Computer Science 2025-04-30 Hui Han , Siyuan Li , Jiaqi Chen , Yiwen Yuan , Yuling Wu , Chak Tou Leong , Hanwen Du , Junchen Fu , Youhua Li , Jie Zhang , Chi Zhang , Li-jia Li , Yongxin Ni

MM-Gesture: Towards Precise Micro-Gesture Recognition through Multimodal Fusion

In this paper, we present MM-Gesture, the solution developed by our team HFUT-VUT, which ranked 1st in the micro-gesture classification track of the 3rd MiGA Challenge at IJCAI 2025, achieving superior performance compared to previous…

Computer Vision and Pattern Recognition · Computer Science 2025-08-06 Jihao Gu , Fei Wang , Kun Li , Yanyan Wei , Zhiliang Wu , Dan Guo

Hand Gesture Recognition with Leap Motion

The recent introduction of depth cameras like Leap Motion Controller allows researchers to exploit the depth information to recognize hand gesture more robustly. This paper proposes a novel hand gesture recognition system with Leap Motion…

Computer Vision and Pattern Recognition · Computer Science 2017-11-15 Youchen Du , Shenglan Liu , Lin Feng , Menghui Chen , Jie Wu

Hand Shape and Gesture Recognition using Multiscale Template Matching, Background Subtraction and Binary Image Analysis

This paper presents a hand shape classification approach employing multiscale template matching. The integration of background subtraction is utilized to derive a binary image of the hand object, enabling the extraction of key features such…

Computer Vision and Pattern Recognition · Computer Science 2024-02-16 Ketan Suhaas Saichandran

Deep Neural Network approaches for Analysing Videos of Music Performances

This paper presents a framework to automate the labelling process for gestures in musical performance videos with a 3D Convolutional Neural Network (CNN). While this idea was proposed in a previous study, this paper introduces several…

Computer Vision and Pattern Recognition · Computer Science 2022-05-25 Foteini Simistira Liwicki , Richa Upadhyay , Prakash Chandra Chhipa , Killian Murphy , Federico Visi , Stefan Östersjö , Marcus Liwicki