Related papers: DiffEyeSyn: Diffusion-based User-specific Eye Move…

DiffEye: Diffusion-Based Continuous Eye-Tracking Data Generation Conditioned on Natural Images

Numerous models have been developed for scanpath and saliency prediction, which are typically trained on scanpaths, which model eye movement as a sequence of discrete fixation points connected by saccades, while the rich information…

Computer Vision and Pattern Recognition · Computer Science 2025-10-10 Ozgur Kara , Harris Nisar , James M. Rehg

DiffGaze: A Diffusion Model for Continuous Gaze Sequence Generation on 360{\deg} Images

We present DiffGaze, a novel method for generating realistic and diverse continuous human gaze sequences on 360{\deg} images based on a conditional score-based denoising diffusion model. Generating human gaze on 360{\deg} images is…

Computer Vision and Pattern Recognition · Computer Science 2024-03-27 Chuhan Jiao , Yao Wang , Guanhua Zhang , Mihai Bâce , Zhiming Hu , Andreas Bulling

Privatization of Synthetic Gaze: Attenuating State Signatures in Diffusion-Generated Eye Movements

The recent success of deep learning (DL) has enabled the generation of high-quality synthetic gaze data. However, such data also raises privacy concerns because gaze sequences can encode subjects' internal states, like fatigue, emotional…

Human-Computer Interaction · Computer Science 2026-01-30 Kamrul Hasan , Oleg V. Komogortsev

Quantitative and Qualitative Comparison of Generative Models for Subject-Specific Gaze Synthesis: Diffusion vs GAN

Recent advances in deep learning demonstrate the ability to generate synthetic gaze data. However, most approaches have primarily focused on generating data from random noise distributions or global, predefined latent embeddings, whereas…

Human-Computer Interaction · Computer Science 2025-11-14 Kamrul Hasan , Dmytro Katrychuk , Mehedi Hasan Raju , Oleg V. Komogortsev

DiffInject: Revisiting Debias via Synthetic Data Generation using Diffusion-based Style Injection

Dataset bias is a significant challenge in machine learning, where specific attributes, such as texture or color of the images are unintentionally learned resulting in detrimental performance. To address this, previous efforts have focused…

Computer Vision and Pattern Recognition · Computer Science 2024-06-11 Donggeun Ko , Sangwoo Jo , Dongjun Lee , Namjun Park , Jaekwang Kim

Diffusion Recommender Model

Generative models such as Generative Adversarial Networks (GANs) and Variational Auto-Encoders (VAEs) are widely utilized to model the generative process of user interactions. However, these generative models suffer from intrinsic…

Information Retrieval · Computer Science 2025-06-26 Wenjie Wang , Yiyan Xu , Fuli Feng , Xinyu Lin , Xiangnan He , Tat-Seng Chua

Data Augmentation for Seizure Prediction with Generative Diffusion Model

Data augmentation (DA) can significantly strengthen the electroencephalogram (EEG)-based seizure prediction methods. However, existing DA approaches are just the linear transformations of original data and cannot explore the feature space…

Signal Processing · Electrical Eng. & Systems 2024-12-10 Kai Shu , Le Wu , Yuchang Zhao , Aiping Liu , Ruobing Qian , Xun Chen

GazeFusion: Saliency-Guided Image Generation

Diffusion models offer unprecedented image generation power given just a text prompt. While emerging approaches for controlling diffusion models have enabled users to specify the desired spatial layouts of the generated content, they cannot…

Computer Vision and Pattern Recognition · Computer Science 2025-02-18 Yunxiang Zhang , Nan Wu , Connor Z. Lin , Gordon Wetzstein , Qi Sun

Consistent View Synthesis with Pose-Guided Diffusion Models

Novel view synthesis from a single image has been a cornerstone problem for many Virtual Reality applications that provide immersive experiences. However, most existing techniques can only synthesize novel views within a limited range of…

Computer Vision and Pattern Recognition · Computer Science 2023-03-31 Hung-Yu Tseng , Qinbo Li , Changil Kim , Suhib Alsisan , Jia-Bin Huang , Johannes Kopf

FUSION: Full-Body Unified Motion Prior for Body and Hands via Diffusion

Hands are central to interacting with our surroundings and conveying gestures, making their inclusion essential for full-body motion synthesis. Despite this, existing human motion synthesis methods fall short: some ignore hand motions…

Computer Vision and Pattern Recognition · Computer Science 2026-01-08 Enes Duran , Nikos Athanasiou , Muhammed Kocabas , Michael J. Black , Omid Taheri

MultiDiffSense: Diffusion-Based Multi-Modal Visuo-Tactile Image Generation Conditioned on Object Shape and Contact Pose

Acquiring aligned visuo-tactile datasets is slow and costly, requiring specialised hardware and large-scale data collection. Synthetic generation is promising, but prior methods are typically single-modality, limiting cross-modal learning.…

Computer Vision and Pattern Recognition · Computer Science 2026-02-24 Sirine Bhouri , Lan Wei , Jian-Qing Zheng , Dandan Zhang

DiffMotion: Speech-Driven Gesture Synthesis Using Denoising Diffusion Model

Speech-driven gesture synthesis is a field of growing interest in virtual human creation. However, a critical challenge is the inherent intricate one-to-many mapping between speech and gestures. Previous studies have explored and achieved…

Graphics · Computer Science 2023-02-03 Fan Zhang , Naye Ji , Fuxing Gao , Yongping Li

Diffusion-based Synthetic Data Generation for Visible-Infrared Person Re-Identification

The performance of models is intricately linked to the abundance of training data. In Visible-Infrared person Re-IDentification (VI-ReID) tasks, collecting and annotating large-scale images of each individual under various cameras and…

Computer Vision and Pattern Recognition · Computer Science 2025-03-18 Wenbo Dai , Lijing Lu , Zhihang Li

DiffusionSeg: Adapting Diffusion Towards Unsupervised Object Discovery

Learning from a large corpus of data, pre-trained models have achieved impressive progress nowadays. As popular generative pre-training, diffusion models capture both low-level visual knowledge and high-level semantic relations. In this…

Computer Vision and Pattern Recognition · Computer Science 2023-03-20 Chaofan Ma , Yuhuan Yang , Chen Ju , Fei Zhang , Jinxiang Liu , Yu Wang , Ya Zhang , Yanfeng Wang

Consistent Human Image and Video Generation with Spatially Conditioned Diffusion

Consistent human-centric image and video synthesis aims to generate images or videos with new poses while preserving appearance consistency with a given reference image, which is crucial for low-cost visual content creation. Recent advances…

Computer Vision and Pattern Recognition · Computer Science 2024-12-20 Mingdeng Cao , Chong Mou , Ziyang Yuan , Xintao Wang , Zhaoyang Zhang , Ying Shan , Yinqiang Zheng

Enhancing Eye Movement Biometrics for User Authentication via Continuous Gaze Offset Score Fusion

Eye movement biometrics (EMB) use subject-specific gaze dynamics for user authentication and identification. Recent deep learning-based EMB systems achieve strong performance by modeling temporal eye movement behavior. However, these…

Human-Computer Interaction · Computer Science 2026-05-11 Hashim Aziz , Mehedi Hasan Raju , Oleg V. Komogortsev

Improving Few-Shot User-Specific Gaze Adaptation via Gaze Redirection Synthesis

As an indicator of human attention gaze is a subtle behavioral cue which can be exploited in many applications. However, inferring 3D gaze direction is challenging even for deep neural networks given the lack of large amount of data…

Computer Vision and Pattern Recognition · Computer Science 2019-04-25 Yu Yu , Gang Liu , Jean-Marc Odobez

GazeMoDiff: Gaze-guided Diffusion Model for Stochastic Human Motion Prediction

Human motion prediction is important for many virtual and augmented reality (VR/AR) applications such as collision avoidance and realistic avatar generation. Existing methods have synthesised body motion only from observed past motion,…

Computer Vision and Pattern Recognition · Computer Science 2024-10-23 Haodong Yan , Zhiming Hu , Syn Schmitt , Andreas Bulling

DiffSynth: Latent In-Iteration Deflickering for Realistic Video Synthesis

In recent years, diffusion models have emerged as the most powerful approach in image synthesis. However, applying these models directly to video synthesis presents challenges, as it often leads to noticeable flickering contents. Although…

Computer Vision and Pattern Recognition · Computer Science 2023-08-11 Zhongjie Duan , Lizhou You , Chengyu Wang , Cen Chen , Ziheng Wu , Weining Qian , Jun Huang

Data Synthesis with Diverse Styles for Face Recognition via 3DMM-Guided Diffusion

Identity-preserving face synthesis aims to generate synthetic face images of virtual subjects that can substitute real-world data for training face recognition models. While prior arts strive to create images with consistent identities and…

Computer Vision and Pattern Recognition · Computer Science 2025-04-02 Yuxi Mi , Zhizhou Zhong , Yuge Huang , Qiuyang Yuan , Xuan Zhao , Jianqing Xu , Shouhong Ding , ShaoMing Wang , Rizen Guo , Shuigeng Zhou