English
Related papers

Related papers: Diffusion models for Handwriting Generation

200 papers

Handwritten Text Generation (HTG) conditioned on text and style is a challenging task due to the variability of inter-user characteristics and the unlimited combinations of characters that form new words unseen during training. Diffusion…

Computer Vision and Pattern Recognition · Computer Science 2024-09-11 Konstantina Nikolaidou , George Retsinas , Giorgos Sfikas , Marcus Liwicki

Existing handwritten text generation methods primarily focus on isolated words. However, realistic handwritten text demands attention not only to individual words but also to the relationships between them, such as vertical alignment and…

Computer Vision and Pattern Recognition · Computer Science 2025-08-06 Gang Dai , Yifan Zhang , Yutao Qin , Qiangya Guo , Shuangping Huang , Shuicheng Yan

Text-to-image generative models can generate high-quality humans, but realism is lost when generating hands. Common artifacts include irregular hand poses, shapes, incorrect numbers of fingers, and physically implausible finger…

Computer Vision and Pattern Recognition · Computer Science 2024-11-26 Supreeth Narasimhaswamy , Uttaran Bhattacharya , Xiang Chen , Ishita Dasgupta , Saayan Mitra , Minh Hoai

Text-to-Image synthesis is the task of generating an image according to a specific text description. Generative Adversarial Networks have been considered the standard method for image synthesis virtually since their introduction. Denoising…

Computer Vision and Pattern Recognition · Computer Science 2023-05-18 Konstantina Nikolaidou , George Retsinas , Vincent Christlein , Mathias Seuret , Giorgos Sfikas , Elisa Barney Smith , Hamam Mokayed , Marcus Liwicki

We provide an overview of the diffusion model as a method to generate new samples. Generative models have been recently adopted for tasks such as art generation (Stable Diffusion, Dall-E) and text generation (ChatGPT). Diffusion models in…

Machine Learning · Statistics 2025-06-13 Justin Le

The imitation of cursive handwriting is mainly limited to generating handwritten words or lines. Multiple synthetic outputs must be stitched together to create paragraphs or whole pages, whereby consistency and layout information are lost.…

Computer Vision and Pattern Recognition · Computer Science 2024-09-04 Martin Mayr , Marcel Dreier , Florian Kordon , Mathias Seuret , Jochen Zöllner , Fei Wu , Andreas Maier , Vincent Christlein

We propose a simple and novel method for generating 3D human motion from complex natural language sentences, which describe different velocity, direction and composition of all kinds of actions. Different from existing methods that use…

Computer Vision and Pattern Recognition · Computer Science 2023-04-17 Zhiyuan Ren , Zhihong Pan , Xin Zhou , Le Kang

Diffusion Models are probabilistic models that create realistic samples by simulating the diffusion process, gradually adding and removing noise from data. These models have gained popularity in domains such as image processing, speech…

Computer Vision and Pattern Recognition · Computer Science 2024-08-21 Md Manjurul Ahsan , Shivakumar Raman , Yingtao Liu , Zahed Siddique

The recent wave of large-scale text-to-image diffusion models has dramatically increased our text-based image generation abilities. These models can generate realistic images for a staggering variety of prompts and exhibit impressive…

Machine Learning · Computer Science 2023-09-14 Alexander C. Li , Mihir Prabhudesai , Shivam Duggal , Ellis Brown , Deepak Pathak

The generation of images of realistic looking, readable handwritten text is a challenging task which is referred to as handwritten text generation (HTG). Given a string and examples from a writer, the goal is to synthesize an image…

Computer Vision and Pattern Recognition · Computer Science 2024-12-23 Kai Brandenbusch

Diffusion models generate high-quality synthetic data. They operate by defining a continuous-time forward process which gradually adds Gaussian noise to data until fully corrupted. The corresponding reverse process progressively "denoises"…

Generative diffusion processes are an emerging and effective tool for image and speech generation. In the existing methods, the underlying noise distribution of the diffusion process is Gaussian noise. However, fitting distributions with…

Signal Processing · Electrical Eng. & Systems 2021-10-13 Eliya Nachmani , Robin San Roman , Lior Wolf

Generative diffusion processes are an emerging and effective tool for image and speech generation. In the existing methods, the underline noise distribution of the diffusion process is Gaussian noise. However, fitting distributions with…

Machine Learning · Computer Science 2021-06-17 Eliya Nachmani , Robin San Roman , Lior Wolf

Handwriting stroke generation is crucial for improving the performance of tasks such as handwriting recognition and writers order recovery. In handwriting stroke generation, it is significantly important to imitate the sample calligraphic…

Computer Vision and Pattern Recognition · Computer Science 2025-09-22 Sidra Hanif , Longin Jan Latecki

In text generation, models that generate text from scratch one token at a time are currently the dominant paradigm. Despite being performant, these models lack the ability to revise existing text, which limits their usability in many…

Computation and Language · Computer Science 2022-11-01 Machel Reid , Vincent J. Hellendoorn , Graham Neubig

Diffusion-based generative models are a design framework that allows generating new images from processes analogous to those found in non-equilibrium thermodynamics. These models model the reversal of a physical diffusion process in which…

Artificial Intelligence · Computer Science 2023-02-21 Jordi de la Torre

Existing handwritten text generation methods often require more than ten handwriting samples as style references. However, in practical applications, users tend to prefer a handwriting generation model that operates with just a single…

Computer Vision and Pattern Recognition · Computer Science 2024-09-12 Gang Dai , Yifan Zhang , Quhui Ke , Qiangya Guo , Shuangping Huang

This paper introduces a novel data-driven strategy for synthesizing gramophone noise audio textures. A diffusion probabilistic model is applied to generate highly realistic quasiperiodic noises. The proposed model is designed to generate…

Audio and Speech Processing · Electrical Eng. & Systems 2022-07-01 Eloi Moliner , Vesa Välimäki

Diffusion models, a family of generative models based on deep learning, have become increasingly prominent in cutting-edge machine learning research. With a distinguished performance in generating samples that resemble the observed data,…

Machine Learning · Computer Science 2023-05-02 Lequan Lin , Zhengkun Li , Ruikun Li , Xuliang Li , Junbin Gao

Diffusion models generate new samples by progressively decreasing the noise from the initially provided random distribution. This inference procedure generally utilizes a trained neural network numerous times to obtain the final output,…

‹ Prev 1 2 3 10 Next ›