English
Related papers

Related papers: Latent Diffusion for Guided Document Table Generat…

200 papers

Generative models such as GANs and diffusion models have demonstrated impressive image generation capabilities. Despite these successes, these systems are surprisingly poor at creating images with hands. We propose a novel training…

Computer Vision and Pattern Recognition · Computer Science 2024-01-29 Yue Yang , Atith N Gandhi , Greg Turk

Data availability remains a critical bottleneck in many deep learning applications. Large-scale datasets are often expensive to collect, curate and annotate, which can limit the scalability and applicability of supervised learning methods.…

Computer Vision and Pattern Recognition · Computer Science 2026-05-28 Nithesh Chandher Karthikeyan , Jonas Unger , Gabriel Eilertsen

Recent advances in computer vision have led to significant progress in the generation of realistic image data, with denoising diffusion probabilistic models proving to be a particularly effective method. In this study, we demonstrate that…

Image and Video Processing · Electrical Eng. & Systems 2023-08-09 Dennis Eschweiler , Rüveyda Yilmaz , Matisse Baumann , Ina Laube , Rijo Roy , Abin Jose , Daniel Brückner , Johannes Stegmaier

Anatomical atlases are widely used for population studies and analysis. Conditional atlases target a specific sub-population defined via certain conditions, such as demographics or pathologies, and allow for the investigation of…

Image and Video Processing · Electrical Eng. & Systems 2025-06-25 Sophie Starck , Vasiliki Sideri-Lampretsa , Bernhard Kainz , Martin J. Menten , Tamara T. Mueller , Daniel Rueckert

Modern diffusion-based image generative models have made significant progress and become promising to enrich training data for the object detection task. However, the generation quality and the controllability for complex scenes containing…

Computer Vision and Pattern Recognition · Computer Science 2024-11-07 Jingyuan Zhu , Shiyu Li , Yuxuan Liu , Ping Huang , Jiulong Shan , Huimin Ma , Jian Yuan

The segmentation of mass lesions in digital breast tomosynthesis (DBT) images is very significant for the early screening of breast cancer. However, the high-density breast tissue often leads to high concealment of the mass lesions, which…

Computer Vision and Pattern Recognition · Computer Science 2026-01-21 Haoxuan Zhang , Wenju Cui , Yuzhu Cao , Tao Tan , Jie Liu , Yunsong Peng , Jian Zheng

Data imputation and data generation have important applications for many domains, like healthcare and finance, where incomplete or missing data can hinder accurate analysis and decision-making. Diffusion models have emerged as powerful…

Machine Learning · Computer Science 2025-06-10 Mario Villaizán-Vallelado , Matteo Salvatori , Carlos Segura , Ioannis Arapakis

Given the inherently costly and time-intensive nature of pixel-level annotation, the generation of synthetic datasets comprising sufficiently diverse synthetic images paired with ground-truth pixel-level annotations has garnered increasing…

Computer Vision and Pattern Recognition · Computer Science 2025-12-16 Haoyu Wang , Lei Zhang , Wenrui Liu , Dengyang Jiang , Wei Wei , Chen Ding

Medical image segmentation models struggle with rare abnormalities due to scarce annotated pathological data. We propose DiffAug a novel framework that combines textguided diffusion-based generation with automatic segmentation validation to…

Computer Vision and Pattern Recognition · Computer Science 2025-08-26 Maham Nazir , Muhammad Aqeel , Francesco Setti

In layout-to-image (L2I) synthesis, controlled complex scenes are generated from coarse information like bounding boxes. Such a task is exciting to many downstream applications because the input layouts offer strong guidance to the…

Computer Vision and Pattern Recognition · Computer Science 2025-03-18 Ruyu Wang , Xuefeng Hou , Sabrina Schmedding , Marco F. Huber

Latent diffusion models (LDMs) dominate high-quality image generation, yet integrating representation learning with generative modeling remains a challenge. We introduce a novel generative image modeling framework that seamlessly bridges…

Computer Vision and Pattern Recognition · Computer Science 2026-01-23 Theodoros Kouzelis , Efstathios Karypidis , Ioannis Kakogeorgiou , Spyros Gidaris , Nikos Komodakis

Recent advances in generative modeling, namely Diffusion models, have revolutionized generative modeling, enabling high-quality image generation tailored to user needs. This paper proposes a framework for the generative design of structural…

In computer vision, it is well-known that a lack of data diversity will impair model performance. In this study, we address the challenges of enhancing the dataset diversity problem in order to benefit various downstream tasks such as…

Computer Vision and Pattern Recognition · Computer Science 2024-08-02 Yuhang Li , Xin Dong , Chen Chen , Weiming Zhuang , Lingjuan Lyu

Current deep networks are very data-hungry and benefit from training on largescale datasets, which are often time-consuming to collect and annotate. By contrast, synthetic data can be generated infinitely using generative models such as…

Computer Vision and Pattern Recognition · Computer Science 2023-10-11 Weijia Wu , Yuzhong Zhao , Hao Chen , Yuchao Gu , Rui Zhao , Yefei He , Hong Zhou , Mike Zheng Shou , Chunhua Shen

Generative models, such as GANs and diffusion models, have been used to augment training sets and boost performances in different tasks. We focus on generative models for cell detection instead, i.e., locating and classifying cells in given…

Computer Vision and Pattern Recognition · Computer Science 2024-09-06 Chen Li , Xiaoling Hu , Shahira Abousamra , Meilong Xu , Chao Chen

Collecting and annotating images with pixel-wise labels is time-consuming and laborious. In contrast, synthetic data can be freely available using a generative model (e.g., DALL-E, Stable Diffusion). In this paper, we show that it is…

Computer Vision and Pattern Recognition · Computer Science 2024-01-23 Weijia Wu , Yuzhong Zhao , Mike Zheng Shou , Hong Zhou , Chunhua Shen

Layout-to-image generation refers to the task of synthesizing photo-realistic images based on semantic layouts. In this paper, we propose LayoutDiffuse that adapts a foundational diffusion model pretrained on large-scale image or text-image…

Computer Vision and Pattern Recognition · Computer Science 2023-02-20 Jiaxin Cheng , Xiao Liang , Xingjian Shi , Tong He , Tianjun Xiao , Mu Li

Instance segmentation datasets play a crucial role in training accurate and robust computer vision models. However, obtaining accurate mask annotations to produce high-quality segmentation datasets is a costly and labor-intensive process.…

Computer Vision and Pattern Recognition · Computer Science 2024-02-27 Markus Pobitzer , Filip Janicki , Mattia Rigotti , Cristiano Malossi

Accurate single cell detection in brightfield microscopy is crucial for biological research, yet data scarcity and annotation bottlenecks limit the progress of deep learning methods. We investigate the use of unconditional models to…

Computer Vision and Pattern Recognition · Computer Science 2025-12-02 Mario de Jesus da Graca , Jörg Dahlkemper , Peer Stelldinger

We introduce a framework for joint grounded scene graph - image generation, a challenging task involving high-dimensional, multi-modal structured data. To effectively model this complex joint distribution, we adopt a factorized approach:…

Computer Vision and Pattern Recognition · Computer Science 2025-08-05 Bicheng Xu , Qi Yan , Renjie Liao , Lele Wang , Leonid Sigal
‹ Prev 1 2 3 10 Next ›