Related papers: Image Generation with a Sphere Encoder

Efficient Image Synthesis with Sphere Latent Encoder

Few-step image generation has seen rapid progress, with consistency and meanflow-based methods significantly reducing the number of sampling steps. Despite their low inference cost, these approaches often suffer from training instability…

Computer Vision and Pattern Recognition · Computer Science 2026-05-18 Tung Do , Thuan Hoang Nguyen , Hao Li

Spherical Image Generation from a Single Normal Field of View Image by Considering Scene Symmetry

Spherical images taken in all directions (360 degrees) allow representing the surroundings of the subject and the space itself, providing an immersive experience to the viewers. Generating a spherical image from a single…

Computer Vision and Pattern Recognition · Computer Science 2020-01-10 Takayuki Hara , Tatsuya Harada

A Method for Training-free Person Image Picture Generation

The current state-of-the-art Diffusion model has demonstrated excellent results in generating images. However, the images are monotonous and are mostly the result of the distribution of images of people in the training set, making it…

Computer Vision and Pattern Recognition · Computer Science 2023-05-18 Tianyu Chen

Identity Encoder for Personalized Diffusion

Many applications can benefit from personalized image generation models, including image enhancement, video conferences, just to name a few. Existing works achieved personalization by fine-tuning one model for each person. While being…

Computer Vision and Pattern Recognition · Computer Science 2023-04-18 Yu-Chuan Su , Kelvin C. K. Chan , Yandong Li , Yang Zhao , Han Zhang , Boqing Gong , Huisheng Wang , Xuhui Jia

Transforming Image Generation from Scene Graphs

Generating images from semantic visual knowledge is a challenging task, that can be useful to condition the synthesis process in complex, subtle, and unambiguous ways, compared to alternatives such as class labels or text descriptions.…

Computer Vision and Pattern Recognition · Computer Science 2022-07-04 Renato Sortino , Simone Palazzo , Concetto Spampinato

Sketch-Guided Scene Image Generation

Text-to-image models are showcasing the impressive ability to create high-quality and diverse generative images. Nevertheless, the transition from freehand sketches to complex scene images remains challenging using diffusion models. In this…

Computer Vision and Pattern Recognition · Computer Science 2024-07-10 Tianyu Zhang , Xiaoxuan Xie , Xusheng Du , Haoran Xie

Emergence of Object Segmentation in Perturbed Generative Models

We introduce a novel framework to build a model that can learn how to segment objects from a collection of images without any human annotation. Our method builds on the observation that the location of object segments can be perturbed…

Computer Vision and Pattern Recognition · Computer Science 2019-11-05 Adam Bielski , Paolo Favaro

SPGen: Spherical Projection as Consistent and Flexible Representation for Single Image 3D Shape Generation

Existing single-view 3D generative models typically adopt multiview diffusion priors to reconstruct object surfaces, yet they remain prone to inter-view inconsistencies and are unable to faithfully represent complex internal structure or…

Computer Vision and Pattern Recognition · Computer Science 2025-09-17 Jingdong Zhang , Weikai Chen , Yuan Liu , Jionghao Wang , Zhengming Yu , Zhuowen Shen , Bo Yang , Wenping Wang , Xin Li

Transformer-based Image Generation from Scene Graphs

Graph-structured scene descriptions can be efficiently used in generative models to control the composition of the generated image. Previous approaches are based on the combination of graph convolutional networks and adversarial methods for…

Computer Vision and Pattern Recognition · Computer Science 2023-03-09 Renato Sortino , Simone Palazzo , Concetto Spampinato

Accelerating Diffusion Decoders via Multi-Scale Sampling and One-Step Distillation

Image tokenization plays a central role in modern generative modeling by mapping visual inputs into compact representations that serve as an intermediate signal between pixels and generative models. Diffusion-based decoders have recently…

Computer Vision and Pattern Recognition · Computer Science 2026-03-23 Chuhan Wang , Hao Chen

A Layer-Based Sequential Framework for Scene Generation with GANs

The visual world we sense, interpret and interact everyday is a complex composition of interleaved physical entities. Therefore, it is a very challenging task to generate vivid scenes of similar complexity using computers. In this work, we…

Computer Vision and Pattern Recognition · Computer Science 2019-02-05 Mehmet Ozgur Turkoglu , William Thong , Luuk Spreeuwers , Berkay Kicanaoglu

Optical Diffusion Models for Image Generation

Diffusion models generate new samples by progressively decreasing the noise from the initially provided random distribution. This inference procedure generally utilizes a trained neural network numerous times to obtain the final output,…

Optics · Physics 2024-11-01 Ilker Oguz , Niyazi Ulas Dinc , Mustafa Yildirim , Junjie Ke , Innfarn Yoo , Qifei Wang , Feng Yang , Christophe Moser , Demetri Psaltis

SphereDiffusion: Spherical Geometry-Aware Distortion Resilient Diffusion Model

Controllable spherical panoramic image generation holds substantial applicative potential across a variety of domains.However, it remains a challenging task due to the inherent spherical distortion and geometry characteristics, resulting in…

Computer Vision and Pattern Recognition · Computer Science 2024-03-18 Tao Wu , Xuewei Li , Zhongang Qi , Di Hu , Xintao Wang , Ying Shan , Xi Li

Content-Aware Preserving Image Generation

Remarkable progress has been achieved in image generation with the introduction of generative models. However, precisely controlling the content in generated images remains a challenging task due to their fundamental training objective.…

Computer Vision and Pattern Recognition · Computer Science 2024-11-26 Giang H. Le , Anh Q. Nguyen , Byeongkeun Kang , Yeejin Lee

High-Fidelity Medical Shape Generation via Skeletal Latent Diffusion

Anatomy shape modeling is a fundamental problem in medical data analysis. However, the geometric complexity and topological variability of anatomical structures pose significant challenges to accurate anatomical shape generation. In this…

Computer Vision and Pattern Recognition · Computer Science 2026-03-13 Guoqing Zhang , Jingyun Yang , Siqi Chen , Anping Zhang , Yang Li

Efficient Camera-Controlled Video Generation of Static Scenes via Sparse Diffusion and 3D Rendering

Modern video generative models based on diffusion models can produce very realistic clips, but they are computationally inefficient, often requiring minutes of GPU time for just a few seconds of video. This inefficiency poses a critical…

Computer Vision and Pattern Recognition · Computer Science 2026-01-15 Jieying Chen , Jeffrey Hu , Joan Lasenby , Ayush Tewari

Towards Spatially Consistent Image Generation: On Incorporating Intrinsic Scene Properties into Diffusion Models

Image generation models trained on large datasets can synthesize high-quality images but often produce spatially inconsistent and distorted images due to limited information about the underlying structures and spatial layouts. In this work,…

Computer Vision and Pattern Recognition · Computer Science 2025-11-27 Hyundo Lee , Suhyung Choi , Inwoo Hwang , Byoung-Tak Zhang

Faster Diffusion: Rethinking the Role of the Encoder for Diffusion Model Inference

One of the main drawback of diffusion models is the slow inference time for image generation. Among the most successful approaches to addressing this problem are distillation methods. However, these methods require considerable…

Computer Vision and Pattern Recognition · Computer Science 2024-10-16 Senmao Li , Taihang Hu , Joost van de Weijer , Fahad Shahbaz Khan , Tao Liu , Linxuan Li , Shiqi Yang , Yaxing Wang , Ming-Ming Cheng , Jian Yang

Single-step Diffusion for Image Compression at Ultra-Low Bitrates

Although there have been significant advancements in image compression techniques, such as standard and learned codecs, these methods still suffer from severe quality degradation at extremely low bits per pixel. While recent diffusion-based…

Image and Video Processing · Electrical Eng. & Systems 2025-09-23 Chanung Park , Joo Chan Lee , Jong Hwan Ko

Spherical Geometry Diffusion: Generating High-quality 3D Face Geometry via Sphere-anchored Representations

A fundamental challenge in text-to-3D face generation is achieving high-quality geometry. The core difficulty lies in the arbitrary and intricate distribution of vertices in 3D space, making it challenging for existing models to establish…

Computer Vision and Pattern Recognition · Computer Science 2026-01-21 Junyi Zhang , Yiming Wang , Yunhong Lu , Qichao Wang , Wenzhe Qian , Xiaoyin Xu , David Gu , Min Zhang