Related papers: Generating Driving Scenes with Diffusion

Scenario Diffusion: Controllable Driving Scenario Generation With Diffusion

Automated creation of synthetic traffic scenarios is a key part of validating the safety of autonomous vehicles (AVs). In this paper, we propose Scenario Diffusion, a novel diffusion-based architecture for generating traffic scenarios that…

Machine Learning · Computer Science 2023-11-20 Ethan Pronovost , Meghana Reddy Ganesina , Noureldin Hendy , Zeyu Wang , Andres Morales , Kai Wang , Nicholas Roy

TSDiT: Traffic Scene Diffusion Models With Transformers

In this paper, we introduce a novel approach to trajectory generation for autonomous driving, combining the strengths of Diffusion models and Transformers. First, we use the historical trajectory data for efficient preprocessing and…

Robotics · Computer Science 2024-05-07 Chen Yang , Tianyu Shi

DragTraffic: Interactive and Controllable Traffic Scene Generation for Autonomous Driving

Evaluating and training autonomous driving systems require diverse and scalable corner cases. However, most existing scene generation methods lack controllability, accuracy, and versatility, resulting in unsatisfactory generation results.…

Robotics · Computer Science 2024-10-11 Sheng Wang , Ge Sun , Fulong Ma , Tianshuai Hu , Qiang Qin , Yongkang Song , Lei Zhu , Junwei Liang

Rolling Ahead Diffusion for Traffic Scene Simulation

Realistic driving simulation requires that NPCs not only mimic natural driving behaviors but also react to the behavior of other simulated agents. Recent developments in diffusion-based scenario generation focus on creating diverse and…

Machine Learning · Computer Science 2025-02-14 Yunpeng Liu , Matthew Niedoba , William Harvey , Adam Scibior , Berend Zwartsenberg , Frank Wood

SceneDM: Scene-level Multi-agent Trajectory Generation with Consistent Diffusion Models

Realistic scene-level multi-agent motion simulations are crucial for developing and evaluating self-driving algorithms. However, most existing works focus on generating trajectories for a certain single agent type, and typically ignore the…

Robotics · Computer Science 2023-11-28 Zhiming Guo , Xing Gao , Jianlan Zhou , Xinyu Cai , Botian Shi

Brush Your Text: Synthesize Any Scene Text on Images via Diffusion Model

Recently, diffusion-based image generation methods are credited for their remarkable text-to-image generation capabilities, while still facing challenges in accurately generating multilingual scene text images. To tackle this problem, we…

Computer Vision and Pattern Recognition · Computer Science 2023-12-20 Lingjun Zhang , Xinyuan Chen , Yaohui Wang , Yue Lu , Yu Qiao

A Diffusion-Model of Joint Interactive Navigation

Simulation of autonomous vehicle systems requires that simulated traffic participants exhibit diverse and realistic behaviors. The use of prerecorded real-world traffic scenarios in simulation ensures realism but the rarity of safety…

Machine Learning · Computer Science 2023-10-26 Matthew Niedoba , Jonathan Wilder Lavington , Yunpeng Liu , Vasileios Lioutas , Justice Sefas , Xiaoxuan Liang , Dylan Green , Setareh Dabiri , Berend Zwartsenberg , Adam Scibior , Frank Wood

Diffusion-Based Environment-Aware Trajectory Prediction

The ability to predict the future trajectories of traffic participants is crucial for the safe and efficient operation of autonomous vehicles. In this paper, a diffusion-based generative model for multi-agent trajectory prediction is…

Computer Vision and Pattern Recognition · Computer Science 2024-03-19 Theodor Westny , Björn Olofsson , Erik Frisk

WcDT: World-centric Diffusion Transformer for Traffic Scene Generation

In this paper, we introduce a novel approach for autonomous driving trajectory generation by harnessing the complementary strengths of diffusion probabilistic models (a.k.a., diffusion models) and transformers. Our proposed framework,…

Computer Vision and Pattern Recognition · Computer Science 2025-03-11 Chen Yang , Yangfan He , Aaron Xuxiang Tian , Dong Chen , Jianhui Wang , Tianyu Shi , Arsalan Heydarian , Pei Liu

Controllable Latent Diffusion for Traffic Simulation

The validation of autonomous driving systems benefits greatly from the ability to generate scenarios that are both realistic and precisely controllable. Conventional approaches, such as real-world test drives, are not only expensive but…

Robotics · Computer Science 2025-04-01 Yizhuo Xiao , Mustafa Suphi Erden , Cheng Wang

Enhancing Scene Text Detectors with Realistic Text Image Synthesis Using Diffusion Models

Scene text detection techniques have garnered significant attention due to their wide-ranging applications. However, existing methods have a high demand for training data, and obtaining accurate human annotations is labor-intensive and…

Computer Vision and Pattern Recognition · Computer Science 2023-11-29 Ling Fu , Zijie Wu , Yingying Zhu , Yuliang Liu , Xiang Bai

Move Anything with Layered Scene Diffusion

Diffusion models generate images with an unprecedented level of quality, but how can we freely rearrange image layouts? Recent works generate controllable scenes via learning spatially disentangled latent codes, but these methods do not…

Computer Vision and Pattern Recognition · Computer Science 2024-04-11 Jiawei Ren , Mengmeng Xu , Jui-Chieh Wu , Ziwei Liu , Tao Xiang , Antoine Toisoul

Sketch-Guided Scene Image Generation

Text-to-image models are showcasing the impressive ability to create high-quality and diverse generative images. Nevertheless, the transition from freehand sketches to complex scene images remains challenging using diffusion models. In this…

Computer Vision and Pattern Recognition · Computer Science 2024-07-10 Tianyu Zhang , Xiaoxuan Xie , Xusheng Du , Haoran Xie

X-Scene: Large-Scale Driving Scene Generation with High Fidelity and Flexible Controllability

Diffusion models are advancing autonomous driving by enabling realistic data synthesis, predictive end-to-end planning, and closed-loop simulation, with a primary focus on temporally consistent generation. However, large-scale 3D scene…

Computer Vision and Pattern Recognition · Computer Science 2025-12-09 Yu Yang , Alan Liang , Jianbiao Mei , Yukai Ma , Yong Liu , Gim Hee Lee

SceneGen: Learning to Generate Realistic Traffic Scenes

We consider the problem of generating realistic traffic scenes automatically. Existing methods typically insert actors into the scene according to a set of hand-crafted heuristics and are limited in their ability to model the true…

Computer Vision and Pattern Recognition · Computer Science 2021-01-19 Shuhan Tan , Kelvin Wong , Shenlong Wang , Sivabalan Manivasagam , Mengye Ren , Raquel Urtasun

Layout Agnostic Scene Text Image Synthesis with Diffusion Models

While diffusion models have significantly advanced the quality of image generation their capability to accurately and coherently render text within these images remains a substantial challenge. Conventional diffusion-based methods for scene…

Computer Vision and Pattern Recognition · Computer Science 2024-09-17 Qilong Zhangli , Jindong Jiang , Di Liu , Licheng Yu , Xiaoliang Dai , Ankit Ramchandani , Guan Pang , Dimitris N. Metaxas , Praveen Krishnan

Diffusion Probabilistic Models for Scene-Scale 3D Categorical Data

In this paper, we learn a diffusion model to generate 3D data on a scene-scale. Specifically, our model crafts a 3D scene consisting of multiple objects, while recent diffusion research has focused on a single object. To realize our goal,…

Computer Vision and Pattern Recognition · Computer Science 2023-01-03 Jumin Lee , Woobin Im , Sebin Lee , Sung-Eui Yoon

Compositional 3D Scene Generation using Locally Conditioned Diffusion

Designing complex 3D scenes has been a tedious, manual process requiring domain expertise. Emerging text-to-3D generative models show great promise for making this task more intuitive, but existing approaches are limited to object-level…

Computer Vision and Pattern Recognition · Computer Science 2023-03-24 Ryan Po , Gordon Wetzstein

Human-Aware 3D Scene Generation with Spatially-constrained Diffusion Models

Generating 3D scenes from human motion sequences supports numerous applications, including virtual reality and architectural design. However, previous auto-regression-based human-aware 3D scene generation methods have struggled to…

Computer Vision and Pattern Recognition · Computer Science 2024-08-21 Xiaolin Hong , Hongwei Yi , Fazhi He , Qiong Cao

Context Diffusion: In-Context Aware Image Generation

We propose Context Diffusion, a diffusion-based framework that enables image generation models to learn from visual examples presented in context. Recent work tackles such in-context learning for image generation, where a query image is…

Computer Vision and Pattern Recognition · Computer Science 2025-07-24 Ivona Najdenkoska , Animesh Sinha , Abhimanyu Dubey , Dhruv Mahajan , Vignesh Ramanathan , Filip Radenovic