Related papers: Guided Trajectory Optimization with Sparse Scaling…

Trust-Region Noise Search for Black-Box Alignment of Diffusion and Flow Models

Optimizing the noise samples of diffusion and flow models is an increasingly popular approach to align these models to target rewards at inference time. However, we observe that these approaches are usually restricted to differentiable or…

Machine Learning · Computer Science 2026-03-17 Niklas Schweiger , Daniel Cremers , Karnik Ram

Test-Time Scaling of Diffusion Models via Noise Trajectory Search

The iterative and stochastic nature of diffusion models enables test-time scaling, whereby spending additional compute during denoising generates higher-fidelity samples. Increasing the number of denoising steps is the primary scaling axis,…

Machine Learning · Computer Science 2025-09-09 Vignav Ramesh , Morteza Mardani

Noise Hypernetworks: Amortizing Test-Time Compute in Diffusion Models

The new paradigm of test-time scaling has yielded remarkable breakthroughs in Large Language Models (LLMs) (e.g. reasoning models) and in generative vision models, allowing models to allocate additional computation during inference to…

Machine Learning · Computer Science 2025-08-14 Luca Eyring , Shyamgopal Karthik , Alexey Dosovitskiy , Nataniel Ruiz , Zeynep Akata

Scaling Image and Video Generation via Test-Time Evolutionary Search

As the marginal cost of scaling computation (data and parameters) during model pre-training continues to increase substantially, test-time scaling (TTS) has emerged as a promising direction for improving generative model performance by…

Computer Vision and Pattern Recognition · Computer Science 2025-05-26 Haoran He , Jiajun Liang , Xintao Wang , Pengfei Wan , Di Zhang , Kun Gai , Ling Pan

Accelerating Diffusion-based Super-Resolution with Dynamic Time-Spatial Sampling

Diffusion models have gained attention for their success in modeling complex distributions, achieving impressive perceptual quality in SR tasks. However, existing diffusion-based SR methods often suffer from high computational costs,…

Computer Vision and Pattern Recognition · Computer Science 2025-05-22 Rui Qin , Qijie Wang , Ming Sun , Haowei Zhu , Chao Zhou , Bin Wang

Diffusion Tree Sampling: Scalable inference-time alignment of diffusion models

Adapting a pretrained diffusion model to new objectives at inference time remains an open problem in generative modeling. Existing steering methods suffer from inaccurate value estimation, especially at high noise levels, which biases…

Machine Learning · Computer Science 2025-06-27 Vineet Jain , Kusha Sareen , Mohammad Pedramfar , Siamak Ravanbakhsh

Rethinking Test Time Scaling for Flow-Matching Generative Models

The performance of text-to-image diffusion models may be improved at test-time by scaling computation to search for a generated image that maximizes a given reward function. While existing trajectory level exploration methods improve the…

Computer Vision and Pattern Recognition · Computer Science 2026-03-23 Qingtao Yu , Changlin Song , Minghao Sun , Zhengyang Yu , Vinay Kumar Verma , Soumya Roy , Sumit Negi , Hongdong Li , Dylan Campbell

Stream-T1: Test-Time Scaling for Streaming Video Generation

While Test-Time Scaling (TTS) offers a promising direction to enhance video generation without the surging costs of training, current test-time video generation methods based on diffusion models suffer from exorbitant candidate exploration…

Computer Vision and Pattern Recognition · Computer Science 2026-05-07 Yijing Tu , Shaojin Wu , Mengqi Huang , Wenchuan Wang , Yuxin Wang , Chunxiao Liu , Zhendong Mao

Scale Where It Matters: Training-Free Localized Scaling for Diffusion Models

Diffusion models have become the dominant paradigm in text-to-image generation, and test-time scaling (TTS) improves sample quality by allocating additional computation at inference. Existing TTS methods, however, resample the entire image,…

Computer Vision and Pattern Recognition · Computer Science 2026-03-17 Qin Ren , Yufei Wang , Lanqing Guo , Wen Zhang , Zhiwen Fan , Chenyu You

FR-TTS: Test-Time Scaling for NTP-based Image Generation with Effective Filling-based Reward Signal

Test-time scaling (TTS) has become a prevalent technique in image generation, significantly boosting output quality by expanding the number of parallel samples and filtering them using pre-trained reward models. However, applying this…

Computer Vision and Pattern Recognition · Computer Science 2025-12-02 Hang Xu , Linjiang Huang , Feng Zhao

LatSearch: Latent Reward-Guided Search for Faster Inference-Time Scaling in Video Diffusion

The recent success of inference-time scaling in large language models has inspired similar explorations in video diffusion. In particular, motivated by the existence of "golden noise" that enhances video quality, prior work has attempted to…

Computer Vision and Pattern Recognition · Computer Science 2026-03-17 Zengqun Zhao , Ziquan Liu , Yu Cao , Shaogang Gong , Zhensong Zhang , Jifei Song , Jiankang Deng , Ioannis Patras

Reward-Guided Iterative Refinement in Diffusion Models at Test-Time with Applications to Protein and DNA Design

To fully leverage the capabilities of diffusion models, we are often interested in optimizing downstream reward functions during inference. While numerous algorithms for reward-guided generation have been recently proposed due to their…

Machine Learning · Computer Science 2025-04-18 Masatoshi Uehara , Xingyu Su , Yulai Zhao , Xiner Li , Aviv Regev , Shuiwang Ji , Sergey Levine , Tommaso Biancalani

Diffusion-ES: Gradient-free Planning with Diffusion for Autonomous Driving and Zero-Shot Instruction Following

Diffusion models excel at modeling complex and multimodal trajectory distributions for decision-making and control. Reward-gradient guided denoising has been recently proposed to generate trajectories that maximize both a differentiable…

Machine Learning · Computer Science 2024-07-18 Brian Yang , Huangyuan Su , Nikolaos Gkanatsios , Tsung-Wei Ke , Ayush Jain , Jeff Schneider , Katerina Fragkiadaki

Effective Test-Time Scaling of Discrete Diffusion through Iterative Refinement

Test-time scaling through reward-guided generation remains largely unexplored for discrete diffusion models despite its potential as a promising alternative. In this work, we introduce Iterative Reward-Guided Refinement (IterRef), a novel…

Machine Learning · Computer Science 2025-11-11 Sanghyun Lee , Sunwoo Kim , Seungryong Kim , Jongho Park , Dongmin Park

Test-time scaling of diffusions with flow maps

A common recipe to improve diffusion models at test-time so that samples score highly against a user-specified reward is to introduce the gradient of the reward into the dynamics of the diffusion itself. This procedure is often ill posed,…

Machine Learning · Computer Science 2025-12-01 Amirmojtaba Sabour , Michael S. Albergo , Carles Domingo-Enrich , Nicholas M. Boffi , Sanja Fidler , Karsten Kreis , Eric Vanden-Eijnden

dMLLM-TTS: Self-Verified and Efficient Test-Time Scaling for Diffusion Multi-Modal Large Language Models

Diffusion Multi-modal Large Language Models (dMLLMs) have recently emerged as a novel architecture unifying image generation and understanding. However, developing effective and efficient Test-Time Scaling (TTS) methods to unlock their full…

Computer Vision and Pattern Recognition · Computer Science 2026-04-09 Yi Xin , Siqi Luo , Tianxiang Xu , Qi Qin , Haoxing Chen , Kaiwen Zhu , Zhiwei Zhang , Yangfan He , Rongchao Zhang , Jinbin Bai , Shuo Cao , Bin Fu , Junjun He , Yihao Liu , Yuewen Cao , Xiaohong Liu

Highly Efficient Test-Time Scaling for T2I Diffusion Models with Text Embedding Perturbation

Test-time scaling (TTS) aims to achieve better results by increasing random sampling and evaluating samples based on rules and metrics. However, in text-to-image(T2I) diffusion models, most related works focus on search strategies and…

Computer Vision and Pattern Recognition · Computer Science 2025-12-04 Hang Xu , Linjiang Huang , Feng Zhao

Physics-Guided Null-Space Diffusion with Sparse Masking for Corrective Sparse-View CT Reconstruction

Diffusion models have demonstrated remarkable generative capabilities in image processing tasks. We propose a Sparse condition Temporal Rewighted Integrated Distribution Estimation guided diffusion model (STRIDE) for sparse-view CT…

Computer Vision and Pattern Recognition · Computer Science 2025-09-30 Zekun Zhou , Yanru Gong , Liu Shi , Qiegen Liu

DCTTS: Discrete Diffusion Model with Contrastive Learning for Text-to-speech Generation

In the Text-to-speech(TTS) task, the latent diffusion model has excellent fidelity and generalization, but its expensive resource consumption and slow inference speed have always been a challenging. This paper proposes Discrete Diffusion…

Sound · Computer Science 2023-09-14 Zhichao Wu , Qiulin Li , Sixing Liu , Qun Yang

Parallel Test-Time Scaling for Latent Reasoning Models

Parallel test-time scaling (TTS) is a pivotal approach for enhancing large language models (LLMs), typically by sampling multiple token-based chains-of-thought in parallel and aggregating outcomes through voting or search. Recent advances…

Computation and Language · Computer Science 2026-04-21 Runyang You , Yongqi Li , Meng Liu , Wenjie Wang , Liqiang Nie , Wenjie Li