English
Related papers

Related papers: Guided Trajectory Optimization with Sparse Scaling…

200 papers

Optimizing the noise samples of diffusion and flow models is an increasingly popular approach to align these models to target rewards at inference time. However, we observe that these approaches are usually restricted to differentiable or…

Machine Learning · Computer Science 2026-03-17 Niklas Schweiger , Daniel Cremers , Karnik Ram

The iterative and stochastic nature of diffusion models enables test-time scaling, whereby spending additional compute during denoising generates higher-fidelity samples. Increasing the number of denoising steps is the primary scaling axis,…

Machine Learning · Computer Science 2025-09-09 Vignav Ramesh , Morteza Mardani

The new paradigm of test-time scaling has yielded remarkable breakthroughs in Large Language Models (LLMs) (e.g. reasoning models) and in generative vision models, allowing models to allocate additional computation during inference to…

Machine Learning · Computer Science 2025-08-14 Luca Eyring , Shyamgopal Karthik , Alexey Dosovitskiy , Nataniel Ruiz , Zeynep Akata

As the marginal cost of scaling computation (data and parameters) during model pre-training continues to increase substantially, test-time scaling (TTS) has emerged as a promising direction for improving generative model performance by…

Computer Vision and Pattern Recognition · Computer Science 2025-05-26 Haoran He , Jiajun Liang , Xintao Wang , Pengfei Wan , Di Zhang , Kun Gai , Ling Pan

Diffusion models have gained attention for their success in modeling complex distributions, achieving impressive perceptual quality in SR tasks. However, existing diffusion-based SR methods often suffer from high computational costs,…

Computer Vision and Pattern Recognition · Computer Science 2025-05-22 Rui Qin , Qijie Wang , Ming Sun , Haowei Zhu , Chao Zhou , Bin Wang

Adapting a pretrained diffusion model to new objectives at inference time remains an open problem in generative modeling. Existing steering methods suffer from inaccurate value estimation, especially at high noise levels, which biases…

Machine Learning · Computer Science 2025-06-27 Vineet Jain , Kusha Sareen , Mohammad Pedramfar , Siamak Ravanbakhsh

The performance of text-to-image diffusion models may be improved at test-time by scaling computation to search for a generated image that maximizes a given reward function. While existing trajectory level exploration methods improve the…

Computer Vision and Pattern Recognition · Computer Science 2026-03-23 Qingtao Yu , Changlin Song , Minghao Sun , Zhengyang Yu , Vinay Kumar Verma , Soumya Roy , Sumit Negi , Hongdong Li , Dylan Campbell

While Test-Time Scaling (TTS) offers a promising direction to enhance video generation without the surging costs of training, current test-time video generation methods based on diffusion models suffer from exorbitant candidate exploration…

Computer Vision and Pattern Recognition · Computer Science 2026-05-07 Yijing Tu , Shaojin Wu , Mengqi Huang , Wenchuan Wang , Yuxin Wang , Chunxiao Liu , Zhendong Mao

Diffusion models have become the dominant paradigm in text-to-image generation, and test-time scaling (TTS) improves sample quality by allocating additional computation at inference. Existing TTS methods, however, resample the entire image,…

Computer Vision and Pattern Recognition · Computer Science 2026-03-17 Qin Ren , Yufei Wang , Lanqing Guo , Wen Zhang , Zhiwen Fan , Chenyu You

Test-time scaling (TTS) has become a prevalent technique in image generation, significantly boosting output quality by expanding the number of parallel samples and filtering them using pre-trained reward models. However, applying this…

Computer Vision and Pattern Recognition · Computer Science 2025-12-02 Hang Xu , Linjiang Huang , Feng Zhao

The recent success of inference-time scaling in large language models has inspired similar explorations in video diffusion. In particular, motivated by the existence of "golden noise" that enhances video quality, prior work has attempted to…

Computer Vision and Pattern Recognition · Computer Science 2026-03-17 Zengqun Zhao , Ziquan Liu , Yu Cao , Shaogang Gong , Zhensong Zhang , Jifei Song , Jiankang Deng , Ioannis Patras

To fully leverage the capabilities of diffusion models, we are often interested in optimizing downstream reward functions during inference. While numerous algorithms for reward-guided generation have been recently proposed due to their…

Machine Learning · Computer Science 2025-04-18 Masatoshi Uehara , Xingyu Su , Yulai Zhao , Xiner Li , Aviv Regev , Shuiwang Ji , Sergey Levine , Tommaso Biancalani

Diffusion models excel at modeling complex and multimodal trajectory distributions for decision-making and control. Reward-gradient guided denoising has been recently proposed to generate trajectories that maximize both a differentiable…

Machine Learning · Computer Science 2024-07-18 Brian Yang , Huangyuan Su , Nikolaos Gkanatsios , Tsung-Wei Ke , Ayush Jain , Jeff Schneider , Katerina Fragkiadaki

Test-time scaling through reward-guided generation remains largely unexplored for discrete diffusion models despite its potential as a promising alternative. In this work, we introduce Iterative Reward-Guided Refinement (IterRef), a novel…

Machine Learning · Computer Science 2025-11-11 Sanghyun Lee , Sunwoo Kim , Seungryong Kim , Jongho Park , Dongmin Park

A common recipe to improve diffusion models at test-time so that samples score highly against a user-specified reward is to introduce the gradient of the reward into the dynamics of the diffusion itself. This procedure is often ill posed,…

Diffusion Multi-modal Large Language Models (dMLLMs) have recently emerged as a novel architecture unifying image generation and understanding. However, developing effective and efficient Test-Time Scaling (TTS) methods to unlock their full…

Computer Vision and Pattern Recognition · Computer Science 2026-04-09 Yi Xin , Siqi Luo , Tianxiang Xu , Qi Qin , Haoxing Chen , Kaiwen Zhu , Zhiwei Zhang , Yangfan He , Rongchao Zhang , Jinbin Bai , Shuo Cao , Bin Fu , Junjun He , Yihao Liu , Yuewen Cao , Xiaohong Liu

Test-time scaling (TTS) aims to achieve better results by increasing random sampling and evaluating samples based on rules and metrics. However, in text-to-image(T2I) diffusion models, most related works focus on search strategies and…

Computer Vision and Pattern Recognition · Computer Science 2025-12-04 Hang Xu , Linjiang Huang , Feng Zhao

Diffusion models have demonstrated remarkable generative capabilities in image processing tasks. We propose a Sparse condition Temporal Rewighted Integrated Distribution Estimation guided diffusion model (STRIDE) for sparse-view CT…

Computer Vision and Pattern Recognition · Computer Science 2025-09-30 Zekun Zhou , Yanru Gong , Liu Shi , Qiegen Liu

In the Text-to-speech(TTS) task, the latent diffusion model has excellent fidelity and generalization, but its expensive resource consumption and slow inference speed have always been a challenging. This paper proposes Discrete Diffusion…

Sound · Computer Science 2023-09-14 Zhichao Wu , Qiulin Li , Sixing Liu , Qun Yang

Parallel test-time scaling (TTS) is a pivotal approach for enhancing large language models (LLMs), typically by sampling multiple token-based chains-of-thought in parallel and aggregating outcomes through voting or search. Recent advances…

Computation and Language · Computer Science 2026-04-21 Runyang You , Yongqi Li , Meng Liu , Wenjie Wang , Liqiang Nie , Wenjie Li
‹ Prev 1 2 3 10 Next ›