English
Related papers

Related papers: RepControlNet: ControlNet Reparameterization

200 papers

ControlNet offers a powerful way to guide diffusion-based generative models, yet most implementations rely on ad-hoc heuristics to choose which network blocks to control-an approach that varies unpredictably with different tasks. To address…

Machine Learning · Computer Science 2025-02-21 Zheng Fang , Lichuan Xiang , Xu Cai , Kaicheng Zhou , Hongkai Wen

Despite the remarkable generation capabilities of Diffusion Models (DMs), conducting training and inference remains computationally expensive. Previous works have been devoted to accelerating diffusion sampling, but achieving data-efficient…

Computer Vision and Pattern Recognition · Computer Science 2024-10-03 Yize Li , Yihua Zhang , Sijia Liu , Xue Lin

Diffusion models have significantly advanced the field of generative modeling. However, training a diffusion model is computationally expensive, creating a pressing need to adapt off-the-shelf diffusion models for downstream generation…

Machine Learning · Computer Science 2024-06-07 Jincheng Zhong , Xingzhuo Guo , Jiaxiang Dong , Mingsheng Long

Recent advances in conditional image generation from diffusion models have shown great potential in achieving impressive image quality while preserving the constraints introduced by the user. In particular, ControlNet enables precise…

Computer Vision and Pattern Recognition · Computer Science 2025-03-13 Hannah Kniesel , Pedro Hermosilla , Timo Ropinski

The Diffusion Transformer plays a pivotal role in advancing text-to-image and text-to-video generation, owing primarily to its inherent scalability. However, existing controlled diffusion transformer methods incur significant parameter and…

Computer Vision and Pattern Recognition · Computer Science 2026-02-27 Ke Cao , Jing Wang , Ao Ma , Jiasong Feng , Xuanhua He , Run Ling , Haowei Liu , Jian Lu , Wei Feng , Haozhe Wang , Hongjuan Pei , Yihua Shao , Zhanjie Zhang , Jie Zhang

Large transformer models have demonstrated remarkable success. Post-training quantization (PTQ), which requires only a small dataset for calibration and avoids end-to-end retraining, is a promising solution for compressing these large…

Machine Learning · Computer Science 2024-02-09 Zhikai Li , Xuewen Liu , Jing Zhang , Qingyi Gu

Discrete diffusion models form a powerful class of generative models across diverse domains, including text and graphs. However, existing approaches face fundamental limitations. Masked diffusion models suffer from irreversible errors due…

Machine Learning · Computer Science 2026-04-21 Marcel Kollovieh , Sirine Ayadi , Stephan Günnemann

Diffusion models have recently gained significant attention in robotics due to their ability to generate multi-modal distributions of system states and behaviors. However, a key challenge remains: ensuring precise control over the generated…

Robotics · Computer Science 2025-10-01 Luobin Wang , Hongzhan Yu , Chenning Yu , Sicun Gao , Henrik Christensen

Material reconstruction from a photograph is a key component of 3D content creation democratization. We propose to formulate this ill-posed problem as a controlled synthesis one, leveraging the recent progress in generative deep networks.…

Computer Vision and Pattern Recognition · Computer Science 2024-08-28 Giuseppe Vecchio , Rosalie Martin , Arthur Roullier , Adrien Kaiser , Romain Rouffet , Valentin Deschaintre , Tamy Boubekeur

The reconstruction of unsteady flow fields from limited measurements is a challenging and crucial task for many engineering applications. Machine learning models are gaining popularity for solving this problem due to their ability to learn…

Fluid Dynamics · Physics 2026-01-09 Marc Amorós-Trepat , Luis Medrano-Navarro , Qiang Liu , Luca Guastoni , Nils Thuerey

Inference-time control of diffusion models aims to steer model outputs to satisfy new constraints without retraining. Previous approaches have mostly relied on heuristic guidance or have been coupled with Sequential Monte Carlo (SMC) for…

Diffusion models have demonstrated their powerful image generation capabilities, effectively fitting highly complex image distributions. These models can serve as strong priors for image restoration. Existing methods often utilize…

Computer Vision and Pattern Recognition · Computer Science 2025-03-03 Hanbang Liang , Zhen Wang , Weihui Deng

Crowd counting remains challenging in variable-density scenes due to scale variations, occlusions, and the high computational cost of existing models. To address these issues, we propose RepSFNet (Reparameterized Single Fusion Network), a…

Computer Vision and Pattern Recognition · Computer Science 2026-02-24 Mas Nurul Achmadiah , Chi-Chia Sun , Wen-Kai Kuo , Jun-Wei Hsieh

Recently, diffusion models like StableDiffusion have achieved impressive image generation results. However, the generation process of such diffusion models is uncontrollable, which makes it hard to generate videos with continuous and…

Computer Vision and Pattern Recognition · Computer Science 2023-08-04 Zhihao Hu , Dong Xu

Diffusion models have demonstrated remarkable and robust abilities in both image and video generation. To achieve greater control over generated results, researchers introduce additional architectures, such as ControlNet, Adapters and…

Computer Vision and Pattern Recognition · Computer Science 2025-03-11 Bohao Peng , Jian Wang , Yuechen Zhang , Wenbo Li , Ming-Chang Yang , Jiaya Jia

Recently, diffusion models have gained significant attention as a novel set of deep learning-based generative methods. These models attempt to sample data from a Gaussian distribution that adheres to a target distribution, and have been…

Image and Video Processing · Electrical Eng. & Systems 2024-02-20 Chenyan Zhang , Yifei Chen , Zhenxiong Fan , Yiyu Huang , Wenchao Weng , Ruiquan Ge , Dong Zeng , Changmiao Wang

Controllable diffusion generation often relies on various heuristics that are seemingly disconnected without a unified understanding. We bridge this gap with Diffusion Controller (DiffCon), a unified control-theoretic view that casts…

Machine Learning · Computer Science 2026-03-10 Tong Yang , Moonkyung Ryu , Chih-Wei Hsu , Guy Tennenholtz , Yuejie Chi , Craig Boutilier , Bo Dai

Diffusion models are powerful generative models that achieve state-of-the-art performance in image synthesis. However, training them demands substantial amounts of data and computational resources. Continual learning would allow for…

Machine Learning · Computer Science 2025-03-05 Sergi Masip , Pau Rodriguez , Tinne Tuytelaars , Gido M. van de Ven

Diffusion models have emerged as a formidable tool for training-free conditional generation.However, a key hurdle in inference-time guidance techniques is the need for compute-heavy backpropagation through the diffusion network for…

Computer Vision and Pattern Recognition · Computer Science 2024-06-05 Nithin Gopalakrishnan Nair , Vishal M Patel

Previously, non-autoregressive models were widely perceived as being superior in generation efficiency but inferior in generation quality due to the difficulties of modeling multiple target modalities. To enhance the multi-modality modeling…

Computation and Language · Computer Science 2023-11-30 Lihua Qian , Mingxuan Wang , Yang Liu , Hao Zhou
‹ Prev 1 2 3 10 Next ›