Related papers: BEND: Bagging Deep Learning Training Based on Effi…

TREAD: Token Routing for Efficient Architecture-agnostic Diffusion Training

Diffusion models have emerged as the mainstream approach for visual generation. However, these models typically suffer from sample inefficiency and high training costs. Consequently, methods for efficient finetuning, inference and…

Computer Vision and Pattern Recognition · Computer Science 2025-10-14 Felix Krause, Timy Phan, Ming Gui, Stefan Andreas Baumann, Vincent Tao Hu, Björn Ommer

Diffusion Models Beat GANs on Image Classification

While many unsupervised learning models focus on one family of tasks, either generative or discriminative, we explore the possibility of a unified representation learner: a model which uses a single pre-training stage to address both…

Computer Vision and Pattern Recognition · Computer Science 2023-07-18 Soumik Mukhopadhyay, Matthew Gwilliam, Vatsal Agarwal, Namitha Padmanabhan, Archana Swaminathan, Srinidhi Hegde, Tianyi Zhou, Abhinav Shrivastava

BADiff: Bandwidth Adaptive Diffusion Model

In this work, we propose a novel framework to enable diffusion models to adapt their generation quality based on real-time network bandwidth constraints. Traditional diffusion models produce high-fidelity images by performing a fixed number…

Computer Vision and Pattern Recognition · Computer Science 2026-04-10 Xi Zhang, Hanwei Zhu, Yan Zhong, Jiamang Wang, Weisi Lin

DiffSG: A Generative Solver for Network Optimization with Diffusion Model

Generative diffusion models, famous for their performance in image generation, are popular in various cross-domain applications. However, their use in the communication community has been mostly limited to auxiliary tasks like data modeling…

Networking and Internet Architecture · Computer Science 2025-03-11 Ruihuai Liang, Bo Yang, Zhiwen Yu, Bin Guo, Xuelin Cao, Mérouane Debbah, H. Vincent Poor, Chau Yuen

Bagging-Based Model Merging for Robust General Text Embeddings

General-purpose text embedding models underpin a wide range of NLP and information retrieval applications, and are typically trained on large-scale multi-task corpora to encourage broad generalization. However, it remains unclear how…

Information Retrieval · Computer Science 2026-02-10 Hengran Zhang, Keping Bi, Jiafeng Guo, Jiaming Zhang, Wenbo Yang, Daiting Shi, Xueqi Cheng

Training Recurrent Neural Networks by Diffusion

This work presents a new algorithm for training recurrent neural networks (although ideas are applicable to feedforward networks as well). The algorithm is derived from a theory in nonconvex optimization related to the diffusion equation.…

Machine Learning · Computer Science 2016-02-08 Hossein Mobahi

Evolutionary bagging for ensemble learning

Ensemble learning has gained success in machine learning with major advantages over other learning methods. Bagging is a prominent ensemble learning method that creates subgroups of data, known as bags, that are trained by individual…

Neural and Evolutionary Computing · Computer Science 2022-09-07 Giang Ngo, Rodney Beard, Rohitash Chandra

MetaBags: Bagged Meta-Decision Trees for Regression

Ensembles are popular methods for solving practical supervised learning problems. They reduce the risk of having underperforming models in production-grade software. Although critical, methods for learning heterogeneous regression ensembles…

Machine Learning · Computer Science 2018-04-18 Jihed Khiari, Luis Moreira-Matias, Ammar Shaker, Bernard Zenko, Saso Dzeroski

BEDS: Bagging ensemble deep segmentation for nucleus segmentation with testing stage stain augmentation

Reducing outcome variance is an essential task in deep learning based medical image analysis. Bootstrap aggregating, also known as bagging, is a canonical ensemble algorithm for aggregating weak learners to become a strong learner. Random…

Computer Vision and Pattern Recognition · Computer Science 2021-02-19 Xing Li, Haichun Yang, Jiaxin He, Aadarsh Jha, Agnes B. Fogo, Lee E. Wheless, Shilin Zhao, Yuankai Huo

FINED: Fast Inference Network for Edge Detection

In this paper, we address the design of lightweight deep learning-based edge detection. The deep learning technology offers a significant improvement on the edge detection accuracy. However, typical neural network designs have very high…

Computer Vision and Pattern Recognition · Computer Science 2020-12-16 Jan Kristanto Wibisono, Hsueh-Ming Hang

Diffusion-Based Neural Network Weights Generation

Transfer learning has gained significant attention in recent deep learning research due to its ability to accelerate convergence and enhance performance on new tasks. However, its success is often contingent on the similarity between source…

Machine Learning · Computer Science 2024-10-28 Bedionita Soro, Bruno Andreis, Hayeon Lee, Wonyong Jeong, Song Chong, Frank Hutter, Sung Ju Hwang

Padding Module: Learning the Padding in Deep Neural Networks

During the last decades, many studies have been dedicated to improving the performance of neural networks, for example, the network architectures, initialization, and activation. However, investigating the importance and effects of…

Computer Vision and Pattern Recognition · Computer Science 2023-01-12 Fahad Alrasheedi, Xin Zhong, Pei-Chi Huang

Generative Edge Detection with Stable Diffusion

Edge detection is typically viewed as a pixel-level classification problem mainly addressed by discriminative methods. Recently, generative edge detection methods, especially diffusion model based solutions, are initialized in the edge…

Computer Vision and Pattern Recognition · Computer Science 2024-10-07 Caixia Zhou, Yaping Huang, Mochu Xiang, Jiahui Ren, Haibin Ling, Jing Zhang

SEGEN: Sample-Ensemble Genetic Evolutional Network Model

Deep learning, a rebranding of deep neural network research works, has achieved a remarkable success in recent years. With multiple hidden layers, deep learning models aim at computing the hierarchical feature representations of the…

Neural and Evolutionary Computing · Computer Science 2018-06-06 Jiawei Zhang, Limeng Cui, Fisher B. Gouza

Diffusion As Self-Distillation: End-to-End Latent Diffusion In One Model

Standard Latent Diffusion Models rely on a complex, three-part architecture consisting of a separate encoder, decoder, and diffusion network, which are trained in multiple stages. This modular design is computationally inefficient, leads to…

Computer Vision and Pattern Recognition · Computer Science 2025-11-19 Xiyuan Wang, Muhan Zhang

Neural Network Diffusion

Diffusion models have achieved remarkable success in image and video generation. In this work, we demonstrate that diffusion models can also generate high-performing neural network parameters. Our approach is simple, utilizing an…

Machine Learning · Computer Science 2025-01-03 Kai Wang, Dongwen Tang, Boya Zeng, Yida Yin, Zhaopan Xu, Yukun Zhou, Zelin Zang, Trevor Darrell, Zhuang Liu, Yang You

Deeply-Fused Nets

In this paper, we present a novel deep learning approach, deeply-fused nets. The central idea of our approach is deep fusion, i.e., combine the intermediate representations of base networks, where the fused output serves as the input of the…

Computer Vision and Pattern Recognition · Computer Science 2016-05-26 Jingdong Wang, Zhen Wei, Ting Zhang, Wenjun Zeng

Pruning then Reweighting: Towards Data-Efficient Training of Diffusion Models

Despite the remarkable generation capabilities of Diffusion Models (DMs), conducting training and inference remains computationally expensive. Previous works have been devoted to accelerating diffusion sampling, but achieving data-efficient…

Computer Vision and Pattern Recognition · Computer Science 2024-10-03 Yize Li, Yihua Zhang, Sijia Liu, Xue Lin

Deep Batch Active Learning by Diverse, Uncertain Gradient Lower Bounds

We design a new algorithm for batch active learning with deep neural network models. Our algorithm, Batch Active learning by Diverse Gradient Embeddings (BADGE), samples groups of points that are disparate and high-magnitude when…

Machine Learning · Computer Science 2020-02-25 Jordan T. Ash, Chicheng Zhang, Akshay Krishnamurthy, John Langford, Alekh Agarwal

On the Feature Learning in Diffusion Models

The predominant success of diffusion models in generative modeling has spurred significant interest in understanding their theoretical foundations. In this work, we propose a feature learning framework aimed at analyzing and comparing the…

Machine Learning · Statistics 2025-03-04 Andi Han, Wei Huang, Yuan Cao, Difan Zou