Related papers: Gradient-Guided Importance Sampling for Learning B…

Energy-Inspired Models: Learning with Sampler-Induced Distributions

Energy-based models (EBMs) are powerful probabilistic models, but suffer from intractable sampling and density evaluation due to the partition function. As a result, inference in EBMs relies on approximate sampling algorithms, leading to a…

Machine Learning · Computer Science 2020-01-10 Dieterich Lawson , George Tucker , Bo Dai , Rajesh Ranganath

Moment Matching Denoising Gibbs Sampling

Energy-Based Models (EBMs) offer a versatile framework for modeling complex data distributions. However, training and sampling from EBMs continue to pose significant challenges. The widely-used Denoising Score Matching (DSM) method for…

Machine Learning · Statistics 2024-03-20 Mingtian Zhang , Alex Hawkins-Hooker , Brooks Paige , David Barber

Learning Proposals for Practical Energy-Based Regression

Energy-based models (EBMs) have experienced a resurgence within machine learning in recent years, including as a promising alternative for probabilistic regression. However, energy-based regression requires a proposal distribution to be…

Machine Learning · Computer Science 2023-11-08 Fredrik K. Gustafsson , Martin Danelljan , Thomas B. Schön

Learning Equivariant Energy Based Models with Equivariant Stein Variational Gradient Descent

We focus on the problem of efficient sampling and learning of probability densities by incorporating symmetries in probabilistic models. We first introduce Equivariant Stein Variational Gradient Descent algorithm -- an equivariant sampling…

Machine Learning · Computer Science 2021-07-30 Priyank Jaini , Lars Holdijk , Max Welling

Energy-Based Modelling for Discrete and Mixed Data via Heat Equations on Structured Spaces

Energy-based models (EBMs) offer a flexible framework for probabilistic modelling across various data domains. However, training EBMs on data in discrete or mixed state spaces poses significant challenges due to the lack of robust and fast…

Machine Learning · Statistics 2024-12-03 Tobias Schröder , Zijing Ou , Yingzhen Li , Andrew B. Duncan

Gradient-based Adaptive Importance Samplers

Importance sampling (IS) is a powerful Monte Carlo methodology for the approximation of intractable integrals, very often involving a target probability density function. The performance of IS heavily depends on the appropriate selection of…

Computation · Statistics 2023-06-22 Víctor Elvira , Emilie Chouzenoux , Ömer Deniz Akyildiz , Luca Martino

Minimizing Energy Costs in Deep Learning Model Training: The Gaussian Sampling Approach

Computing the loss gradient via backpropagation consumes considerable energy during deep learning (DL) model training. In this paper, we propose a novel approach to efficiently compute DL models' gradients to mitigate the substantial energy…

Computer Vision and Pattern Recognition · Computer Science 2024-06-12 Challapalli Phanindra Revanth , Sumohana S. Channappayya , C Krishna Mohan

Balanced Training of Energy-Based Models with Adaptive Flow Sampling

Energy-based models (EBMs) are versatile density estimation models that directly parameterize an unnormalized log density. Although very flexible, EBMs lack a specified normalization constant of the model, making the likelihood of the model…

Machine Learning · Computer Science 2024-02-20 Louis Grenioux , Éric Moulines , Marylou Gabrié

GRASS: Gradient-based Adaptive Layer-wise Importance Sampling for Memory-efficient Large Language Model Fine-tuning

Full-parameter fine-tuning of large language models is constrained by substantial GPU memory requirements. Low-rank adaptation methods mitigate this challenge by updating only a subset of parameters. However, these approaches often limit…

Computation and Language · Computer Science 2026-04-10 Kaiyuan Tian , Yu Tang , Gongqingjian Jiang , Baihui Liu , Yifu Gao , Xialin Su , Linbo Qiao , Dongsheng Li

Biased Importance Sampling for Deep Neural Network Training

Importance sampling has been successfully used to accelerate stochastic optimization in many convex problems. However, the lack of an efficient way to calculate the importance still hinders its application to Deep Learning. In this paper,…

Machine Learning · Computer Science 2017-09-14 Angelos Katharopoulos , François Fleuret

Learning Energy-Based Models in High-Dimensional Spaces with Multi-scale Denoising Score Matching

Energy-Based Models (EBMs) assign unnormalized log-probability to data samples. This functionality has a variety of applications, such as sample synthesis, data denoising, sample restoration, outlier detection, Bayesian reasoning, and many…

Machine Learning · Statistics 2019-12-23 Zengyi Li , Yubei Chen , Friedrich T. Sommer

Online Importance Sampling for Stochastic Gradient Optimization

Machine learning optimization often depends on stochastic gradient descent, where the precision of gradient estimation is vital for model performance. Gradients are calculated from mini-batches formed by uniformly selecting data samples…

Machine Learning · Computer Science 2025-01-29 Corentin Salaün , Xingchang Huang , Iliyan Georgiev , Niloy J. Mitra , Gurprit Singh

Dual Training of Energy-Based Models with Overparametrized Shallow Neural Networks

Energy-based models (EBMs) are generative models that are usually trained via maximum likelihood estimation. This approach becomes challenging in generic situations where the trained energy is non-convex, due to the need to sample the Gibbs…

Machine Learning · Computer Science 2022-02-16 Carles Domingo-Enrich , Alberto Bietti , Marylou Gabrié , Joan Bruna , Eric Vanden-Eijnden

Learning Energy-based Model via Dual-MCMC Teaching

This paper studies the fundamental learning problem of the energy-based model (EBM). Learning the EBM can be achieved using the maximum likelihood estimation (MLE), which typically involves the Markov Chain Monte Carlo (MCMC) sampling, such…

Machine Learning · Computer Science 2023-12-06 Jiali Cui , Tian Han

Joint Learning of Energy-based Models and their Partition Function

Energy-based models (EBMs) offer a flexible framework for parameterizing probability distributions using neural networks. However, learning EBMs by exact maximum likelihood estimation (MLE) is generally intractable, due to the need to…

Machine Learning · Computer Science 2025-08-20 Michael E. Sander , Vincent Roulet , Tianlin Liu , Mathieu Blondel

Energy-based generator matching: A neural sampler for general state space

We propose Energy-based generator matching (EGM), a modality-agnostic approach to train generative models from energy functions in the absence of data. Extending the recently proposed generator matching, EGM enables training of arbitrary…

Machine Learning · Computer Science 2025-11-20 Dongyeop Woo , Minsu Kim , Minkyu Kim , Kiyoung Seong , Sungsoo Ahn

Distributional simplicity bias and effective convexity in Energy Based Models

Energy-based learning is a powerful framework for generative modelling, but its training is inherently non-convex, leading potentially to sensitivity to initialisation, poor local optima, and unstable gradient dynamics. We present a…

Machine Learning · Computer Science 2026-05-11 Aurélien Decelle , Alfonso de Jesús Navas Gómez , Beatriz Seoane

Multiple Importance Sampling for Stochastic Gradient Estimation

We introduce a theoretical and practical framework for efficient importance sampling of mini-batch samples for gradient estimation from single and multiple probability distributions. To handle noisy gradients, our framework dynamically…

Machine Learning · Computer Science 2025-01-29 Corentin Salaün , Xingchang Huang , Iliyan Georgiev , Niloy J. Mitra , Gurprit Singh

EBJR: Energy-Based Joint Reasoning for Adaptive Inference

State-of-the-art deep learning models have achieved significant performance levels on various benchmarks. However, the excellent performance comes at a cost of inefficient computational cost. Light-weight architectures, on the other hand,…

Computer Vision and Pattern Recognition · Computer Science 2021-10-22 Mohammad Akbari , Amin Banitalebi-Dehkordi , Yong Zhang

Distributional Reinforcement Learning for Energy-Based Sequential Models

Global Autoregressive Models (GAMs) are a recent proposal [Parshakova et al., CoNLL 2019] for exploiting global properties of sequences for data-efficient learning of seq2seq models. In the first phase of training, an Energy-Based model…

Machine Learning · Computer Science 2019-12-19 Tetiana Parshakova , Jean-Marc Andreoli , Marc Dymetman