Related papers: Gradient-Guided Importance Sampling for Learning B…

200 papers

Energy-based models (EBMs) are powerful probabilistic models, but suffer from intractable sampling and density evaluation due to the partition function. As a result, inference in EBMs relies on approximate sampling algorithms, leading to a…

Machine Learning · Computer Science 2020-01-10 Dieterich Lawson, George Tucker, Bo Dai, Rajesh Ranganath

Energy-Based Models (EBMs) offer a versatile framework for modeling complex data distributions. However, training and sampling from EBMs continue to pose significant challenges. The widely-used Denoising Score Matching (DSM) method for…

Machine Learning · Statistics 2024-03-20 Mingtian Zhang, Alex Hawkins-Hooker, Brooks Paige, David Barber

Energy-based models (EBMs) have experienced a resurgence within machine learning in recent years, including as a promising alternative for probabilistic regression. However, energy-based regression requires a proposal distribution to be…

Machine Learning · Computer Science 2023-11-08 Fredrik K. Gustafsson, Martin Danelljan, Thomas B. Schön

We focus on the problem of efficient sampling and learning of probability densities by incorporating symmetries in probabilistic models. We first introduce Equivariant Stein Variational Gradient Descent algorithm -- an equivariant sampling…

Machine Learning · Computer Science 2021-07-30 Priyank Jaini, Lars Holdijk, Max Welling

Energy-based models (EBMs) offer a flexible framework for probabilistic modelling across various data domains. However, training EBMs on data in discrete or mixed state spaces poses significant challenges due to the lack of robust and fast…

Machine Learning · Statistics 2024-12-03 Tobias Schröder, Zijing Ou, Yingzhen Li, Andrew B. Duncan

Importance sampling (IS) is a powerful Monte Carlo methodology for the approximation of intractable integrals, very often involving a target probability density function. The performance of IS heavily depends on the appropriate selection of…

Computation · Statistics 2023-06-22 Víctor Elvira, Emilie Chouzenoux, Ömer Deniz Akyildiz, Luca Martino

Computing the loss gradient via backpropagation consumes considerable energy during deep learning (DL) model training. In this paper, we propose a novel approach to efficiently compute DL models' gradients to mitigate the substantial energy…

Computer Vision and Pattern Recognition · Computer Science 2024-06-12 Challapalli Phanindra Revanth, Sumohana S. Channappayya, C Krishna Mohan

Energy-based models (EBMs) are versatile density estimation models that directly parameterize an unnormalized log density. Although very flexible, EBMs lack a specified normalization constant of the model, making the likelihood of the model…

Machine Learning · Computer Science 2024-02-20 Louis Grenioux, Éric Moulines, Marylou Gabrié

Full-parameter fine-tuning of large language models is constrained by substantial GPU memory requirements. Low-rank adaptation methods mitigate this challenge by updating only a subset of parameters. However, these approaches often limit…

Computation and Language · Computer Science 2026-04-10 Kaiyuan Tian, Yu Tang, Gongqingjian Jiang, Baihui Liu, Yifu Gao, Xialin Su, Linbo Qiao, Dongsheng Li

Importance sampling has been successfully used to accelerate stochastic optimization in many convex problems. However, the lack of an efficient way to calculate the importance still hinders its application to Deep Learning. In this paper,…

Machine Learning · Computer Science 2017-09-14 Angelos Katharopoulos, François Fleuret

Energy-Based Models (EBMs) assign unnormalized log-probability to data samples. This functionality has a variety of applications, such as sample synthesis, data denoising, sample restoration, outlier detection, Bayesian reasoning, and many…

Machine Learning · Statistics 2019-12-23 Zengyi Li, Yubei Chen, Friedrich T. Sommer

Machine learning optimization often depends on stochastic gradient descent, where the precision of gradient estimation is vital for model performance. Gradients are calculated from mini-batches formed by uniformly selecting data samples…

Machine Learning · Computer Science 2025-01-29 Corentin Salaün, Xingchang Huang, Iliyan Georgiev, Niloy J. Mitra, Gurprit Singh

Energy-based models (EBMs) are generative models that are usually trained via maximum likelihood estimation. This approach becomes challenging in generic situations where the trained energy is non-convex, due to the need to sample the Gibbs…

Machine Learning · Computer Science 2022-02-16 Carles Domingo-Enrich, Alberto Bietti, Marylou Gabrié, Joan Bruna, Eric Vanden-Eijnden

This paper studies the fundamental learning problem of the energy-based model (EBM). Learning the EBM can be achieved using the maximum likelihood estimation (MLE), which typically involves the Markov Chain Monte Carlo (MCMC) sampling, such…

Machine Learning · Computer Science 2023-12-06 Jiali Cui, Tian Han

Energy-based models (EBMs) offer a flexible framework for parameterizing probability distributions using neural networks. However, learning EBMs by exact maximum likelihood estimation (MLE) is generally intractable, due to the need to…

Machine Learning · Computer Science 2025-08-20 Michael E. Sander, Vincent Roulet, Tianlin Liu, Mathieu Blondel

We propose Energy-based generator matching (EGM), a modality-agnostic approach to train generative models from energy functions in the absence of data. Extending the recently proposed generator matching, EGM enables training of arbitrary…

Machine Learning · Computer Science 2025-11-20 Dongyeop Woo, Minsu Kim, Minkyu Kim, Kiyoung Seong, Sungsoo Ahn

Energy-based learning is a powerful framework for generative modelling, but its training is inherently non-convex, leading potentially to sensitivity to initialisation, poor local optima, and unstable gradient dynamics. We present a…

Machine Learning · Computer Science 2026-05-11 Aurélien Decelle, Alfonso de Jesús Navas Gómez, Beatriz Seoane

We introduce a theoretical and practical framework for efficient importance sampling of mini-batch samples for gradient estimation from single and multiple probability distributions. To handle noisy gradients, our framework dynamically…

Machine Learning · Computer Science 2025-01-29 Corentin Salaün, Xingchang Huang, Iliyan Georgiev, Niloy J. Mitra, Gurprit Singh

State-of-the-art deep learning models have achieved significant performance levels on various benchmarks. However, the excellent performance comes at a cost of inefficient computational cost. Light-weight architectures, on the other hand,…

Computer Vision and Pattern Recognition · Computer Science 2021-10-22 Mohammad Akbari, Amin Banitalebi-Dehkordi, Yong Zhang

Global Autoregressive Models (GAMs) are a recent proposal [Parshakova et al., CoNLL 2019] for exploiting global properties of sequences for data-efficient learning of seq2seq models. In the first phase of training, an Energy-Based model…

Machine Learning · Computer Science 2019-12-19 Tetiana Parshakova, Jean-Marc Andreoli, Marc Dymetman