English
Related papers

Related papers: Safe Adaptive Importance Sampling

200 papers

We introduce data structures for solving robust regression through stochastic gradient descent (SGD) by sampling gradients with probability proportional to their norm, i.e., importance sampling. Although SGD is widely used for large scale…

Machine Learning · Computer Science 2022-07-19 Sepideh Mahabadi , David P. Woodruff , Samson Zhou

Variational inference approximates the posterior distribution of a probabilistic model with a parameterized density by maximizing a lower bound for the model evidence. Modern solutions fit a flexible approximation with stochastic gradient…

Machine Learning · Statistics 2017-07-13 Joseph Sakaya , Arto Klami

Stochastic Gradient Descent (SGD) is one of the most widely used techniques for online optimization in machine learning. In this work, we accelerate SGD by adaptively learning how to sample the most useful training examples at each time…

Machine Learning · Computer Science 2016-03-16 Guillaume Bouchard , Théo Trouillon , Julien Perez , Adrien Gaidon

Sampling is an important tool for estimating large, complex sums and integrals over high dimensional spaces. For instance, important sampling has been used as an alternative to exact methods for inference in belief networks. Ideally, we…

Artificial Intelligence · Computer Science 2013-01-18 Luis E. Ortiz , Leslie Pack Kaelbling

In modern data analysis, random sampling is an efficient and widely-used strategy to overcome the computational difficulties brought by large sample size. In previous studies, researchers conducted random sampling which is according to the…

Machine Learning · Statistics 2018-03-05 Rong Zhu

We propose a novel adaptive importance sampling algorithm which incorporates Stein variational gradient decent algorithm (SVGD) with importance sampling (IS). Our algorithm leverages the nonparametric transforms in SVGD to iteratively…

Machine Learning · Statistics 2017-07-26 Jun Han , Qiang Liu

In this paper, we propose a stochastic optimization method that adaptively controls the sample size used in the computation of gradient approximations. Unlike other variance reduction techniques that either require additional storage or the…

Optimization and Control · Mathematics 2017-11-01 Raghu Bollapragada , Richard Byrd , Jorge Nocedal

Uniform sampling of training data has been commonly used in traditional stochastic optimization algorithms such as Proximal Stochastic Gradient Descent (prox-SGD) and Proximal Stochastic Dual Coordinate Ascent (prox-SDCA). Although uniform…

Machine Learning · Statistics 2015-01-05 Peilin Zhao , Tong Zhang

We introduce a theoretical and practical framework for efficient importance sampling of mini-batch samples for gradient estimation from single and multiple probability distributions. To handle noisy gradients, our framework dynamically…

Machine Learning · Computer Science 2025-01-29 Corentin Salaün , Xingchang Huang , Iliyan Georgiev , Niloy J. Mitra , Gurprit Singh

Deep neural network training spends most of the computation on examples that are properly handled, and could be ignored. We propose to mitigate this phenomenon with a principled importance sampling scheme that focuses computation on…

Machine Learning · Computer Science 2019-10-29 Angelos Katharopoulos , François Fleuret

Machine learning optimization often depends on stochastic gradient descent, where the precision of gradient estimation is vital for model performance. Gradients are calculated from mini-batches formed by uniformly selecting data samples…

Machine Learning · Computer Science 2025-01-29 Corentin Salaün , Xingchang Huang , Iliyan Georgiev , Niloy J. Mitra , Gurprit Singh

We study distributed optimization algorithms for minimizing the average of \emph{heterogeneous} functions distributed across several machines with a focus on communication efficiency. In such settings, naively using the classical stochastic…

Machine Learning · Computer Science 2020-11-18 Ilqar Ramazanli , Han Nguyen , Hai Pham , Sashank J. Reddi , Barnabas Poczos

Adaptive sampling algorithms are modern and efficient methods that dynamically adjust the sample size throughout the optimization process. However, they may encounter difficulties in risk-averse settings, particularly due to the challenge…

Optimization and Control · Mathematics 2025-02-17 Sandra Pieraccini , Tommaso Vanzan

We propose an adaptive importance sampling scheme for Gaussian approximations of intractable posteriors. Optimization-based approximations like variational inference can be too inaccurate while existing Monte Carlo methods can be too slow.…

Computation · Statistics 2025-02-04 Willem van den Boom , Andrea Cremaschi , Alexandre H. Thiery

Sharpness-aware Minimization (SAM) has been proposed recently to improve model generalization ability. However, SAM calculates the gradient twice in each optimization step, thereby doubling the computation costs compared to stochastic…

Computer Vision and Pattern Recognition · Computer Science 2024-03-15 Jiaxin Deng , Junbiao Pang , Baochang Zhang , Tian Wang

Importance sampling (IS) is a powerful Monte Carlo methodology for the approximation of intractable integrals, very often involving a target probability density function. The performance of IS heavily depends on the appropriate selection of…

Computation · Statistics 2023-06-22 Víctor Elvira , Emilie Chouzenoux , Ömer Deniz Akyildiz , Luca Martino

Stochastic gradient descent (SGD) is a powerful optimization technique that is particularly useful in online learning scenarios. Its convergence analysis is relatively well understood under the assumption that the data samples are…

Machine Learning · Computer Science 2024-10-03 Ethan Che , Jing Dong , Xin T. Tong

We introduce a clipping strategy for Stochastic Gradient Descent (SGD) which uses quantiles of the gradient norm as clipping thresholds. We prove that this new strategy provides a robust and efficient optimization algorithm for smooth…

Machine Learning · Statistics 2024-10-15 Ibrahim Merad , Stéphane Gaïffas

Stochastic gradient descent samples uniformly the training set to build an unbiased gradient estimate with a limited number of samples. However, at a given step of the training process, some data are more helpful than others to continue…

Machine Learning · Computer Science 2023-03-30 Thibault Lahire

Adaptive Monte Carlo schemes developed over the last years usually seek to ensure ergodicity of the sampling process in line with MCMC tradition. This poses constraints on what is possible in terms of adaptation. In the general case…

Machine Learning · Statistics 2015-07-22 Ingmar Schuster
‹ Prev 1 2 3 10 Next ›