Related papers: A Note on Uncertainty Quantification for Maximum L…

On the Pitfalls of Heteroscedastic Uncertainty Estimation with Probabilistic Neural Networks

Capturing aleatoric uncertainty is a critical part of many machine learning systems. In deep learning, a common approach to this end is to train a neural network to estimate the parameters of a heteroscedastic Gaussian distribution by…

Machine Learning · Computer Science 2022-04-04 Maximilian Seitzer , Arash Tavakoli , Dimitrije Antic , Georg Martius

Gradient-based Hyperparameter Optimization through Reversible Learning

Tuning hyperparameters of learning algorithms is hard because gradients are usually unavailable. We compute exact gradients of cross-validation performance with respect to all hyperparameters by chaining derivatives backwards through the…

Machine Learning · Statistics 2015-04-03 Dougal Maclaurin , David Duvenaud , Ryan P. Adams

Approximate Maximum Likelihood Estimation

In recent years, methods of approximate parameter estimation have attracted considerable interest in complex problems where exact likelihoods are hard to obtain. In their most basic form, Bayesian methods such as Approximate Bayesian…

Computation · Statistics 2015-07-17 Johanna Bertl , Gregory Ewing , Carolin Kosiol , Andreas Futschik

Automatic Differentiation of Optimization Algorithms with Time-Varying Updates

Numerous Optimization Algorithms have a time-varying update rule thanks to, for instance, a changing step size, momentum parameter or, Hessian approximation. In this paper, we apply unrolled or automatic differentiation to a time-varying…

Optimization and Control · Mathematics 2024-10-28 Sheheryar Mehmood , Peter Ochs

A Stochastic Gradient Method with Biased Estimation for Faster Nonconvex Optimization

A number of optimization approaches have been proposed for optimizing nonconvex objectives (e.g. deep learning models), such as batch gradient descent, stochastic gradient descent and stochastic variance reduced gradient descent. Theory…

Machine Learning · Computer Science 2019-05-15 Jia Bi , Steve R. Gunn

A New Stochastic Approximation Method for Gradient-based Simulated Parameter Estimation

This paper tackles the challenge of parameter calibration in stochastic models, particularly in scenarios where the likelihood function is unavailable in an analytical form. We introduce a gradient-based simulated parameter estimation…

Machine Learning · Statistics 2025-03-25 Zehao Li , Yijie Peng

Gradient Descent with Provably Tuned Learning-rate Schedules

Gradient-based iterative optimization methods are the workhorse of modern machine learning. They crucially rely on careful tuning of parameters like learning rate and momentum. However, one typically sets them using heuristic approaches…

Machine Learning · Computer Science 2025-12-05 Dravyansh Sharma

Stochastic Adaptive Optimization with Unreliable Inputs: A Unified Framework for High-Probability Complexity Analysis

We consider an unconstrained continuous optimization problem where, in each iteration, gradient estimates may be arbitrarily corrupted with a probability greater than 1/2. Additionally, function value estimates may exhibit heavy-tailed…

Optimization and Control · Mathematics 2025-11-25 Katya Scheinberg , Miaolan Xie

Learning Covariances for Estimation with Constrained Bilevel Optimization

We consider the problem of learning error covariance matrices for robotic state estimation. The convergence of a state estimator to the correct belief over the robot state is dependent on the proper tuning of noise models. During inference,…

Robotics · Computer Science 2023-09-19 Mohamad Qadri , Zachary Manchester , Michael Kaess

Inference by Stochastic Optimization: A Free-Lunch Bootstrap

Assessing sampling uncertainty in extremum estimation can be challenging when the asymptotic variance is not analytically tractable. Bootstrap inference offers a feasible solution but can be computationally costly especially when the model…

Econometrics · Economics 2020-09-15 Jean-Jacques Forneron , Serena Ng

Probabilistic Gradients for Fast Calibration of Differential Equation Models

Calibration of large-scale differential equation models to observational or experimental data is a widespread challenge throughout applied sciences and engineering. A crucial bottleneck in state-of-the art calibration methods is the…

Optimization and Control · Mathematics 2021-02-23 Jon Cockayne , Andrew B. Duncan

Multi-Level Restricted Maximum Likelihood Covariance Estimation and Kriging for Large Non-Gridded Spatial Datasets

We develop a multi-level restricted Gaussian maximum likelihood method for estimating the covariance function parameters and computing the best unbiased predictor. Our approach produces a new set of multi-level contrasts where the…

Computation · Statistics 2016-03-29 Julio E. Castrillon-Candas , Marc G. Genton , Rio Yokota

Randomized subspace gradient method for constrained optimization

We propose randomized subspace gradient methods for high-dimensional constrained optimization. While there have been similarly purposed studies on unconstrained optimization problems, there have been few on constrained optimization problems…

Optimization and Control · Mathematics 2023-07-10 Ryota Nozawa , Pierre-Louis Poirion , Akiko Takeda

Improving Portfolios Global Performance with Robust Covariance Matrix Estimation: Application to the Maximum Variety Portfolio

This paper presents how the most recent improvements made on covariance matrix estimation and model order selection can be applied to the portfolio optimisation problem. The particular case of the Maximum Variety Portfolio is treated but…

Applications · Statistics 2018-04-03 Emmanuelle Jay , Eugénie Terreaux , Jean-Philippe Ovarlez , Frédéric Pascal

Probabilistic Unrolling: Scalable, Inverse-Free Maximum Likelihood Estimation for Latent Gaussian Models

Latent Gaussian models have a rich history in statistics and machine learning, with applications ranging from factor analysis to compressed sensing to time series analysis. The classical method for maximizing the likelihood of these models…

Machine Learning · Computer Science 2023-06-07 Alexander Lin , Bahareh Tolooshams , Yves Atchadé , Demba Ba

Two new algorithms for maximum likelihood estimation of sparse covariance matrices with applications to graphical modeling

In this paper, we propose two new algorithms for maximum-likelihood estimation (MLE) of high dimensional sparse covariance matrices. Unlike most of the state of-the-art methods, which either use regularization techniques or penalize the…

Methodology · Statistics 2023-05-12 Ghania Fatima , Prabhu Babu , Petre Stoica

A Stochastic Trust Region Algorithm Based on Careful Step Normalization

An algorithm is proposed for solving stochastic and finite sum minimization problems. Based on a trust region methodology, the algorithm employs normalized steps, at least as long as the norms of the stochastic gradient estimates are within…

Optimization and Control · Mathematics 2018-06-27 Frank E. Curtis , Katya Scheinberg , Rui Shi

Uncertainty estimation in equality-constrained MAP and maximum likelihood estimation with applications to system identification and state estimation

In unconstrained maximum a posteriori (MAP) and maximum likelihood estimation, the inverse of minus the merit-function Hessian matrix is an approximation of the estimate covariance matrix. In the Bayesian context of MAP estimation, it is…

Methodology · Statistics 2020-03-17 Dimas Abreu Archanjo Dutra

A Graduated Filter Method for Large Scale Robust Estimation

Due to the highly non-convex nature of large-scale robust parameter estimation, avoiding poor local minima is challenging in real-world applications where input data is contaminated by a large or unknown fraction of outliers. In this paper,…

Computer Vision and Pattern Recognition · Computer Science 2020-03-23 Huu Le , Christopher Zach

A Deterministic Global Optimization Method for Variational Inference

Variational inference methods for latent variable statistical models have gained popularity because they are relatively fast, can handle large data sets, and have deterministic convergence guarantees. However, in practice it is unclear…

Methodology · Statistics 2017-03-22 Hachem Saddiki , Andrew C. Trapp , Patrick Flaherty