Related papers: Statistics of extremes by oracle estimation

Optimal Kullback-Leibler Aggregation in Mixture Density Estimation by Maximum Likelihood

We study the maximum likelihood estimator of density of $n$ independent observations, under the assumption that it is well approximated by a mixture with a large number of components. The main focus is on statistical properties with respect…

Statistics Theory · Mathematics 2017-01-19 Arnak S. Dalalyan, Mehdi Sebbar

Optimal exponential bounds for aggregation of estimators for the Kullback-Leibler loss

We study the problem of model selection type aggregation with respect to the Kullback-Leibler divergence for various probabilistic models. Rather than considering a convex combination of the initial estimators $f_1, \ldots, f_N$, our…

Statistics Theory · Mathematics 2016-01-22 Cristina Butucea, Jean-François Delmas, Anne Dutfoy, Richard Fischer

Linear Regression for Power Law Distribution Fitting

We fit the exponent of the Pareto distribution, which is equivalent to, or can approximate, the continuous power law distribution given a cutoff point, using linear regression (LR). We use LR on the logged variables of the empirical tail (one…

Applications · Statistics 2023-12-21 Samuel Forbes

Kullback-Leibler aggregation and misspecified generalized linear models

In a regression setup with deterministic design, we study the pure aggregation problem and introduce a natural extension from the Gaussian distribution to distributions in the exponential family. While this extension bears strong…

Machine Learning · Statistics 2012-06-06 Philippe Rigollet

A Kullback-Leibler divergence test for multivariate extremes: theory and practice

Testing whether two multivariate samples exhibit the same extremal behavior is an important problem in various fields including environmental and climate sciences. While several ad-hoc approaches exist in the literature, they often lack…

Statistics Theory · Mathematics 2026-02-03 Sebastian Engelke, Philippe Naveau, Chen Zhou

On the folded normal distribution

The characteristic function of the folded normal distribution and its moment function are derived. The entropy of the folded normal distribution and the Kullback-Leibler from the normal and half normal distributions are approximated using…

Methodology · Statistics 2014-02-17 Michail Tsagris, Christina Beneki, Hossein Hassani

Conditional Density Estimation by Penalized Likelihood Model Selection and Applications

In this technical report, we consider conditional density estimation with a maximum likelihood approach. Under weak assumptions, we obtain a theoretical bound for a Kullback-Leibler type loss for a single model maximum likelihood estimate.…

Statistics Theory · Mathematics 2012-07-11 Serge Cohen, Erwan Le Pennec

Estimation of Expected Shortfall under Various Experimental Conditions

Our primary aim is to find an estimate of the expected shortfall in various situations: (1) Nonparametric situation, when the probability distribution of the incurred loss is unknown, only satisfying some general conditions. Then, following…

Methodology · Statistics 2022-12-26 Jana Jurečková, Jan Kalina, Jan Večeř

Finite mixture regression: A sparse variable selection by model selection for clustering

We consider a finite mixture of Gaussian regression model for high-dimensional data, where the number of covariates may be much larger than the sample size. We propose to estimate the unknown conditional mixture density by a maximum…

Statistics Theory · Mathematics 2014-09-05 Emilie Devijver

Scoring Alternative Forecast Distributions: Completing the Kullback Distance Complex

We develop two surprising new results regarding the use of proper scoring rules for evaluating the predictive quality of two alternative sequential forecast distributions. Both of the proponents prefer to be awarded a score derived from the…

Probability · Mathematics 2019-09-17 Frank Lad, Giuseppe Sanfilippo

Strong Convergence of Peaks Over a Threshold

Extreme Value Theory plays an important role in providing approximation results for the extremes of a sequence of independent random variables when their distribution is unknown. An important one is given by the generalised Pareto…

Probability · Mathematics 2024-05-08 Simone A. Padoan, Stefano Rizzelli

Alternative modelling and inference methods for claim size distributions

The upper tail of a claim size distribution of a property line of business is frequently modelled by a Pareto distribution. However, the upper tail does not need to be Pareto distributed; extraordinary shapes are possible. Here, the…

Methodology · Statistics 2020-02-19 Mathias Raschke

Kullback-Leibler excess risk bounds for exponential weighted aggregation in Generalized linear models

Aggregation methods have emerged as a powerful and flexible framework in statistical learning, providing unified solutions across diverse problems such as regression, classification, and density estimation. In the context of generalized…

Statistics Theory · Mathematics 2025-04-15 The Tien Mai

Fixed-Form Variational Posterior Approximation through Stochastic Linear Regression

We propose a general algorithm for approximating nonstandard Bayesian posterior distributions. The algorithm minimizes the Kullback-Leibler divergence of an approximating distribution to the intractable posterior distribution. Our method…

Computation · Statistics 2014-07-29 Tim Salimans, David A. Knowles

On asymptotic efficiency of goodness-of-fit tests for the Pareto distribution based on its characterization

We introduce a new characterization of the Pareto distribution and construct integral and supremum type goodness-of-fit tests based on it. The limiting distribution and large deviations of the new statistics are described and their local Bahadur…

Statistics Theory · Mathematics 2014-08-21 K. Yu. Volkova

Estimation of discrete distributions in relative entropy, and the deviations of the missing mass

We study the problem of estimating a distribution over a finite alphabet from an i.i.d. sample, with accuracy measured in relative entropy (Kullback-Leibler divergence). While optimal bounds on the expected risk are known, high-probability…

Statistics Theory · Mathematics 2026-02-27 Jaouad Mourtada

Oracle Inequalities for Convex Loss Functions with Non-Linear Targets

This paper considers penalized empirical loss minimization of convex loss functions with unknown non-linear target functions. Using the elastic net penalty we establish a finite sample oracle inequality which bounds the loss of our estimator…

Statistics Theory · Mathematics 2013-12-13 Mehmet Caner, Anders Bredahl Kock

On the Kullback-Leibler divergence between discrete normal distributions

Discrete normal distributions are defined as the distributions with prescribed means and covariance matrices which maximize entropy on the integer lattice support. The set of discrete normal distributions form an exponential family with…

Information Theory · Computer Science 2022-01-25 Frank Nielsen

An $\ell_1$-oracle inequality for the Lasso in finite mixture of multivariate Gaussian regression models

We consider a multivariate finite mixture of Gaussian regression models for high-dimensional data, where the number of covariates and the size of the response may be much larger than the sample size. We provide an $\ell_1$-oracle inequality…

Statistics Theory · Mathematics 2014-10-20 Emilie Devijver

Distribution Estimation under the Infinity Norm

We present novel bounds for estimating discrete probability distributions under the $\ell_\infty$ norm. These are nearly optimal in various precise senses, including a kind of instance-optimality. Our data-dependent convergence guarantees…

Statistics Theory · Mathematics 2024-02-14 Aryeh Kontorovich, Amichai Painsky