Related papers: Estimating Random Variables from Random Sparse Obs…

A Mathematical Framework for Feature Selection from Real-World Data with Non-Linear Observations

In this paper, we study the challenge of feature selection based on a relatively small collection of sample pairs $\{(x_i, y_i)\}_{1 \leq i \leq m}$. The observations $y_i \in \mathbb{R}$ are thereby supposed to follow a noisy single-index…

Machine Learning · Statistics 2016-12-28 Martin Genzel , Gitta Kutyniok

Out-of-distribution generalization under random, dense distributional shifts

Many existing approaches for estimating parameters in settings with distributional shifts operate under an invariance assumption. For example, under covariate shift, it is assumed that $p(y|x)$ remains invariant. We refer to such…

Methodology · Statistics 2025-02-07 Yujin Jeong , Dominik Rothenhäusler

Statistics of selectively neutral genetic variation

Random models of evolution are instrumental in extracting rates of microscopic evolutionary mechanisms from empirical observations on genetic variation in genome sequences. In this context it is necessary to know the statistical properties…

Biological Physics · Physics 2009-11-07 A. Eriksson , B. Haubold , B. Mehlig

Inference on a Distribution from Noisy Draws

We consider a situation where the distribution of a random variable is being estimated by the empirical distribution of noisy measurements of that variable. This is common practice in, for example, teacher value-added models and other…

Econometrics · Economics 2021-12-08 Koen Jochmans , Martin Weidner

Compressed Anomaly Detection with Multiple Mixed Observations

We consider a collection of independent random variables that are identically distributed, except for a small subset which follows a different, anomalous distribution. We study the problem of detecting which random variables in the…

Information Theory · Computer Science 2018-06-21 Natalie Durgin , Rachel Grotheer , Chenxi Huang , Shuang Li , Anna Ma , Deanna Needell , Jing Qin

Sparse Choice Models

Choice models, which capture popular preferences over objects of interest, play a key role in making decisions whose eventual outcome is impacted by human choice behavior. In most scenarios, the choice model, which can effectively be viewed…

Methodology · Statistics 2011-09-22 Vivek F. Farias , Srikanth Jagabathula , Devavrat Shah

Sparse Signal Processing with Linear and Nonlinear Observations: A Unified Shannon-Theoretic Approach

We derive fundamental sample complexity bounds for recovering sparse and structured signals for linear and nonlinear observation models including sparse regression, group testing, multivariate regression and problems with missing features.…

Information Theory · Computer Science 2017-02-17 Cem Aksoylar , George Atia , Venkatesh Saligrama

Convergence rates of posterior distributions for noniid observations

We consider the asymptotic behavior of posterior distributions and Bayes estimators based on observations which are required to be neither independent nor identically distributed. We give general results on the rate of convergence of the…

Statistics Theory · Mathematics 2009-09-29 Subhashis Ghosal , Aad van der Vaart

Stable marked point processes

In many contexts such as queuing theory, spatial statistics, geostatistics and meteorology, data are observed at irregular spatial positions. One model of this situation involves considering the observation points as generated by a Poisson…

Statistics Theory · Mathematics 2007-08-07 Tucker McElroy , Dimitris N. Politis

Active Distribution Learning from Indirect Samples

This paper studies the problem of {\em learning} the probability distribution $P_X$ of a discrete random variable $X$ using indirect and sequential samples. At each time step, we choose one of the possible $K$ functions, $g_1, \ldots, g_K$…

Machine Learning · Computer Science 2018-08-17 Samarth Gupta , Gauri Joshi , Osman Yağan

Sampling Conditionally on a Rare Event via Generalized Splitting

We propose and analyze a generalized splitting method to sample approximately from a distribution conditional on the occurrence of a rare event. This has important applications in a variety of contexts in operations research, engineering,…

Methodology · Statistics 2019-09-10 Zdravko I. Botev , Pierre L'Ecuyer

Regression estimation from an individual stable sequence

We consider univariate regression estimation from an individual (non-random) sequence $(x_1,y_1),(x_2,y_2), ... \in \real \times \real$, which is stable in the sense that for each interval $A \subseteq \real$, (i) the limiting relative…

Probability · Mathematics 2008-06-19 Gusztav Morvai , Sanjeev R. Kulkarni , Andrew B. Nobel

Martingale posterior distributions

The prior distribution on parameters of a sampling distribution is the usual starting point for Bayesian uncertainty quantification. In this paper, we present a different perspective which focuses on missing observations as the source of…

Methodology · Statistics 2021-11-23 Edwin Fong , Chris Holmes , Stephen G. Walker

Causal Discovery Using Proxy Variables

Discovering causal relations is fundamental to reasoning and intelligence. In particular, observational causal discovery algorithms estimate the cause-effect relation between two random entities $X$ and $Y$, given $n$ samples from $P(X,Y)$.…

Machine Learning · Statistics 2017-02-24 Mateo Rojas-Carulla , Marco Baroni , David Lopez-Paz

Statistical learning with indirect observations

Let $(X,Y)\in\mathcal{X}\times \mathcal{Y}$ be a random couple with unknown distribution $P$. Let $\GG$ be a class of measurable functions and $\ell$ a loss function. The problem of statistical learning deals with the estimation of the…

Statistics Theory · Mathematics 2012-07-12 Sébastien Loustau

Quantifying and estimating dependence via sensitivity of conditional distributions

Recently established, directed dependence measures for pairs $(X,Y)$ of random variables build upon the natural idea of comparing the conditional distributions of $Y$ given $X=x$ with the marginal distribution of $Y$. They assign pairs…

Statistics Theory · Mathematics 2023-09-22 Jonathan Ansari , Patrick B. Langthaler , Sebastian Fuchs , Wolfgang Trutschnig

Sparse Data-Driven Random Projection in Regression for High-Dimensional Data

We examine the linear regression problem in a challenging high-dimensional setting with correlated predictors where the vector of coefficients can vary from sparse to dense. In this setting, we propose a combination of probabilistic…

Methodology · Statistics 2025-05-13 Roman Parzer , Peter Filzmoser , Laura Vana-Gür

High-Dimensional Varying Coefficient Models with Functional Random Effects

We consider a sparse high-dimensional varying coefficients model with random effects, a flexible linear model allowing covariates and coefficients to have a functional dependence with time. For each individual, we observe discretely sampled…

Statistics Theory · Mathematics 2021-10-14 Michael Law , Ya'acov Ritov

Sparse Structures for Multivariate Extremes

Extreme value statistics provides accurate estimates for the small occurrence probabilities of rare events. While theory and statistical tools for univariate extremes are well-developed, methods for high-dimensional and complex data sets…

Methodology · Statistics 2021-01-06 Sebastian Engelke , Jevgenijs Ivanovs

Learning from compressed observations

The problem of statistical learning is to construct a predictor of a random variable $Y$ as a function of a related random variable $X$ on the basis of an i.i.d. training sample from the joint distribution of $(X,Y)$. Allowable predictors…

Information Theory · Computer Science 2016-11-15 Maxim Raginsky