Related papers: Efficient Statistics for Sparse Graphical Models f…

200 papers

We provide an efficient algorithm for the classical problem, going back to Galton, Pearson, and Fisher, of estimating, with arbitrary accuracy, the parameters of a multivariate normal distribution from truncated samples. Truncated samples…

Statistics Theory · Mathematics 2020-10-26 Constantinos Daskalakis, Themis Gouleakis, Christos Tzamos, Manolis Zampetakis

We study the problem of estimating the parameters of a Gaussian distribution when samples are only shown if they fall in some (unknown) subset $S \subseteq \mathbb{R}^d$. This core problem in truncated statistics has a long history going back to…

Statistics Theory · Mathematics 2019-08-06 Vasilis Kontonis, Christos Tzamos, Manolis Zampetakis

We study the problem of estimating the parameters of a Boolean product distribution in $d$ dimensions, when the samples are truncated by a set $S \subset \{0, 1\}^d$ accessible through a membership oracle. This is the first time that the…

Machine Learning · Computer Science 2026-05-05 Dimitris Fotakis, Alkis Kalavasis, Christos Tzamos

As in standard linear regression, in truncated linear regression, we are given access to observations $(A_i, y_i)_i$ whose dependent variable equals $y_i= A_i^{\rm T} \cdot x^* + \eta_i$, where $x^*$ is some fixed unknown vector of interest…

Machine Learning · Computer Science 2020-07-30 Constantinos Daskalakis, Dhruv Rohatgi, Manolis Zampetakis

Sparse linear regression is a central problem in high-dimensional statistics. We study the correlated random design setting, where the covariates are drawn from a multivariate Gaussian $N(0,\Sigma)$, and we seek an estimator with small…

Data Structures and Algorithms · Computer Science 2023-05-29 Jonathan Kelner, Frederic Koehler, Raghu Meka, Dhruv Rohatgi

We provide efficient algorithms for the problem of distribution learning from high-dimensional Gaussian data where in each sample, some of the variable values are missing. We suppose that the variables are missing not at random (MNAR). The…

Machine Learning · Computer Science 2025-04-29 Arnab Bhattacharyya, Constantinos Daskalakis, Themis Gouleakis, Yuhao Wang

Gaussian graphical models (GGMs) are widely used for statistical modeling, because of ease of inference and the ubiquitous use of the normal distribution in practical approximations. However, they are also known for their limited modeling…

Machine Learning · Statistics 2016-11-22 Qinliang Su, Xuejun Liao, Chunyuan Li, Zhe Gan, Lawrence Carin

Truncated linear regression is a classical challenge in Statistics, wherein a label, $y = w^T x + \varepsilon$, and its corresponding feature vector, $x \in \mathbb{R}^k$, are only observed if the label falls in some subset $S \subseteq…

Methodology · Statistics 2022-08-26 Constantinos Daskalakis, Patroklos Stefanou, Rui Yao, Manolis Zampetakis

We consider the problem of learning a graph modeling the statistical relations of the $d$ variables from a dataset with $n$ samples $X \in \mathbb{R}^{n \times d}$. Standard approaches amount to searching for a precision matrix $\Theta$…

Machine Learning · Statistics 2023-12-13 Titouan Vayer, Etienne Lasalle, Rémi Gribonval, Paulo Gonçalves

We introduce the truncated Gaussian graphical model (TGGM) as a novel framework for designing statistical models for nonlinear learning. A TGGM is a Gaussian graphical model (GGM) with a subset of variables truncated to be nonnegative. The…

Machine Learning · Statistics 2016-11-22 Qinliang Su, Xuejun Liao, Changyou Chen, Lawrence Carin

Graphical models have become a very popular tool for representing dependencies within a large set of variables and are key for representing causal structures. We provide results for uniform inference on high-dimensional graphical models…

Methodology · Statistics 2018-12-04 Sven Klaassen, Jannis Kück, Martin Spindler, Victor Chernozhukov

We provide a computationally and statistically efficient estimator for the classical problem of truncated linear regression, where the dependent variable $y = w^T x + \epsilon$ and its corresponding vector of covariates $x \in \mathbb{R}^k$ are only…

Statistics Theory · Mathematics 2020-10-26 Constantinos Daskalakis, Themis Gouleakis, Christos Tzamos, Manolis Zampetakis

This paper introduces a simple principle for robust high-dimensional statistical inference via an appropriate shrinkage on the data. This widens the scope of high-dimensional techniques, reducing the moment conditions from sub-exponential…

Statistics Theory · Mathematics 2017-05-08 Jianqing Fan, Weichen Wang, Ziwei Zhu

One of the fundamental tasks of science is to find explainable relationships between observed phenomena. One approach to this task that has received attention in recent years is based on probabilistic graphical modelling with sparsity…

Machine Learning · Statistics 2014-04-16 Peter Orchard, Felix Agakov, Amos Storkey

Compressed sensing deals with the reconstruction of sparse signals using a small number of linear measurements. One of the main challenges in compressed sensing is to find the support of a sparse signal. In the literature, several bounds on…

Information Theory · Computer Science 2009-11-26 Ali Hormati, Amin Karbasi, Soheil Mohajer, Martin Vetterli

Gaussian Graphical Models (GGMs) are popular tools for studying network structures. However, many modern applications such as gene network discovery and social interactions analysis often involve high-dimensional noisy data with outliers or…

Machine Learning · Statistics 2015-10-30 Eunho Yang, Aurélie C. Lozano

Within Bayesian state estimation, considerable effort has been devoted to incorporating constraints into state estimation for process optimization, state monitoring, fault detection and control. Nonetheless, in the domain of state-space…

Systems and Control · Electrical Eng. & Systems 2025-07-28 Rodrigo A. González, Angel L. Cedeño, Koen Tiels, Tom Oomen

Many conventional statistical procedures are extremely sensitive to seemingly minor deviations from modeling assumptions. This problem is exacerbated in modern high-dimensional settings, where the problem dimension can grow with and…

Machine Learning · Statistics 2017-02-27 Simon S. Du, Sivaraman Balakrishnan, Aarti Singh

We show a statistical version of Taylor's theorem and apply this result to non-parametric density estimation from truncated samples, which is a classical challenge in Statistics \cite{woodroofe1985estimating, stute1993almost}. The…

Statistics Theory · Mathematics 2021-07-01 Constantinos Daskalakis, Vasilis Kontonis, Christos Tzamos, Manolis Zampetakis

We consider the problem of high-dimensional Gaussian graphical model selection. We identify a set of graphs for which an efficient estimation algorithm exists, and this algorithm is based on thresholding of empirical conditional…

Machine Learning · Computer Science 2012-03-06 Animashree Anandkumar, Vincent Y. F. Tan, Alan S. Willsky