English
Related papers

Related papers: Method G: Uncertainty Quantification for Distribut…

200 papers

In recent years the ultrahigh dimensional linear regression problem has attracted enormous attentions from the research community. Under the sparsity assumption most of the published work is devoted to the selection and estimation of the…

Methodology · Statistics 2013-05-01 Randy C. S. Lai , Jan Hannig , Thomas C. M. Lee

We propose a distributed method for simultaneous inference for datasets with sample size much larger than the number of covariates, i.e., N >> p, in the generalized linear models framework. When such datasets are too big to be analyzed…

Methodology · Statistics 2020-07-23 Lu Tang , Ling Zhou , Peter X. -K. Song

This paper introduces to readers the new concept and methodology of confidence distribution and the modern-day distributional inference in statistics. This discussion should be of interest to people who would like to go into the depth of…

Methodology · Statistics 2021-09-07 Yifan Cui , Min-ge Xie

While linear mixed modeling methods are foundational concepts introduced in any statistical education, adequate general methods for interval estimation involving models with more than a few variance components are lacking, especially in the…

Methodology · Statistics 2012-11-07 Jessi Cisewski , Jan Hannig

We investigate the data distribution valuation problem, which aims to quantify the values of data distributions from their samples. This is a recently proposed problem that is related to but different from classical data valuation and can…

Machine Learning · Computer Science 2026-04-08 Cuong N. Nguyen , Cuong V. Nguyen

Advances in information technology have led to extremely large datasets that are often kept in different storage centers. Existing statistical methods must be adapted to overcome the resulting computational obstacles while retaining…

Methodology · Statistics 2021-11-12 Qiong Zhang , Jiahua Chen

This paper considers distributed statistical inference for general symmetric statistics %that encompasses the U-statistics and the M-estimators in the context of massive data where the data can be stored at multiple platforms in different…

Statistics Theory · Mathematics 2018-05-30 Song Xi Chen , Liuhua Peng

In multi-center clinical trials, due to various reasons, the individual-level data are strictly restricted to be assessed publicly. Instead, the summarized information is widely available from published results. With the advance of…

Methodology · Statistics 2021-01-05 Jing Qin , Yukun Liu , Pengfei Li

The increased availability of massive data sets provides a unique opportunity to discover subtle patterns in their distributions, but also imposes overwhelming computational challenges. To fully utilize the information contained in big…

Statistics Theory · Mathematics 2018-04-12 Stanislav Volgushev , Shih-Kang Chao , Guang Cheng

The rapid emergence of massive datasets in various fields poses a serious challenge to traditional statistical methods. Meanwhile, it provides opportunities for researchers to develop novel algorithms. Inspired by the idea of…

Computation · Statistics 2023-04-14 Yuan Gao , Weidong Liu , Hansheng Wang , Xiaozhou Wang , Yibo Yan , Riquan Zhang

Fiducial inference was introduced in the first half of the 20th century by Fisher (1935) as a means to get a posterior-like distribution for a parameter without having to arbitrarily define a prior. While the method originally fell out of…

Methodology · Statistics 2023-03-01 Alexander C. Murph , Jan Hannig , Jonathan P. Williams

Post-data statistical inference concerns making probability statements about model parameters conditional on observed data. When a priori knowledge about parameters is available, post-data inference can be conveniently made from Bayesian…

Statistics Theory · Mathematics 2025-06-05 Yang Liu , Jan Hannig , Alexander C Murph

This paper develops methods of distributed Bayesian hypothesis tests for fault detection and diagnosis that are based on belief propagation and optimization in graphical models. The main challenges in developing distributed statistical…

Systems and Control · Computer Science 2015-01-20 Kwang-Ki K. Kim

In modern scientific research, massive datasets with huge numbers of observations are frequently encountered. To facilitate the computational process, a divide-and-conquer scheme is often used for the analysis of big data. In such a…

Machine Learning · Statistics 2015-05-06 Chen Xu , Yongquan Zhang , Runze Li

Fiducial inference, as generalized by Hannig et al. (2016), is applied to nonparametric g-modeling (Efron, 2016) in the discrete case. We propose a computationally efficient algorithm to sample from the fiducial distribution, and use the…

Methodology · Statistics 2022-12-21 Yifan Cui , Jan Hannig

Data sharing barriers are paramount challenges arising from multicenter clinical trials where multiple data sources are stored in a distributed fashion at different local study sites. Merging such data sources into a common data storage for…

Methodology · Statistics 2022-04-05 Mengtong Hu , Xu Shi , Peter X. -K. Song

Discrete state spaces represent a major computational challenge to statistical inference, since the computation of normalisation constants requires summation over large or possibly infinite sets, which can be impractical. This paper…

Methodology · Statistics 2023-09-04 Takuo Matsubara , Jeremias Knoblauch , François-Xavier Briol , Chris. J. Oates

In modern federated learning, one of the main challenges is to account for inherent heterogeneity and the diverse nature of data distributions for different clients. This problem is often addressed by introducing personalization of the…

Machine Learning · Statistics 2023-12-19 Nikita Kotelevskii , Samuel Horváth , Karthik Nandakumar , Martin Takáč , Maxim Panov

The focus of modern biomedical studies has gradually shifted to explanation and estimation of joint effects of high dimensional predictors on disease risks. Quantifying uncertainty in these estimates may provide valuable insight into…

Methodology · Statistics 2021-03-09 Zhe Fei , Yi Li

Divide-and-conquer methods use large-sample approximations to provide frequentist guarantees when each block of data is both small enough to facilitate efficient computation and large enough to support approximately valid inferences. When…

Methodology · Statistics 2025-04-01 Emily C. Hector , Leonardo Cella , Ryan Martin
‹ Prev 1 2 3 10 Next ›