English
Related papers

Related papers: Distributed Estimation and Inference for Semi-para…

200 papers

This paper studies hypothesis testing and parameter estimation in the context of the divide and conquer algorithm. In a unified likelihood based framework, we propose new test statistics and point estimators obtained by aggregating various…

Statistics Theory · Mathematics 2015-09-21 Heather Battey , Jianqing Fan , Han Liu , Junwei Lu , Ziwei Zhu

The rapid emergence of massive datasets in various fields poses a serious challenge to traditional statistical methods. Meanwhile, it provides opportunities for researchers to develop novel algorithms. Inspired by the idea of…

Computation · Statistics 2023-04-14 Yuan Gao , Weidong Liu , Hansheng Wang , Xiaozhou Wang , Yibo Yan , Riquan Zhang

This paper presents a class of new algorithms for distributed statistical estimation that exploit divide-and-conquer approach. We show that one of the key benefits of the divide-and-conquer strategy is robustness, an important…

Statistics Theory · Mathematics 2018-08-29 Stanislav Minsker , Nate Strawn

Although various distributed machine learning schemes have been proposed recently for pure linear models and fully nonparametric models, little attention has been paid on distributed optimization for semi-paramemetric models with…

Machine Learning · Statistics 2019-11-05 Shaogao Lv , Heng Lian

The debiased estimator is a crucial tool in statistical inference for high-dimensional model parameters. However, constructing such an estimator involves estimating the high-dimensional inverse Hessian matrix, incurring significant…

Machine Learning · Statistics 2023-12-18 Jiyuan Tu , Weidong Liu , Xiaojun Mao , Mingyue Xu

This paper presents a unified framework for supervised learning and inference procedures using the divide-and-conquer approach for high-dimensional correlated outcomes. We propose a general class of estimators that can be implemented in a…

Statistics Theory · Mathematics 2020-09-22 Emily C. Hector , Peter X. -K. Song

The distributed Hill estimator is a divide-and-conquer algorithm for estimating the extreme value index when data are stored in multiple machines. In applications, estimates based on the distributed Hill estimator can be sensitive to the…

Methodology · Statistics 2021-12-21 Liujun Chen , Deyuan Li , Chen Zhou

We consider quantile estimation in a semi-supervised setting, characterized by two available data sets: (i) a small or moderate sized labeled data set containing observations for a response and a set of possibly high dimensional covariates,…

Methodology · Statistics 2024-08-15 Abhishek Chakrabortty , Guorong Dai , Raymond J. Carroll

Semiparametric discrete choice models are widely used in a variety of practical applications. While these models are point identified in the presence of continuous covariates, they can become partially identified when covariates are…

Econometrics · Economics 2024-05-29 Shakeeb Khan , Tatiana Komarova , Denis Nekipelov

Estimation and inference with modern longitudinal data from wearable devices, which consist of biological signals at high-frequency time points, is burdened by massive computational costs. We propose a distributed estimation and inference…

Methodology · Statistics 2023-09-13 Cole Manschot , Emily C. Hector

This paper studies the problem of nonparametric estimation of a smooth function with data distributed across multiple machines. We assume an independent sample from a white noise model is collected at each machine, and an estimator of the…

Machine Learning · Statistics 2018-06-26 Yuancheng Zhu , John Lafferty

The growing size of modern data brings many new challenges to existing statistical inference methodologies and theories, and calls for the development of distributed inferential approaches. This paper studies distributed inference for…

Machine Learning · Statistics 2019-10-01 Xiaozhou Wang , Zhuoyi Yang , Xi Chen , Weidong Liu

The increased availability of massive data sets provides a unique opportunity to discover subtle patterns in their distributions, but also imposes overwhelming computational challenges. To fully utilize the information contained in big…

Statistics Theory · Mathematics 2018-04-12 Stanislav Volgushev , Shih-Kang Chao , Guang Cheng

We consider the design of identical one-bit probabilistic quantizers for distributed estimation in sensor networks. We assume the parameter-range to be finite and known and use the maximum Cram\'er-Rao Lower Bound (CRB) over the…

Information Theory · Computer Science 2012-06-01 Swarnendu Kar , Hao Chen , Pramod K. Varshney

Estimating statistical models within sensor networks requires distributed algorithms, in which both data and computation are distributed across the nodes of the network. We propose a general approach for distributed learning based on…

Machine Learning · Computer Science 2012-07-03 Qiang Liu , Alexander Ihler

This article introduces an iterative distributed computing estimator for the multinomial logistic regression model with large choice sets. Compared to the maximum likelihood estimator, the proposed iterative distributed estimator achieves…

Econometrics · Economics 2024-12-03 Yanqin Fan , Yigit Okar , Xuetao Shi

This study proposes a computationally efficient semiparametric distribution estimator, which is a slight modification of the naive mixture proposed by Schuster and Yakowitz (1985) and Olkin and Spiegelman (1987). The proposed method is…

Statistics Theory · Mathematics 2025-09-12 Taku Moriyama

This paper considers distributed M-estimation under heterogeneous distributions among distributed data blocks. A weighted distributed estimator is proposed to improve the efficiency of the standard "Split-And-Conquer" (SaC) estimator for…

Statistics Theory · Mathematics 2022-09-15 Jia Gu , Songxi Chen

Previous analysis of regularized functional linear regression in a reproducing kernel Hilbert space (RKHS) typically requires the target function to be contained in this kernel space. This paper studies the convergence performance of…

Machine Learning · Statistics 2024-02-20 Jiading Liu , Lei Shi

This paper proposes a new method for estimating high-dimensional binary choice models. We consider a semiparametric model that places no distributional assumptions on the error term, allows for heteroskedastic errors, and permits endogenous…

Econometrics · Economics 2025-07-15 Fu Ouyang , Thomas Tao Yang
‹ Prev 1 2 3 10 Next ›