Related papers: A Partially Linear Framework for Massive Heterogen…

200 papers

We consider an additive partially linear framework for modelling massive heterogeneous data. The major goal is to extract multiple common features simultaneously across all sub-populations while exploring heterogeneity of each…

Methodology · Statistics 2019-01-01 Binhuan Wang, Yixin Fang, Heng Lian, Hua Liang

A massive dataset often consists of a growing number of (potentially) heterogeneous sub-populations. This paper is concerned with testing various forms of heterogeneity arising from massive data. In a general nonparametric framework, a set…

Statistics Theory · Mathematics 2016-01-26 Junwei Lu, Guang Cheng, Han Liu

In this paper, subgroup least squares and convex clustering are introduced for inferring a partially heterogeneous linear regression that has potential applications in precision marketing and precision medicine. The…

Methodology · Statistics 2017-11-02 Lu Lin, Jun Lu, Chen Lin

We consider a flexible semiparametric quantile regression model for analyzing high dimensional heterogeneous data. This model has several appealing features: (1) By considering different conditional quantiles, we may obtain a more complete…

Statistics Theory · Mathematics 2016-01-25 Ben Sherwood, Lan Wang

We propose generalized additive partial linear models for complex data which allow one to capture nonlinear patterns of some covariates, in the presence of linear components. The proposed method improves estimation efficiency and increases…

Statistics Theory · Mathematics 2014-05-26 Li Wang, Lan Xue, Annie Qu, Hua Liang

We present a nonlinear regression framework based on tensor algebra tailored to high dimensional contexts where data is scarce. We exploit algebraic properties of a partial tensor product, namely the m-tensor product, to leverage structured…

Computational Engineering, Finance, and Science · Computer Science 2026-02-10 Rémi Cloarec, Sebastian Rodriguez, Xavier Kestelyn, Francisco Chinesta

This paper analyzes a semiparametric model of network formation in the presence of unobserved agent-specific heterogeneity. The objective is to identify and estimate the preference parameters associated with homophily on observed attributes…

Econometrics · Economics 2020-09-01 Luis E. Candelaria

Recently, high-dimensional heterogeneous data have attracted a lot of attention and discussion. Under heterogeneity, semiparametric regression is a popular choice to model data in statistics. In this paper, we take advantage of expectile…

Statistics Theory · Mathematics 2019-08-20 Jun Zhao, Guan'ao Yan, Yi Zhang

Linear regression is a fundamental and popular statistical method. There are various kinds of linear regression, such as mean regression and quantile regression. In this paper, we propose a new one called distribution regression, which…

Methodology · Statistics 2017-12-27 Xin Chen, Xuejun Ma, Wang Zhou

In this paper we propose a general series method to estimate a semiparametric partially linear varying coefficient model. We establish the consistency and $\sqrt{n}$-normality property of the estimator of the finite-dimensional parameters of…

Statistics Theory · Mathematics 2007-06-13 Ibrahim Ahmad, Sittisak Leelahanon, Qi Li

Prediction polling is an increasingly popular form of crowdsourcing in which multiple participants estimate the probability or magnitude of some future event. These estimates are then aggregated into a single forecast. Historically,…

Methodology · Statistics 2016-04-25 Ville A. Satopää, Shane T. Jensen, Robin Pemantle, Lyle H. Ungar

In many complex applications, data heterogeneity and homogeneity exist simultaneously. Ignoring either one will result in incorrect statistical inference. In addition, coping with complex data that are non-Euclidean becomes more common. To…

Methodology · Statistics 2021-05-28 Zixuan Han, Tao Li, Jinhong You

We consider efficient estimation of the Euclidean parameters in generalized partially linear additive models for longitudinal/clustered data when multiple covariates need to be modeled nonparametrically, and propose an estimation…

Statistics Theory · Mathematics 2014-02-05 Guang Cheng, Lan Zhou, Jianhua Z. Huang

In conventional statistical and machine learning methods, it is typically assumed that the test data are identically distributed with the training data. However, this assumption does not always hold, especially in applications where the…

Methodology · Statistics 2023-09-19 Sai Li, Linjun Zhang

This article introduces a novel nonparametric methodology for Generalized Linear Models which combines the strengths of the binary regression and latent variable formulations for categorical data, while overcoming their disadvantages.…

Machine Learning · Statistics 2021-10-12 K. P. Chowdhury

In this paper, we consider a partial deconvolution kernel estimator for nonparametric regression when some covariates are measured with error while others are observed without error. We focus on a general and realistic setting in which the…

Statistics Theory · Mathematics 2026-01-29 Baba Thiam

A full parametric and linear specification may be insufficient to capture complicated patterns in studies exploring complex features, such as those investigating age-related changes in brain functional abilities. Alternatively, a partially…

Methodology · Statistics 2024-02-07 Jia Liang, Shuo Chen, Peter Kochunov, L Elliot Hong, Chixiang Chen

Inference on the parametric part of a semiparametric model is no trivial task. If one approximates the infinite dimensional part of the semiparametric model by a parametric function, one obtains a parametric model that is in some sense…

Statistics Theory · Mathematics 2025-09-23 Adam Lee, Emil A. Stoltenberg, Per A. Mykland

I develop a methodology to partially identify linear combinations of conditional mean outcomes when the researcher only has access to aggregate data. Unlike the existing literature, I only allow for marginal, not joint, distributions of…

Econometrics · Economics 2025-12-04 Sarah Moon

In this manuscript a unified framework for conducting inference on complex aggregated data in high dimensional settings is proposed. The data are assumed to be a collection of multiple non-Gaussian realizations with underlying undirected…

Applications · Statistics 2013-10-14 Fang Han, Han Liu, Brian Caffo