Related papers: Data integration in high dimension with multiple q…

200 papers

We consider a problem of data integration. Consider determining which genes affect a disease. The genes, which we call predictor objects, can be measured in different experiments on the same individual. We address the question of finding…

Machine Learning · Statistics 2016-10-04 Xin Gao, Raymond J. Carroll

We consider the task of meta-analysis in high-dimensional settings in which the data sources are similar but non-identical. To borrow strength across such heterogeneous datasets, we introduce a global parameter that emphasizes…

Methodology · Statistics 2022-07-01 Subha Maity, Yuekai Sun, Moulinath Banerjee

Quantile regression has been successfully used to study heterogeneous and heavy-tailed data. Varying-coefficient models are frequently used to capture changes in the effect of input variables on the response as a function of an index or…

Methodology · Statistics 2021-10-18 Ran Dai, Mladen Kolar

This paper presents a selective survey of recent developments in statistical inference and multiple testing for high-dimensional regression models, including linear and logistic regression. We examine the construction of confidence…

Methodology · Statistics 2023-01-26 T. Tony Cai, Zijian Guo, Yin Xia

We endeavour to estimate numerous multi-dimensional means of various probability distributions on a common space based on independent samples. Our approach involves forming estimators through convex combinations of empirical means derived…

Machine Learning · Statistics 2025-03-11 Gilles Blanchard, Jean-Baptiste Fermanian, Hannah Marienwald

Measurement involves the determination of quantitative estimates of physical quantities from experiment, along with estimates of their associated uncertainties. Herewith an experimental system model is the key to extracting information from…

Applications · Statistics 2008-09-01 Vladimir B. Bokov

Data integration has become increasingly popular owing to the availability of multiple data sources. This study considered quantile regression estimation when a key covariate had multiple proxies across several datasets. In a unified…

Methodology · Statistics 2022-10-25 Dongyoung Go, Jongho Im, Ick Hoon Jin

Statistical learning evolves quickly with more and more sophisticated models proposed to incorporate the complicated data structure from modern scientific and business problems. Varying index coefficient models extend varying coefficient…

Statistics Theory · Mathematics 2019-03-05 Li Jialiang, Lv Jing

This article considers a linear model in a high dimensional data scenario. We propose a process which uses multiple loss functions both to select relevant predictors and to estimate parameters, and study its asymptotic properties. Variable…

Methodology · Statistics 2020-07-01 Guorong Dai, Ursula U. Müller

It is becoming increasingly common for researchers to consider incorporating external information from large studies to improve the accuracy of statistical inference instead of relying on a modestly sized dataset collected internally. With…

Methodology · Statistics 2021-07-20 Tian Gu, Jeremy M. G. Taylor, Bhramar Mukherjee

The paper considers variable selection in linear regression models where the number of covariates is possibly much larger than the number of observations. High dimensionality of the data brings in many complications, such as (possibly…

Methodology · Statistics 2016-11-29 Haeran Cho, Piotr Fryzlewicz

With the availability of high dimensional genetic biomarkers, it is of interest to identify heterogeneous effects of these predictors on patients' survival, along with proper statistical inference. Censored quantile regression has emerged…

Methodology · Statistics 2021-07-26 Zhe Fei, Qi Zheng, Hyokyoung G. Hong, Yi Li

This paper studies policy evaluation with multiple data sources, especially in scenarios that involve one experimental dataset with two arms, complemented by a historical dataset generated under a single control arm. We propose novel data…

Machine Learning · Statistics 2024-06-04 Ting Li, Chengchun Shi, Qianglin Wen, Yang Sui, Yongli Qin, Chunbo Lai, Hongtu Zhu

As medical devices become more complex, they routinely collect extensive and complicated data. While classical regressions typically examine the relationship between an outcome and a vector of predictors, it becomes imperative to identify…

Methodology · Statistics 2024-05-16 Huaqing Jin, Fei Jiang

Data analysis based on information from several sources is common in economic and biomedical studies. This setting is often referred to as the data fusion problem, which differs from traditional missing data problems since no complete data…

Methodology · Statistics 2022-04-07 Wei Li, Shanshan Luo, Wangli Xu

The increased availability of massive data sets provides a unique opportunity to discover subtle patterns in their distributions, but also imposes overwhelming computational challenges. To fully utilize the information contained in big…

Statistics Theory · Mathematics 2018-04-12 Stanislav Volgushev, Shih-Kang Chao, Guang Cheng

The paper considers linear regression problems where the number of predictor variables is possibly larger than the sample size. The basic motivation of the study is to combine the points of view of model selection and functional regression…

Statistics Theory · Mathematics 2012-02-24 Alois Kneip, Pascal Sarda

We consider the problem of multi-task learning in the high dimensional setting. In particular, we introduce an estimator and investigate its statistical and computational properties for the problem of multiple connected linear regressions…

Machine Learning · Statistics 2023-07-03 Amir Asiaee, Samet Oymak, Kevin R. Coombes, Arindam Banerjee

Most data sets comprise measurements on continuous and categorical variables. In the statistics literature on regression and classification, modeling high-dimensional mixed predictors has received limited attention. In this paper we study the…

Statistics Theory · Mathematics 2021-10-26 Efstathia Bura, Liliana Forzani, Rodrigo García Arancibia, Pamela Llop, Diego Tomassi

This paper studies the case of possibly high-dimensional covariates in the regression discontinuity design (RDD) analysis. In particular, we propose estimation and inference methods for the RDD models with covariate selection which perform…

Econometrics · Economics 2026-01-21 Yoichi Arai, Taisuke Otsu, Myung Hwan Seo