Related papers: A High-dimensional M-estimator Framework for Bi-le…

BIVAS: A scalable Bayesian method for bi-level variable selection with applications

In this paper, we consider a Bayesian bi-level variable selection problem in high-dimensional regressions. In many practical situations, it is natural to assign group membership to each predictor. Examples include that genetic variants can…

Applications · Statistics 2018-03-29 Mingxuan Cai , Mingwei Dai , Jingsi Ming , Heng Peng , Jin Liu , Can Yang

A Two-Stage Variable Selection Approach for Correlated High Dimensional Predictors

When fitting statistical models, some predictors are often found to be correlated with each other, and functioning together. Many group variable selection methods are developed to select the groups of predictors that are closely related to…

Methodology · Statistics 2021-03-25 Zhiyuan Li

Regularization Methods for High-Dimensional Instrumental Variables Regression With an Application to Genetical Genomics

In genetical genomics studies, it is important to jointly analyze gene expression data and genetic variants in exploring their associations with complex traits, where the dimensionality of gene expressions and genetic variants can both be…

Methodology · Statistics 2014-04-15 Wei Lin , Rui Feng , Hongzhe Li

Parallel subgroup analysis of high-dimensional data via M-regression

It becomes an interesting problem to identify subgroup structures in data analysis as populations are probably heterogeneous in practice. In this paper, we consider M-estimators together with both concave and pairwise fusion penalties,…

Methodology · Statistics 2020-05-04 Chao Cheng , Xingdong Feng

Bayesian Bi-level Sparse Group Regressions for Macroeconomic Density Forecasting

We propose a Machine Learning approach for optimal macroeconomic density forecasting in a high-dimensional setting where the underlying model exhibits a known group structure. Our approach is general enough to encompass specific forecasting…

Econometrics · Economics 2024-11-18 Matteo Mogliani , Anna Simoni

Beyond Support in Two-Stage Variable Selection

Numerous variable selection methods rely on a two-stage procedure, where a sparsity-inducing penalty is used in the first stage to predict the support, which is then conveyed to the second stage for estimation or inference purposes. In this…

Applications · Statistics 2015-05-28 Jean-Michel Bécu , Yves Grandvalet , Christophe Ambroise , Cyril Dalmasso

M-estimation in multistage sampling procedures

Multi-stage (designed) procedures, obtained by splitting the sampling budget suitably across stages, and designing the sampling at a particular stage based on information about the parameter obtained from previous stages, are often…

Methodology · Statistics 2014-01-08 Atul Mallik , Moulinath Banerjee , George Michailidis

Selection of variables and decision boundaries for functional data via bi-level selection

Sparsity-inducing penalties are useful tools for variable selection and they are also effective for regression settings where the data are functions. We consider the problem of selecting not only variables but also decision boundaries in…

Methodology · Statistics 2020-06-01 Hidetoshi Matsui

Linear Discriminant Analysis with High-dimensional Mixed Variables

Datasets containing both categorical and continuous variables are frequently encountered in many areas, and with the rapid development of modern measurement technologies, the dimensions of these variables can be very high. Despite the…

Methodology · Statistics 2024-01-03 Binyan Jiang , Chenlei Leng , Cheng Wang , Zhongqing Yang , Xinyang Yu

Improving Group Lasso for high-dimensional categorical data

Sparse modelling or model selection with categorical data is challenging even for a moderate number of variables, because one parameter is roughly needed to encode one category or level. The Group Lasso is a well known efficient algorithm…

Methodology · Statistics 2022-11-14 Szymon Nowakowski , Piotr Pokarowski , Wojciech Rejchel , Agnieszka Sołtys

Estimation and variable selection in high dimension in nonlinear mixed-effects models

We consider nonlinear mixed effects models including high-dimensional covariates to model individual parameters variability. The objective is to identify relevant covariates among a large set under sparsity assumption and to estimate model…

Statistics Theory · Mathematics 2025-08-06 Antoine Caillebotte , Estelle Kuhn , Sarah Lemler

Two-Stage Robust and Sparse Distributed Statistical Inference for Large-Scale Data

In this paper, we address the problem of conducting statistical inference in settings involving large-scale data that may be high-dimensional and contaminated by outliers. The high volume and dimensionality of the data require distributed…

Machine Learning · Statistics 2022-11-30 Emadaldin Mozafari-Majd , Visa Koivunen

Estimation for ultra-high dimensional factor model: a pivotal variable detection based approach

For factor model, the involved covariance matrix often has no row sparse structure because the common factors may lead some variables to strongly associate with many others. Under the ultra-high dimensional paradigm, this feature causes…

Methodology · Statistics 2014-09-22 Junlong Zhao , Hongyu Zhao , Lixing Zhu

Sparse covariance thresholding for high-dimensional variable selection

In high-dimensions, many variable selection methods, such as the lasso, are often limited by excessive variability and rank deficiency of the sample covariance matrix. Covariance sparsity is a natural phenomenon in high-dimensional…

Methodology · Statistics 2010-06-08 X. Jessie Jeng And Z. John Daye

Sparse Learning and Class Probability Estimation with Weighted Support Vector Machines

Classification and probability estimation are fundamental tasks with broad applications across modern machine learning and data science, spanning fields such as biology, medicine, engineering, and computer science. Recent development of…

Methodology · Statistics 2026-03-25 Liyun Zeng , Hao Helen Zhang

M-estimation in high-dimensional linear model

We mainly study the M-estimation method for the high-dimensional linear regression model, and discuss the properties of M-estimator when the penalty term is the local linear approximation. In fact, M-estimation method is a framework, which…

Probability · Mathematics 2018-10-31 Kai Wang , Yanling Zhu

Relax and penalize: a new bilevel approach to mixed-binary hyperparameter optimization

In recent years, bilevel approaches have become very popular to efficiently estimate high-dimensional hyperparameters of machine learning models. However, to date, binary parameters are handled by continuous relaxation and rounding…

Machine Learning · Computer Science 2025-03-20 Sara Venturini , Marianna de Santis , Jordan Patracone , Francesco Rinaldi , Saverio Salzo , Martin Schmidt

Optimal Feature Selection in High-Dimensional Discriminant Analysis

We consider the high-dimensional discriminant analysis problem. For this problem, different methods have been proposed and justified by establishing exact convergence rates for the classification risk, as well as the l2 convergence results…

Machine Learning · Statistics 2013-06-28 Mladen Kolar , Han Liu

Bayesian Variable Selection Under High-dimensional Settings With Grouped Covariates

Consider the normal linear regression setup when the number of covariates p is much larger than the sample size n, and the covariates form correlated groups. The response variable y is not related to an entire group of covariates in all or…

Methodology · Statistics 2023-09-06 Pranay Agarwal , Subhajit Dutta , Minerva Mukhopadhyay

On Iterative Hard Thresholding Methods for High-dimensional M-Estimation

The use of M-estimators in generalized linear regression models in high dimensional settings requires risk minimization with hard $L_0$ constraints. Of the known methods, the class of projected gradient descent (also known as iterative hard…

Machine Learning · Computer Science 2014-10-22 Prateek Jain , Ambuj Tewari , Purushottam Kar