Related papers: Modelling Interactions in High-dimensional Data wi…

Model-based regression clustering for high-dimensional data. Application to functional data

Finite mixture regression models are useful for modeling the relationship between response and predictors, arising from different subpopulations. In this article, we study high-dimensional predictors and high-dimensional response, and…

Statistics Theory · Mathematics 2016-01-07 Emilie Devijver

Lasso Penalization for High-Dimensional Beta Regression Models: Computation, Analysis, and Inference

Beta regression is commonly employed when the outcome variable is a proportion. Since its conception, the approach has been widely used in applications spanning various scientific fields. A series of extensions have been proposed over time,…

Methodology · Statistics 2025-07-29 Niloofar Ramezani, Martin Slawski

Inference in High Dimensions with the Penalized Score Test

In recent years, there has been considerable theoretical development regarding variable selection consistency of penalized regression techniques, such as the lasso. However, there has been relatively little work on quantifying the…

Methodology · Statistics 2014-05-21 Arend Voorman, Ali Shojaie, Daniela Witten

Accurate Inference for Penalized Logistic Regression

Inference for high-dimensional logistic regression models using penalized methods has been a challenging research problem. As an illustration, a major difficulty is the significant bias of the Lasso estimator, which limits its direct…

Methodology · Statistics 2024-10-29 Yuming Zhang, Stéphane Guerrier, Runze Li

"Pre-conditioning" for feature selection and regression in high-dimensional problems

We consider regression problems where the number of predictors greatly exceeds the number of observations. We propose a method for variable selection that first estimates the regression function, yielding a "pre-conditioned" response…

Statistics Theory · Mathematics 2013-04-16 Debashis Paul, Eric Bair, Trevor Hastie, Robert Tibshirani

Bayesian Classification and Regression with High Dimensional Features

This thesis responds to the challenges of using a large number, such as thousands, of features in regression and classification problems. There are two situations where such high dimensional features arise. One is when high dimensional…

Machine Learning · Statistics 2007-09-20 Longhai Li

Data-Driven Tuning Parameter Selection for High-Dimensional Vector Autoregressions

Lasso-type estimators are routinely used to estimate high-dimensional time series models. The theoretical guarantees established for these estimators typically require the penalty level to be chosen in a suitable fashion often depending on…

Econometrics · Economics 2024-12-24 Anders Bredahl Kock, Rasmus Søndergaard Pedersen, Jesper Riis-Vestergaard Sørensen

A lasso for hierarchical interactions

We add a set of convex constraints to the lasso to produce sparse interaction models that honor the hierarchy restriction that an interaction only be included in a model if one or both variables are marginally important. We give a precise…

Methodology · Statistics 2013-06-20 Jacob Bien, Jonathan Taylor, Robert Tibshirani

Post-Lasso Inference for High-Dimensional Regression

Among the most popular variable selection procedures in high-dimensional regression, Lasso provides a solution path to rank the variables and determines a cut-off position on the path to select variables and estimate coefficients. In this…

Methodology · Statistics 2018-06-19 X. Jessie Jeng, Huimin Peng, Wenbin Lu

Large Vector Auto Regressions

One popular approach for nonstructural economic and financial forecasting is to include a large number of economic and financial variables, which has been shown to lead to significant improvements for forecasting, for example, by the…

Machine Learning · Statistics 2011-06-21 Song Song, Peter J. Bickel

Cooperative Thresholded Lasso for Sparse Linear Bandit

We present a novel approach to address the multi-agent sparse contextual linear bandit problem, in which the feature vectors have a high dimension $d$ whereas the reward function depends on only a limited set of features - precisely $s_0…

Machine Learning · Computer Science 2023-05-31 Haniyeh Barghi, Xiaotong Cheng, Setareh Maghsudi

High-dimensional regression in practice: an empirical study of finite-sample prediction, variable selection and ranking

Penalized likelihood approaches are widely used for high-dimensional regression. Although many methods have been proposed and the associated theory is now well-developed, the relative efficacy of different approaches in finite-sample…

Methodology · Statistics 2020-01-29 Fan Wang, Sach Mukherjee, Sylvia Richardson, Steven M. Hill

High Dimensional Causal Inference with Variational Backdoor Adjustment

Backdoor adjustment is a technique in causal inference for estimating interventional quantities from purely observational data. For example, in medical settings, backdoor adjustment can be used to control for confounding and estimate the…

Artificial Intelligence · Computer Science 2023-10-11 Daniel Israel, Aditya Grover, Guy Van den Broeck

High-dimensional Interactions Detection with Sparse Principal Hessian Matrix

In statistical learning framework with regressions, interactions are the contributions to the response variable from the products of the explanatory variables. In high-dimensional problems, detecting interactions is challenging due to…

Methodology · Statistics 2019-10-01 Cheng Yong Tang, Ethan X. Fang, Yuexiao Dong

High-dimensional variable selection via tilting

The paper considers variable selection in linear regression models where the number of covariates is possibly much larger than the number of observations. High dimensionality of the data brings in many complications, such as (possibly…

Methodology · Statistics 2016-11-29 Haeran Cho, Piotr Fryzlewicz

Reluctant Interaction Modeling

Including pairwise interactions between the predictors of a regression model can produce better predicting models. However, to fit such interaction models on typical data sets in biology and other fields can often require solving enormous…

Methodology · Statistics 2023-02-14 Guo Yu, Jacob Bien, Ryan Tibshirani

Interaction pursuit in high-dimensional multi-response regression via distance correlation

Feature interactions can contribute to a large proportion of variation in many prediction models. In the era of big data, the coexistence of high dimensionality in both responses and covariates poses unprecedented challenges in identifying…

Methodology · Statistics 2016-05-12 Yinfei Kong, Daoji Li, Yingying Fan, Jinchi Lv

A Survey of Estimation Methods for Sparse High-dimensional Time Series Models

High-dimensional time series datasets are becoming increasingly common in many areas of biological and social sciences. Some important applications include gene regulatory network reconstruction using time course gene expression data, brain…

Methodology · Statistics 2021-08-02 Sumanta Basu, David S. Matteson

Look-Ahead Screening Rules for the Lasso

The lasso is a popular method to induce shrinkage and sparsity in the solution vector (coefficients) of regression problems, particularly when there are many predictors relative to the number of observations. Solving the lasso in this…

Machine Learning · Statistics 2024-05-14 Johan Larsson

Recall Traces: Backtracking Models for Efficient Reinforcement Learning

In many environments only a tiny subset of all states yield high reward. In these cases, few of the interactions with the environment provide a relevant learning signal. Hence, we may want to preferentially train on those high-reward states…

Machine Learning · Computer Science 2019-01-30 Anirudh Goyal, Philemon Brakel, William Fedus, Soumye Singhal, Timothy Lillicrap, Sergey Levine, Hugo Larochelle, Yoshua Bengio