English
Related papers

Related papers: Variable selection via thresholding

200 papers

Threshold selection plays a key role for various aspects of statistical inference of rare events. Most classical approaches tackling this problem for heavy-tailed distributions crucially depend on tuning parameters or critical values to be…

Methodology · Statistics 2019-03-07 Laura Fee Schneider , Andrea Krajina , Tatyana Krivobokova

Variable selection is a procedure to attain the truly important predictors from inputs. Complex nonlinear dependencies and strong coupling pose great challenges for variable selection in high-dimensional data. In addition, real-world…

Methodology · Statistics 2023-07-04 Keyao Wang , Huiwen Wang , Jichang Zhao , Lihong Wang

This note discusses the problem of choosing between hypotheses in a situation with many, correlated non-normal variables. A new method is introduced to shrink the many variables into a smaller subset of variables with zero mean, unit…

Data Analysis, Statistics and Probability · Physics 2007-05-23 Byron P. Roe

Threshold selection is a fundamental problem in any threshold-based extreme value analysis. While models are asymptotically motivated, selecting an appropriate threshold for finite samples is difficult and highly subjective through standard…

Methodology · Statistics 2024-10-30 Conor Murphy , Jonathan A. Tawn , Zak Varty

In this paper, we are concerned with how to select significant variables in semiparametric modeling. Variable selection for semiparametric regression models consists of two components: model selection for nonparametric components and…

Statistics Theory · Mathematics 2008-12-18 Runze Li , Hua Liang

This paper studies sparse covariance operator estimation for nonstationary processes with sharply varying marginal variance and small correlation lengthscale. We introduce a covariance operator estimator that adaptively thresholds the…

Statistics Theory · Mathematics 2025-06-23 Omar Al-Ghattas , Daniel Sanz-Alonso

In [Lavielle and Ludena 07], a random thresholding metho d is intro duced to select the significant, or non null, mean terms among a collection of independent random variables, and applied to the problem of recovering the significant…

Methodology · Statistics 2010-10-27 Merlin Keller , Marc Lavielle

Variable selection plays a fundamental role in high-dimensional data analysis. Various methods have been developed for variable selection in recent years. Well-known examples are forward stepwise regression (FSR) and least angle regression…

Methodology · Statistics 2018-02-01 Siliang Gong , Kai Zhang , Yufeng Liu

Variable selection, also known as feature selection in machine learning, plays an important role in modeling high dimensional data and is key to data-driven scientific discoveries. We consider here the problem of detecting influential…

Methodology · Statistics 2014-09-24 Bo Jiang , Jun S. Liu

Variable selection has received widespread attention over the last decade as we routinely encounter high-throughput datasets in complex biological and environment research. Most Bayesian variable selection methods are restricted to mixture…

Methodology · Statistics 2015-03-24 Hanning Li , Debdeep Pati

Soft-thresholding is a sparse modeling method that is typically applied to wavelet denoising in statistical signal processing and analysis. It has a single parameter that controls a threshold level on wavelet coefficients and,…

Methodology · Statistics 2016-02-01 Katsuyuki Hagiwara

In high-dimensions, many variable selection methods, such as the lasso, are often limited by excessive variability and rank deficiency of the sample covariance matrix. Covariance sparsity is a natural phenomenon in high-dimensional…

Methodology · Statistics 2010-06-08 X. Jessie Jeng And Z. John Daye

Variable selection over a potentially large set of covariates in a linear model is quite popular. In the Bayesian context, common prior choices can lead to a posterior expectation of the regression coefficients that is a sparse (or nearly…

Methodology · Statistics 2025-12-02 Debamita Kundu , Riten Mitra , Jeremy T. Gaskins

We propose a method for variable selection in discriminant analysis with mixed categorical and continuous variables. This method is based on a criterion that permits to reduce the variable selection problem to a problem of estimating…

Statistics Theory · Mathematics 2017-03-14 Alban Mbina Mbina , Guy Martial Nkiet , Fulgence Eyi Obiang

Efficient recovery of a low-dimensional structure from high-dimensional data has been pursued in various settings including wavelet denoising, generalized linear models and low-rank matrix estimation. By thresholding some parameters to…

Methodology · Statistics 2017-08-14 Caroline Giacobino , Sylvain Sardy , Jairo Diaz-Rodriguez , Nick Hengartner

In big data analysis, a simple task such as linear regression can become very challenging as the variable dimension $p$ grows. As a result, variable screening is inevitable in many scientific studies. In recent years, randomized algorithms…

Methodology · Statistics 2019-02-13 Yu-Hsiang Cheng , Tzee-Ming Huang , Su-Yun Huang

We study a linear high-dimensional regression model in a semi-supervised setting, where for many observations only the vector of covariates $X$ is given with no response $Y$. We do not make any sparsity assumptions on the vector of…

Statistics Theory · Mathematics 2021-09-03 Ilan Livne , David Azriel , Yair Goldberg

We consider testing whether a set of Gaussian variables, selected from the data, is independent of the remaining variables. We assume that this set is selected via a very simple approach that is commonly used across scientific disciplines:…

Methodology · Statistics 2022-11-04 Arkajyoti Saha , Daniela Witten , Jacob Bien

We propose a new method for input variable selection in nonlinear regression. The method is embedded into a kernel regression machine that can model general nonlinear functions, not being a priori limited to additive models. This is the…

Machine Learning · Computer Science 2018-09-05 Magda Gregorová , Jason Ramapuram , Alexandros Kalousis , Stéphane Marchand-Maillet

We consider the problem of online active learning to collect data for regression modeling. Specifically, we consider a decision maker with a limited experimentation budget who must efficiently learn an underlying linear population model.…

Machine Learning · Statistics 2016-12-22 Carlos Riquelme , Ramesh Johari , Baosen Zhang
‹ Prev 1 2 3 10 Next ›