Related papers: Analysis of Testing-Based Forward Model Selection

Forward regression is a crucial methodology for automatically identifying important predictors from a large pool of potential covariates. In contexts with moderate predictor correlation, forward selection techniques can achieve screening…

Methodology · Statistics 2024-08-23 Xuejun Jiang, Yue Ma, Haofeng Wang

Sharp Convergence Rates for Forward Regression in High-Dimensional Sparse Linear Models

Forward regression is a statistical model selection and estimation procedure which inductively selects covariates that add predictive power into a working statistical regression model. Once a model is selected, unknown regression parameters…

Machine Learning · Statistics 2018-04-12 Damian Kozbur

Default Bayesian Model Selection of Constrained Multivariate Normal Linear Models

The multivariate normal linear model is one of the most widely employed models for statistical inference in applied research. Special cases include (multivariate) t testing, (M)AN(C)OVA, (multivariate) multiple regression, and repeated…

Methodology · Statistics 2021-03-15 J. Mulder, H. Hoijtink, X. Gu

Targeted Function Balancing

This paper introduces Targeted Function Balancing (TFB), a covariate balancing weights framework for estimating the average treatment effect of a binary intervention. TFB first regresses an outcome on covariates, and then selects weights…

Methodology · Statistics 2025-04-10 Leonard Wainstein, He Bai

Understanding the Implicit Biases of Design Choices for Time Series Foundation Models

Time series foundation models (TSFMs) are a class of potentially powerful, general-purpose tools for time series forecasting and related temporal tasks, but their behavior is strongly shaped by subtle inductive biases in their design.…

Machine Learning · Computer Science 2025-10-23 Annan Yu, Danielle C. Maddix, Boran Han, Xiyuan Zhang, Abdul Fatir Ansari, Oleksandr Shchur, Christos Faloutsos, Andrew Gordon Wilson, Michael W. Mahoney, Yuyang Wang

Forward Stability and Model Path Selection

Most scientific publications follow the familiar recipe of (i) obtain data, (ii) fit a model, and (iii) comment on the scientific relevance of the effects of particular covariates in that model. This approach, however, ignores the fact that…

Methodology · Statistics 2021-03-08 Nicholas Kissel, Lucas Mentch

Calibrating Model-Based Inferences and Decisions

As the frontiers of applied statistics progress through increasingly complex experiments we must exploit increasingly sophisticated inferential models to analyze the observations we make. In order to avoid misleading or outright erroneous…

Methodology · Statistics 2018-03-23 Michael Betancourt

Methods of Selective Inference for Linear Mixed Models: a Review and Empirical Comparison

Selective inference aims at providing valid inference after a data-driven selection of models or hypotheses. It is essential to avoid overconfident results and replicability issues. While significant advances have been made in this area for…

Methodology · Statistics 2025-03-14 Matteo D'Alessandro, Magne Thoresen

TSFM in-context learning for time-series classification of bearing-health status

We introduce a classification method based on in-context learning using time-series foundation models (TSFMs). We demonstrate how data not included in the TSFM training can be classified without fine-tuning the foundation model or training…

Machine Learning · Computer Science 2026-03-11 Michel Tokic, Slobodan Djukanović, Anja von Beuningen, Cheng Feng

Test Case Selection and Prioritization Using Machine Learning: A Systematic Literature Review

Regression testing is an essential activity to assure that software code changes do not adversely affect existing functionalities. With the wide adoption of Continuous Integration (CI) in software projects, which increases the frequency of…

Software Engineering · Computer Science 2022-09-07 Rongqi Pan, Mojtaba Bagherzadeh, Taher A. Ghaleb, Lionel Briand

A Statistical-Modelling Approach to Feedforward Neural Network Model Selection

Feedforward neural networks (FNNs) can be viewed as non-linear regression models, where covariates enter the model through a combination of weighted summations and non-linear functions. Although these models have some similarities to the…

Methodology · Statistics 2024-05-02 Andrew McInerney, Kevin Burke

Safe preselection in lasso-type problems by cross-validation freezing

We propose a new approach to safe variable preselection in high-dimensional penalized regression, such as the lasso. Preselection - to start with a manageable set of covariates - has often been implemented without clear appreciation of its…

Methodology · Statistics 2012-12-10 Linn Cecilie Bergersen, Ismaïl Ahmed, Arnoldo Frigessi, Ingrid K. Glad, Sylvia Richardson

A projection-based model checking for heterogeneous treatment effect

In this paper, we investigate the hypothesis testing problem that checks whether part of covariates / confounders significantly affect the heterogeneous treatment effect given all covariates. This model checking is particularly useful in…

Statistics Theory · Mathematics 2020-09-24 Niwen Zhou, Xu Guo, Lixing Zhu

A Flexible Framework for Hypothesis Testing in High-dimensions

Hypothesis testing in the linear regression model is a fundamental statistical problem. We consider linear regression in the high-dimensional regime where the number of parameters exceeds the number of samples ($p > n$). In order to make…

Statistics Theory · Mathematics 2019-09-24 Adel Javanmard, Jason D. Lee

Time Series Foundation Models for Process Model Forecasting

Process Model Forecasting (PMF) aims to predict how the control-flow structure of a process evolves over time by modeling the temporal dynamics of directly-follows (DF) relations, complementing predictive process monitoring that focuses on…

Machine Learning · Computer Science 2025-12-09 Yongbo Yu, Jari Peeperkorn, Johannes De Smedt, Jochen De Weerdt

Posterior error bounds for prior-driven balancing in linear Gaussian inverse problems

In large-scale Bayesian inverse problems, it is often necessary to apply approximate forward models to reduce the cost of forward model evaluations, while controlling approximation quality. In the context of Bayesian inverse problems with…

Numerical Analysis · Mathematics 2026-01-08 Josie König, Han Cheng Lie

Forward Regression via Gram-Schmidt Orthogonalization for Ultra-High Dimensional Linear Models

Forward regression is a classical and effective tool for variable screening in ultra-high dimensional linear models, but its standard projection-based implementation can be computationally costly and numerically unstable when predictors are…

Methodology · Statistics 2026-03-20 Jialuo Chen, Zhaoxing Gao, Yifan Jiang, Ruey S. Tsay

Forward variable selection for sparse ultra-high dimensional varying coefficient models

Varying coefficient models have numerous applications in a wide scope of scientific areas. While enjoying nice interpretability, they also allow flexibility in modeling dynamic impacts of the covariates. But, in the new era of big data, it…

Methodology · Statistics 2014-10-27 Ming-Yen Cheng, Toshio Honda, Jin-Ting Zhang

Inference in Linear Regression Models with Many Covariates and Heteroskedasticity

The linear regression model is widely used in empirical work in Economics, Statistics, and many other disciplines. Researchers often include many covariates in their linear model specification in an attempt to control for confounders. We…

Statistics Theory · Mathematics 2017-12-12 Matias D. Cattaneo, Michael Jansson, Whitney K. Newey

Post-Selection Confidence Bounds for Prediction Performance

In machine learning, the selection of a promising model from a potentially large number of competing models and the assessment of its generalization performance are critical tasks that need careful consideration. Typically, model selection…

Machine Learning · Statistics 2023-02-06 Pascal Rink, Werner Brannath