English
Related papers

Related papers: Analysis of Testing-Based Forward Model Selection

200 papers

Forward regression is a crucial methodology for automatically identifying important predictors from a large pool of potential covariates. In contexts with moderate predictor correlation, forward selection techniques can achieve screening…

Methodology · Statistics 2024-08-23 Xuejun Jiang , Yue Ma , Haofeng Wang

Forward regression is a statistical model selection and estimation procedure which inductively selects covariates that add predictive power into a working statistical regression model. Once a model is selected, unknown regression parameters…

Machine Learning · Statistics 2018-04-12 Damian Kozbur

The multivariate normal linear model is one of the most widely employed models for statistical inference in applied research. Special cases include (multivariate) t testing, (M)AN(C)OVA, (multivariate) multiple regression, and repeated…

Methodology · Statistics 2021-03-15 J. Mulder , H. Hoijtink , X. Gu

This paper introduces Targeted Function Balancing (TFB), a covariate balancing weights framework for estimating the average treatment effect of a binary intervention. TFB first regresses an outcome on covariates, and then selects weights…

Methodology · Statistics 2025-04-10 Leonard Wainstein , He Bai

Time series foundation models (TSFMs) are a class of potentially powerful, general-purpose tools for time series forecasting and related temporal tasks, but their behavior is strongly shaped by subtle inductive biases in their design.…

Most scientific publications follow the familiar recipe of (i) obtain data, (ii) fit a model, and (iii) comment on the scientific relevance of the effects of particular covariates in that model. This approach, however, ignores the fact that…

Methodology · Statistics 2021-03-08 Nicholas Kissel , Lucas Mentch

As the frontiers of applied statistics progress through increasingly complex experiments we must exploit increasingly sophisticated inferential models to analyze the observations we make. In order to avoid misleading or outright erroneous…

Methodology · Statistics 2018-03-23 Michael Betancourt

Selective inference aims at providing valid inference after a data-driven selection of models or hypotheses. It is essential to avoid overconfident results and replicability issues. While significant advances have been made in this area for…

Methodology · Statistics 2025-03-14 Matteo D'Alessandro , Magne Thoresen

We introduce a classification method based on in-context learning using time-series foundation models (TSFMs). We demonstrate how data not included in the TSFM training can be classified without fine-tuning the foundation model or training…

Machine Learning · Computer Science 2026-03-11 Michel Tokic , Slobodan Djukanović , Anja von Beuningen , Cheng Feng

Regression testing is an essential activity to assure that software code changes do not adversely affect existing functionalities. With the wide adoption of Continuous Integration (CI) in software projects, which increases the frequency of…

Software Engineering · Computer Science 2022-09-07 Rongqi Pan , Mojtaba Bagherzadeh , Taher A. Ghaleb , Lionel Briand

Feedforward neural networks (FNNs) can be viewed as non-linear regression models, where covariates enter the model through a combination of weighted summations and non-linear functions. Although these models have some similarities to the…

Methodology · Statistics 2024-05-02 Andrew McInerney , Kevin Burke

We propose a new approach to safe variable preselection in high-dimensional penalized regression, such as the lasso. Preselection - to start with a manageable set of covariates - has often been implemented without clear appreciation of its…

In this paper, we investigate the hypothesis testing problem that checks whether part of covariates / confounders significantly affect the heterogeneous treatment effect given all covariates. This model checking is particularly useful in…

Statistics Theory · Mathematics 2020-09-24 Niwen Zhou , Xu Guo , Lixing Zhu

Hypothesis testing in the linear regression model is a fundamental statistical problem. We consider linear regression in the high-dimensional regime where the number of parameters exceeds the number of samples ($p> n$). In order to make…

Statistics Theory · Mathematics 2019-09-24 Adel Javanmard , Jason D. Lee

Process Model Forecasting (PMF) aims to predict how the control-flow structure of a process evolves over time by modeling the temporal dynamics of directly-follows (DF) relations, complementing predictive process monitoring that focuses on…

Machine Learning · Computer Science 2025-12-09 Yongbo Yu , Jari Peeperkorn , Johannes De Smedt , Jochen De Weerdt

In large-scale Bayesian inverse problems, it is often necessary to apply approximate forward models to reduce the cost of forward model evaluations, while controlling approximation quality. In the context of Bayesian inverse problems with…

Numerical Analysis · Mathematics 2026-01-08 Josie König , Han Cheng Lie

Forward regression is a classical and effective tool for variable screening in ultra-high dimensional linear models, but its standard projection-based implementation can be computationally costly and numerically unstable when predictors are…

Methodology · Statistics 2026-03-20 Jialuo Chen , Zhaoxing Gao , Yifan Jiang , Ruey S. Tsay

Varying coefficient models have numerous applications in a wide scope of scientific areas. While enjoying nice interpretability, they also allow flexibility in modeling dynamic impacts of the covariates. But, in the new era of big data, it…

Methodology · Statistics 2014-10-27 Ming-Yen Cheng , Toshio Honda , Jin-Ting Zhang

The linear regression model is widely used in empirical work in Economics, Statistics, and many other disciplines. Researchers often include many covariates in their linear model specification in an attempt to control for confounders. We…

Statistics Theory · Mathematics 2017-12-12 Matias D. Cattaneo , Michael Jansson , Whitney K. Newey

In machine learning, the selection of a promising model from a potentially large number of competing models and the assessment of its generalization performance are critical tasks that need careful consideration. Typically, model selection…

Machine Learning · Statistics 2023-02-06 Pascal Rink , Werner Brannath
‹ Prev 1 2 3 10 Next ›