English

Nonparametric FBST for Validating Linear Models

Methodology 2025-07-23 v2

Abstract

The Full Bayesian Significance Test (FBST) possesses many desirable aspects, such as dismissing the need for hypotheses to have positive prior probability and providing a measure of evidence against H0H_0. Still, few attempts have been made to bring the FBST to nonparametric settings, with the main drawback being the need to obtain the highest posterior density (HPD) in a function space. In this work, we use a Gaussian processes prior to derive the FBST for hypotheses of the type H0:g(x)=b(x)β,xX,βRk, H_0: g(\boldsymbol{x}) = \boldsymbol{b}(\boldsymbol{x})\boldsymbol{\beta}, \quad \forall \boldsymbol{x} \in \mathcal{X}, \quad \boldsymbol{\beta} \in \mathbb{R}^k, where g()g(\cdot) is the regression function, b()\boldsymbol{b}(\cdot) is a vector of linearly independent linear functions -- such as b(x)=x\boldsymbol{b}(\boldsymbol{x}) = \boldsymbol{x}' -- and X\mathcal{X} is the covariates' domain. We also make use of pragmatic hypotheses to verify if the data might be compatible with a linear model when factors such as measurement errors or utility judgments are accounted for. This contribution extends the theory of the FBST, allowing its application in nonparametric settings and providing a procedure that easily tests if linear models are adequate for the data and that can automatically perform variable selection.

Keywords

Cite

@article{arxiv.2406.15608,
  title  = {Nonparametric FBST for Validating Linear Models},
  author = {Rodrigo F. L. Lassance and Julio M. Stern and Rafael B. Stern},
  journal= {arXiv preprint arXiv:2406.15608},
  year   = {2025}
}

Comments

All code available in https://github.com/rflassance/lmFBST