Related papers: Bayesian Classification and Regression with High D…

Bayesian inference in high-dimensional models

Models with dimension more than the available sample size are now commonly used in various applications. A sensible inference is possible using a lower-dimensional structure. In regression problems with a large number of predictors, the…

Statistics Theory · Mathematics 2025-11-25 Sayantan Banerjee , Ismaël Castillo , Subhashis Ghosal

A Method for Avoiding Bias from Feature Selection with Application to Naive Bayes Classification Models

For many classification and regression problems, a large number of features are available for possible use - this is typical of DNA microarray data on gene expression, for example. Often, for computational or other reasons, only a small…

Statistics Theory · Mathematics 2007-06-13 Longhai Li , Jianguo Zhang , Radford M. Neal

A Method for Compressing Parameters in Bayesian Models with Application to Logistic Sequence Prediction Models

Bayesian classification and regression with high order interactions is largely infeasible because Markov chain Monte Carlo (MCMC) would need to be applied with a great many parameters, whose number increases rapidly with the order. In this…

Machine Learning · Statistics 2017-04-28 Longhai Li , Radford M. Neal

Bayesian Compressed Regression

As an alternative to variable selection or shrinkage in high dimensional regression, we propose to randomly compress the predictors prior to analysis. This dramatically reduces storage and computational bottlenecks, performing well when the…

Machine Learning · Statistics 2013-03-26 Rajarshi Guhaniyogi , David B. Dunson

High-Dimensional Bayesian Inference in Nonparametric Additive Models

A fully Bayesian approach is proposed for ultrahigh-dimensional nonparametric additive models in which the number of additive components may be larger than the sample size, though ideally the true model is believed to include only a small…

Methodology · Statistics 2013-09-24 Zuofeng Shang , Ping Li

High-dimensional Feature Selection Using Hierarchical Bayesian Logistic Regression with Heavy-tailed Priors

The problem of selecting the most useful features from a great many (eg, thousands) of candidates arises in many areas of modern sciences. An interesting problem from genomic research is that, from thousands of genes that are active…

Applications · Statistics 2018-05-15 Longhai Li , Weixin Yao

We Still Don't Understand High-Dimensional Bayesian Optimization

Existing high-dimensional Bayesian optimization (BO) methods aim to overcome the curse of dimensionality by carefully encoding structural assumptions, from locality to sparsity to smoothness, into the optimization procedure. Surprisingly,…

Machine Learning · Computer Science 2026-04-10 Colin Doumont , Donney Fan , Natalie Maus , Jacob R. Gardner , Henry Moss , Geoff Pleiss

High dimensional gaussian classification

High dimensional data analysis is known to be as a challenging problem. In this article, we give a theoretical analysis of high dimensional classification of Gaussian data which relies on a geometrical analysis of the error measure. It…

Statistics Theory · Mathematics 2008-07-10 Robin Girard

Bayesian variable selection for high dimensional generalized linear models: convergence rates of the fitted densities

Bayesian variable selection has gained much empirical success recently in a variety of applications when the number $K$ of explanatory variables $(x_1,...,x_K)$ is possibly much larger than the sample size $n$. For generalized linear…

Statistics Theory · Mathematics 2009-09-29 Wenxin Jiang

Compressed Bayesian Tensor Regression

To address the common problem of high dimensionality in tensor regressions, we introduce a generalized tensor random projection method that embeds high-dimensional tensor-valued covariates into low-dimensional subspaces with minimal loss of…

Methodology · Statistics 2025-10-03 Roberto Casarin , Radu Craiu , Qing Wang

Hierarchical Bayesian data selection

There are many issues that can cause problems when attempting to infer model parameters from data. Data and models are both imperfect, and as such there are multiple scenarios in which standard methods of inference will lead to misleading…

Computation · Statistics 2024-05-01 Simon L. Cotter

Bayesian Bi-clustering Methods with Applications in Computational Biology

Bi-clustering is a useful approach in analyzing biological data when observations come from heterogeneous groups and have a large number of features. We outline a general Bayesian approach in tackling bi-clustering problems in moderate to…

Applications · Statistics 2021-02-11 Han Yan , Jiexing Wu , Yang Li , Jun S. Liu

On high-dimensional classification by sparse generalized Bayesian logistic regression

This work addresses the problem of high-dimensional classification by exploring the generalized Bayesian logistic regression method under a sparsity-inducing prior distribution. The method involves utilizing a fractional power of the…

Statistics Theory · Mathematics 2024-03-20 The Tien Mai

Bayesian model and dimension reduction for uncertainty propagation: applications in random media

Well-established methods for the solution of stochastic partial differential equations (SPDEs) typically struggle in problems with high-dimensional inputs/outputs. Such difficulties are only amplified in large-scale applications where even…

Machine Learning · Statistics 2019-09-10 Constantin Grigo , Phaedon-Stelios Koutsourelakis

Non-linear regression models for Approximate Bayesian Computation

Approximate Bayesian inference on the basis of summary statistics is well-suited to complex problems for which the likelihood is either mathematically or computationally intractable. However the methods that use rejection suffer from the…

Computation · Statistics 2010-05-04 M. G. B. Blum , O. Francois

Dimension and model reduction approaches for linear Bayesian inverse problems with rank-deficient prior covariances

Bayesian inverse problems use observed data to update a prior probability distribution for an unknown state or parameter of a scientific system to a posterior distribution conditioned on the data. In many applications, the unknown parameter…

Numerical Analysis · Mathematics 2026-05-12 Josie König , Elizabeth Qian , Melina A. Freitag

Feature and Variable Selection in Classification

The amount of information in the form of features and variables avail- able to machine learning algorithms is ever increasing. This can lead to classifiers that are prone to overfitting in high dimensions, high di- mensional models do not…

Machine Learning · Computer Science 2014-02-12 Aaron Karper

Flexible Bayesian Nonlinear Model Configuration

Regression models are used in a wide range of applications providing a powerful scientific tool for researchers from different fields. Linear, or simple parametric, models are often not sufficient to describe complex relationships between…

Machine Learning · Statistics 2021-11-24 Aliaksandr Hubin , Geir Storvik , Florian Frommlet

Density Estimation and Classification via Bayesian Nonparametric Learning of Affine Subspaces

It is now practically the norm for data to be very high dimensional in areas such as genetics, machine vision, image analysis and many others. When analyzing such data, parametric models are often too inflexible while nonparametric…

Methodology · Statistics 2011-05-31 Abhishek Bhattacharya , Garritt Page , David Dunson

Learning Densities Conditional on Many Interacting Features

Learning a distribution conditional on a set of discrete-valued features is a commonly encountered task. This becomes more challenging with a high-dimensional feature set when there is the possibility of interaction between the features. In…

Machine Learning · Statistics 2013-05-01 David C. Kessler , Jack Taylor , David B. Dunson