English
Related papers

Related papers: Scalable multiscale density estimation

200 papers

We develop Bayesian models for density regression with emphasis on discrete outcomes. The problem of density regression is approached by considering methods for multivariate density estimation of mixed scale variables, and obtaining…

Methodology · Statistics 2019-08-14 Georgios Papageorgiou

Data sets for statistical analysis become extremely large even with some difficulty of being stored on one single machine. Even when the data can be stored in one machine, the computational cost would still be intimidating. We propose a…

Methodology · Statistics 2020-02-18 Ya Su

Nonparametric estimation of the conditional distribution of a response given high-dimensional features is a challenging problem. It is important to allow not only the mean but also the variance and shape of the response density to change…

Machine Learning · Statistics 2013-12-05 Francesca Petralia , Joshua Vogelstein , David B. Dunson

It is now practically the norm for data to be very high dimensional in areas such as genetics, machine vision, image analysis and many others. When analyzing such data, parametric models are often too inflexible while nonparametric…

Methodology · Statistics 2011-05-31 Abhishek Bhattacharya , Garritt Page , David Dunson

Existing high-dimensional Bayesian optimization (BO) methods aim to overcome the curse of dimensionality by carefully encoding structural assumptions, from locality to sparsity to smoothness, into the optimization procedure. Surprisingly,…

Machine Learning · Computer Science 2026-04-10 Colin Doumont , Donney Fan , Natalie Maus , Jacob R. Gardner , Henry Moss , Geoff Pleiss

Although continuous density estimation has received abundant attention in the Bayesian nonparametrics literature, there is limited theory on multivariate mixed scale density estimation. In this note, we consider a general framework to…

Statistics Theory · Mathematics 2014-05-26 Antonio Canale , David B. Dunson

While existing mathematical descriptions can accurately account for phenomena at microscopic scales (e.g. molecular dynamics), these are often high-dimensional, stochastic and their applicability over macroscopic time scales of physical…

Machine Learning · Statistics 2016-09-08 P. S. Koutsourelakis , Elias Bilionis

We introduce a novel and scalable Bayesian framework for multivariate-density-density regression (DDR), designed to model relationships between multivariate distributions. Our approach addresses the critical issue of distributions residing…

Methodology · Statistics 2025-09-24 Khai Nguyen , Yang Ni , Peter Mueller

Recently developed techniques have made it possible to quickly learn accurate probability density functions from data in low-dimensional continuous space. In particular, mixtures of Gaussians can be fitted to data very quickly using an…

Machine Learning · Computer Science 2013-01-18 Scott Davies , Andrew Moore

This paper is concerned with the numerical solution of model-based, Bayesian inverse problems. We are particularly interested in cases where the cost of each likelihood evaluation (forward-model call) is expensive and the number of un-…

Computation · Statistics 2016-07-25 Isabell M. Franck , P. S. Koutsourelakis

In many modern applications, there is interest in analyzing enormous data sets that cannot be easily moved across computers or loaded into memory on a single computer. In such settings, it is very common to be interested in clustering.…

Computation · Statistics 2020-05-15 Hanyu Song , Yingjian Wang , David B. Dunson

Classically, Bayesian clustering interprets each component of a mixture model as a cluster. The inferred clustering posterior is highly sensitive to any inaccuracies in the kernel within each component. As this kernel is made more flexible,…

Methodology · Statistics 2025-12-12 David Buch , Miheer Dewaskar , David B. Dunson

Bayesian optimization (BO) is a leading method for optimizing expensive black-box optimization and has been successfully applied across various scenarios. However, BO suffers from the curse of dimensionality, making it challenging to scale…

Machine Learning · Computer Science 2025-04-03 Vu Viet Hoang , Hung The Tran , Sunil Gupta , Vu Nguyen

Bayesian variable selection is a powerful tool for data analysis, as it offers a principled method for variable selection that accounts for prior information and uncertainty. However, wider adoption of Bayesian variable selection has been…

Methodology · Statistics 2023-12-06 Martin Jankowiak

This thesis responds to the challenges of using a large number, such as thousands, of features in regression and classification problems. There are two situations where such high dimensional features arise. One is when high dimensional…

Machine Learning · Statistics 2007-09-20 Longhai Li

Well-established methods for the solution of stochastic partial differential equations (SPDEs) typically struggle in problems with high-dimensional inputs/outputs. Such difficulties are only amplified in large-scale applications where even…

Machine Learning · Statistics 2019-09-10 Constantin Grigo , Phaedon-Stelios Koutsourelakis

Impactful applications such as materials discovery, hardware design, neural architecture search, or portfolio optimization require optimizing high-dimensional black-box functions with mixed and combinatorial input spaces. While Bayesian…

Machine Learning · Computer Science 2024-03-21 Leonard Papenmeier , Luigi Nardi , Matthias Poloczek

Our focus is on constructing a multiscale nonparametric prior for densities. The Bayes density estimation literature is dominated by single scale methods, with the exception of Polya trees, which favor overly-spiky densities even when the…

Methodology · Statistics 2014-10-06 Antonio Canale , David B. Dunson

High resolution geospatial data are challenging because standard geostatistical models based on Gaussian processes are known to not scale to large data sizes. While progress has been made towards methods that can be computed more…

Methodology · Statistics 2020-12-03 Michele Peruzzi , David B. Dunson

Bi-clustering is a useful approach in analyzing biological data when observations come from heterogeneous groups and have a large number of features. We outline a general Bayesian approach in tackling bi-clustering problems in moderate to…

Applications · Statistics 2021-02-11 Han Yan , Jiexing Wu , Yang Li , Jun S. Liu
‹ Prev 1 2 3 10 Next ›