David Draper — Scifaro

The Bayesian Method of Tensor Networks

Bayesian learning is a powerful learning framework which combines the external information of the data (background information) with the internal information (training data) in a logically consistent way in inference and prediction. By…

Machine Learning · Statistics 2026-02-11 Erdong Guo , David Draper

Representation Theorem for Matrix Product States

In this work, we investigate the universal representation capacity of the Matrix Product States (MPS) from the perspective of boolean functions and continuous functions. We show that MPS can accurately realize arbitrary boolean functions by…

Machine Learning · Statistics 2025-10-16 Erdong Guo , David Draper

A Unified Framework for Cluster Methods with Tensor Networks

Markov Chain Monte Carlo (MCMC) and Tensor Networks (TN) are two powerful frameworks for numerically investigating many-body systems, each offering distinct advantages. MCMC, with its flexibility and theoretical consistency, is well-suited…

Methodology · Statistics 2024-09-10 Erdong Guo , David Draper

Discussion of Martingale Posterior Distributions by E. Fong, C. Holmes, and S. G. Walker

In this discussion note, we respond to the fascinating paper "Martingale Posterior Distributions" by E. Fong, C. Holmes, and S. G. Walker with a couple of comments. On the basis of previous research, a theorem is stated regarding the…

Methodology · Statistics 2023-02-16 David Draper , Erdong Guo

Annealing Double-Head: An Architecture for Online Calibration of Deep Neural Networks

Model calibration, which is concerned with how frequently the model predicts correctly, not only plays a vital part in statistical model design, but also has substantial practical applications, such as optimal decision-making in the real…

Machine Learning · Statistics 2023-01-18 Erdong Guo , David Draper , Maria De Iorio

Neural Tangent Kernel of Matrix Product States: Convergence and Applications

In this work, we study the Neural Tangent Kernel (NTK) of Matrix Product States (MPS) and the convergence of its NTK in the infinite bond dimensional limit. We prove that the NTK of MPS asymptotically converges to a constant matrix during…

Machine Learning · Statistics 2021-11-30 Erdong Guo , David Draper

A Simple Necessary Condition For Independence of Real-Valued Random Variables

The standard method to check for the independence of two real-valued random variables -- demonstrating that the bivariate joint distribution factors into the product of its marginals -- is both necessary and sufficient. Here we present a…

Probability · Mathematics 2021-11-30 David Draper , Erdong Guo , Robert Lund , Jon Woody

The Practical Scope of the Central Limit Theorem

The Central Limit Theorem (CLT) is at the heart of a great deal of applied problem-solving in statistics and data science, but the theorem is silent on an important implementation issue: how much data do you need for the…

Other Statistics · Statistics 2021-11-25 David Draper , Erdong Guo

Infinitely Wide Tensor Networks as Gaussian Process

Gaussian Process is a non-parametric prior which can be understood as a distribution on the function space intuitively. It is known that by introducing appropriate prior to the weights of the neural networks, Gaussian Process can be…

Machine Learning · Statistics 2021-01-08 Erdong Guo , David Draper

Pólya Urn Latent Dirichlet Allocation: a doubly sparse massively parallel sampler

Latent Dirichlet Allocation (LDA) is a topic model widely used in natural language processing and machine learning. Most approaches to training the model rely on iterative algorithms, which makes it difficult to run LDA on big corpora that…

Machine Learning · Statistics 2020-10-23 Alexander Terenin , Måns Magnusson , Leif Jonsson , David Draper

Asynchronous Gibbs Sampling

Gibbs sampling is a Markov Chain Monte Carlo (MCMC) method often used in Bayesian learning. MCMC methods can be difficult to deploy on parallel and distributed systems due to their inherently sequential nature. We study asynchronous Gibbs…

Computation · Statistics 2020-03-03 Alexander Terenin , Daniel Simpson , David Draper

Cox's Theorem and the Jaynesian Interpretation of Probability

There are multiple proposed interpretations of probability theory: one such interpretation is true-false logic under uncertainty. Cox's Theorem is a representation theorem that states, under a certain set of axioms describing the meaning of…

Statistics Theory · Mathematics 2020-02-11 Alexander Terenin , David Draper

GPU-accelerated Gibbs sampling: a case study of the Horseshoe Probit model

Gibbs sampling is a widely used Markov chain Monte Carlo (MCMC) method for numerically approximating integrals of interest in Bayesian statistics and other mathematical sciences. Many implementations of MCMC methods do not extend easily to…

Computation · Statistics 2019-06-03 Alexander Terenin , Shawfeng Dong , David Draper

Comment: A brief survey of the current state of play for Bayesian computation in data science at Big-Data scale

We wish to contribute to the discussion of "Comparing Consensus Monte Carlo Strategies for Distributed Bayesian Computation" by offering our views on the current best methods for Bayesian computation, both at big-data scale and with smaller…

Computation · Statistics 2017-12-15 David Draper , Alexander Terenin

A Noninformative Prior on a Space of Distribution Functions

In a given problem, the Bayesian statistical paradigm requires the specification of a prior distribution that quantifies relevant information about the unknowns of main interest external to the data. In cases where little such information…

Statistics Theory · Mathematics 2017-10-11 Alexander Terenin , David Draper

A Space-Based Observational Strategy for Characterizing the First Stars and Galaxies Using the Redshifted 21-cm Global Spectrum

The redshifted 21-cm monopole is expected to be a powerful probe of the epoch of the first stars and galaxies ($10<z<35$). The global 21-cm signal is sensitive to the thermal and ionization state of hydrogen gas and thus provides a tracer…

Instrumentation and Methods for Astrophysics · Physics 2017-07-26 Jack O. Burns , Richard Bradley , Keith Tauscher , Steven Furlanetto , Jordan Mirocha , Raul Monsalve , David Rapetti , William Purcell , David Newell , David Draper , Robert MacDowall , Judd Bowman , Bang Nhan , Edward J. Wollack , Anastasia Fialkov , Dayton Jones , Justin C. Kasper , Abraham Loeb , Abhirup Datta , Jonathan Pritchard , Eric Switzer , Michael Bicay

A nonparametric Bayesian analysis of heterogeneous treatment effects in digital experimentation

Randomized controlled trials play an important role in how Internet companies predict the impact of policy decisions and product changes. In these 'digital experiments', different units (people, devices, products) respond differently to the…

Applications · Statistics 2015-12-21 Matt Taddy , Matt Gardner , Liyun Chen , David Draper

Causal Inference in Repeated Observational Studies: A Case Study of eBay Product Releases

Causal inference in observational studies is notoriously difficult, due to the fact that the experimenter is not in charge of the treatment assignment mechanism. Many potential confounding factors (PCFs) exist in such a scenario, and if…

Applications · Statistics 2015-09-15 Vadim von Brzeski , Matt Taddy , David Draper

Power-Expected-Posterior Priors for Variable Selection in Gaussian Linear Models

In the context of the expected-posterior prior (EPP) approach to Bayesian variable selection in linear models, we combine ideas from power-prior and unit-information-prior methodologies to simultaneously produce a minimally-informative…

Computation · Statistics 2015-04-27 Dimitris Fouskakis , Ioannis Ntzoufras , David Draper