Benjamin Bloem-Reddy

Debiased Counterfactual Generation via Flow Matching from Observations

Estimating counterfactual distributions under interventions is central to treatment risk assessment and counterfactual generation tasks. Existing approaches model the counterfactual distribution as a standalone generative target, without…

Machine Learning · Statistics 2026-05-11 Hugh Dance , Johnny Xi , Peter Orbanz , Benjamin Bloem-Reddy

Randomization Tests for Conditional Group Symmetry

Symmetry plays a central role in the sciences, machine learning, and statistics. While statistical tests for the presence of distributional invariance with respect to groups have a long history, tests for conditional symmetry in the form of…

Methodology · Statistics 2025-12-12 Kenny Chiu , Alex Sharp , Benjamin Bloem-Reddy

Counterfactual Cocycles: A Framework for Robust and Coherent Counterfactual Transports

Estimating joint distributions (a.k.a. couplings) over counterfactual outcomes is central to personalized decision-making and treatment risk assessment. Two emergent frameworks with identifiability guarantees are: (i) bijective structural…

Methodology · Statistics 2025-09-26 Hugh Dance , Benjamin Bloem-Reddy

CN-SBM: Categorical Block Modelling For Primary and Residual Copy Number Variation

Cancer is a genetic disorder whose clonal evolution can be monitored by tracking noisy genome-wide copy number variants. We introduce the Copy Number Stochastic Block Model (CN-SBM), a probabilistic framework that jointly clusters samples…

Machine Learning · Statistics 2025-07-01 Kevin Lam , William Daniels , J Maxwell Douglas , Daniel Lai , Samuel Aparicio , Benjamin Bloem-Reddy , Yongjin Park

Distinguishing Cause from Effect with Causal Velocity Models

Bivariate structural causal models (SCM) are often used to infer causal direction by examining their goodness-of-fit under restricted model classes. In this paper, we describe a parametrization of bivariate SCMs in terms of a causal…

Machine Learning · Statistics 2025-06-11 Johnny Xi , Hugh Dance , Peter Orbanz , Benjamin Bloem-Reddy

Identifying Metric Structures of Deep Latent Variable Models

Deep latent variable models learn condensed representations of data that, hopefully, reflect the inner workings of the studied phenomena. Unfortunately, these latent representations are not statistically identifiable, meaning they cannot be…

Machine Learning · Statistics 2025-06-02 Stas Syrota , Yevgen Zainchkovskyy , Johnny Xi , Benjamin Bloem-Reddy , Søren Hauberg

Non-parametric Hypothesis Tests for Distributional Group Symmetry

Symmetry plays a central role in the sciences, machine learning, and statistics. For situations in which data are known to obey a symmetry, a multitude of methods that exploit symmetry have been developed. Statistical tests for the presence…

Methodology · Statistics 2024-12-24 Kenny Chiu , Benjamin Bloem-Reddy

Mixed Variational Flows for Discrete Variables

Variational flows allow practitioners to learn complex continuous distributions, but approximating discrete distributions remains a challenge. Current methodologies typically embed the discrete target in a continuous space - usually via…

Computation · Statistics 2024-02-27 Gian Carlo Diluvi , Benjamin Bloem-Reddy , Trevor Campbell

Indeterminacy in Generative Models: Characterization and Strong Identifiability

Most modern probabilistic generative models, such as the variational autoencoder (VAE), have certain indeterminacies that are unresolvable even with an infinite amount of data. Different tasks tolerate different indeterminacies, however…

Machine Learning · Statistics 2023-03-06 Quanhan Xi , Benjamin Bloem-Reddy

Lossy Compression for Lossless Prediction

Most data is automatically collected and only ever "seen" by algorithms. Yet, data compressors preserve perceptual fidelity rather than just the information needed by algorithms performing downstream tasks. In this paper, we characterize…

Machine Learning · Computer Science 2022-01-31 Yann Dubois , Benjamin Bloem-Reddy , Karen Ullrich , Chris J. Maddison

Uncertainty in Neural Processes

We explore the effects of architecture and training objective choice on amortized posterior predictive inference in probabilistic conditional generative models. We aim this work to be a counterpoint to a recent trend in the literature that…

Machine Learning · Computer Science 2020-10-09 Saeid Naderiparizi , Kenny Chiu , Benjamin Bloem-Reddy , Frank Wood

Probabilistic symmetries and invariant neural networks

Treating neural network inputs and outputs as random variables, we characterize the structure of neural networks that can be used to model data that are invariant or equivariant under the action of a compact group. Much recent research has…

Machine Learning · Statistics 2020-09-18 Benjamin Bloem-Reddy , Yee Whye Teh

On the Benefits of Invariance in Neural Networks

Many real world data analysis problems exhibit invariant structure, and models that take advantage of this structure have shown impressive empirical performance, particularly in deep learning. While the literature contains a variety of…

Machine Learning · Computer Science 2020-05-04 Clare Lyle , Mark van der Wilk , Marta Kwiatkowska , Yarin Gal , Benjamin Bloem-Reddy

Sequential sampling of Gaussian process latent variable models

We consider the problem of inferring a latent function in a probabilistic model of data. When dependencies of the latent function are specified by a Gaussian process and the data likelihood is complex, efficient computation often involve…

Machine Learning · Statistics 2018-07-23 Martin Tegner , Benjamin Bloem-Reddy , Stephen Roberts

Random Walk Models of Network Formation and Sequential Monte Carlo Methods for Graphs

We introduce a class of generative network models that insert edges by connecting the starting and terminal vertices of a random walk on the network graph. Within the taxonomy of statistical network models, this class is distinguished by…

Methodology · Statistics 2018-07-11 Benjamin Bloem-Reddy , Peter Orbanz

Sampling and Inference for Beta Neutral-to-the-Left Models of Sparse Networks

Empirical evidence suggests that heavy-tailed degree distributions occurring in many real networks are well-approximated by power laws with exponents $\eta$ that may take values either less than and greater than two. Models based on various…

Machine Learning · Statistics 2018-07-10 Benjamin Bloem-Reddy , Adam Foster , Emile Mathieu , Yee Whye Teh

Preferential Attachment and Vertex Arrival Times

We study preferential attachment mechanisms in random graphs that are parameterized by (i) a constant bias affecting the degree-biased distribution on the vertex set and (ii) the distribution of times at which new vertices are created by…

Probability · Mathematics 2017-10-09 Benjamin Bloem-Reddy , Peter Orbanz