David Azriel — Scifaro

Block Designs that Provide Optimal Power in the Cochran-Mantel-Haenszel Test

We consider the asymptotic power performance under local alternatives of the Cochran-Mantel-Haenszel test. Our setting is non-traditional: we investigate randomized experiments that assign subjects via Fisher's blocking design. We show that…

Methodology · Statistics 2025-07-14 David Azriel , Adam Kapelner , Abba M. Krieger

The Optimality of Blocking Designs in Equally and Unequally Allocated Randomized Experiments with General Response

We consider the performance of the difference-in-means estimator in a two-arm randomized experiment under common experimental endpoints such as continuous (regression), incidence, proportion and survival. We examine performance under both…

Statistics Theory · Mathematics 2025-07-08 David Azriel , Abba M. Krieger , Adam Kapelner

Consistency of heritability estimation from summary statistics in high-dimensional linear models

In Genome-Wide Association Studies (GWAS), heritability is defined as the fraction of variance of an outcome explained by a large number of genetic predictors in a high-dimensional polygenic linear model. This work studies the asymptotic…

Statistics Theory · Mathematics 2025-02-27 David Azriel , Samuel Davenport , Armin Schwartzman

The Pairwise Matching Design is Optimal under Extreme Noise and Assignments

We consider the general performance of the difference-in-means estimator in an equally-allocated two-arm randomized experiment under common experimental endpoints such as continuous (regression), incidence, proportion, count and uncensored…

Methodology · Statistics 2024-11-07 David Azriel , Abba M. Krieger , Adam Kapelner

Optimal confidence interval for the difference of proportions

Estimating the probability of the binomial distribution is a basic problem, which appears in almost all introductory statistics courses and is performed frequently in various studies. In some cases, the parameter of interest is a difference…

Computation · Statistics 2024-08-21 Almog Peer , David Azriel

Surgery duration prediction using multi-task feature selection

Efficient optimization of operating room (OR) activity poses a significant challenge for hospital managers due to the complex and risky nature of the environment. The traditional "one size fits all" approach to OR scheduling is no longer…

Applications · Statistics 2024-03-21 David Azriel , Yosef Rinott , Orna Tal , Benyamine Abbou , Nadav Rappoport

Optimal minimax random designs for weighted least squares estimators

This work studies an experimental design problem where {the values of a predictor variable, denoted by $x$}, are to be determined with the goal of estimating a function $m(x)$, which is observed with noise. A linear model is fitted to…

Statistics Theory · Mathematics 2023-05-03 David Azriel

The Role of Pairwise Matching in Experimental Design for an Incidence Outcome

We consider the problem of evaluating designs for a two-arm randomized experiment with an incidence (binary) outcome under a nonparametric general response model. Our two main results are that the priori pair matching design of Greevy et…

Methodology · Statistics 2022-09-02 Adam Kapelner , Abba M. Krieger , David Azriel

Optimal designs for the development of personalized treatment rules

We study the design of multi-armed parallel group clinical trials to estimate personalized treatment rules that identify the best treatment for a given patient with given covariates. Assuming that the outcomes in each treatment arm are…

Statistics Theory · Mathematics 2022-07-13 David Azriel , Yosef Rinott , Martin Posch

Empirical Bayes approach to Truth Discovery problems

When aggregating information from conflicting sources, one's goal is to find the truth. Most real-value \emph{truth discovery} (TD) algorithms try to achieve this goal by estimating the competence of each source and then aggregating the…

Machine Learning · Computer Science 2022-06-13 Tsviel Ben Shabat , Reshef Meir , David Azriel

A zero-estimator approach for estimating the signal level in a high-dimensional model-free setting

We study a high-dimensional regression setting under the assumption of known covariate distribution. We aim at estimating the amount of explained variation in the response by the best linear function of the covariates (the signal level). In…

Statistics Theory · Mathematics 2022-05-12 Ilan Livne , David Azriel , Yair Goldberg

Optimal selection of sample-size dependent common subsets of covariates for multi-task regression prediction

An analyst is given a training set consisting of regression datasets $D_j$ of different sizes, which are distributed according to some $G_j$, $j=1,\ldots,\cal J$, where the distributions $G_j$ are assumed to form a random sample generated…

Statistics Theory · Mathematics 2021-09-07 David Azriel , Yosef Rinott

Improved Estimators for Semi-supervised High-dimensional Regression Model

We study a linear high-dimensional regression model in a semi-supervised setting, where for many observations only the vector of covariates $X$ is given with no response $Y$. We do not make any sparsity assumptions on the vector of…

Statistics Theory · Mathematics 2021-09-03 Ilan Livne , David Azriel , Yair Goldberg

Semi-Supervised linear regression

We study a regression problem where for some part of the data we observe both the label variable ($Y$) and the predictors (${\bf X}$), while for other part of the data only the predictors are given. Such a problem arises, for example, when…

Statistics Theory · Mathematics 2021-04-14 David Azriel , Lawrence D. Brown , Michael Sklar , Richard Berk , Andreas Buja , Linda Zhao

Better Experimental Design by Hybridizing Binary Matching with Imbalance Optimization

We present a new experimental design procedure that divides a set of experimental units into two groups in order to minimize error in estimating an additive treatment effect. One concern is minimizing error at the experimental design stage…

Methodology · Statistics 2021-02-02 Abba M. Krieger , David Azriel , Adam Kapelner

Optimal Rerandomization via a Criterion that Provides Insurance Against Failed Experiments

We present an optimized rerandomization design procedure for a non-sequential treatment-control experiment. Randomized experiments are the gold standard for finding causal effects in nature. But sometimes random assignments result in…

Methodology · Statistics 2021-01-26 Adam Kapelner , Abba M. Krieger , Michael Sklar , David Azriel

Improving the Power of the Randomization Test

We consider the problem of evaluating designs for a two-arm randomized experiment with the criterion being the power of the randomization test for the one-sided null hypothesis. Our evaluation assumes a response that is linear in one…

Methodology · Statistics 2020-08-14 Abba M. Krieger , David Azriel , Michael Sklar , Adam Kapelner

The conditionality principle in high-dimensional regression

Consider a high-dimensional linear regression problem, where the number of covariates is larger than the number of observations and the interest is in estimating the conditional variance of the response variable given the covariates. A…

Statistics Theory · Mathematics 2019-03-29 David Azriel

Harmonizing Fully Optimal Designs with Classic Randomization in Fixed Trial Experiments

There is a movement in design of experiments away from the classic randomization put forward by Fisher, Cochran and others to one based on optimization. In fixed-sample trials comparing two groups, measurements of subjects are known in…

Methodology · Statistics 2018-10-22 Adam Kapelner , Abba M. Krieger , Uri Shalit , David Azriel

Nearly Random Designs with Greatly Improved Balance

We present a new experimental design procedure that divides a set of experimental units into two groups so that the two groups are balanced on a prespecified set of covariates and being almost as random as complete randomization. Under…

Statistics Theory · Mathematics 2016-12-08 Abba M. Krieger , David Azriel , Adam Kapelner