统计方法学 — Scifaro

Causal Inference with Missing Exposures and Missing Outcomes

Missing data are ubiquitous in public health research. When estimating causal effects, there are well-established methods to address bias to due missing outcomes. Commonly, causal estimands are defined under hypothetical interventions to…

统计方法学 · 统计学 2026-04-15 Kirsten E. Landsiedel , Rachel Abbott , Atukunda Mucunguzi , Florence Mwangwa , Elijah Kakande , Edwin D. Charlebois , Carina Marquez , Moses R. Kamya , Laura B. Balzer

Quantifying structural uncertainty in chemical reaction network inference

Dynamical systems in biology are complex, and one often does not have comprehensive knowledge about the interactions involved. Chemical reaction network (CRN) inference aims to identify, from observing species concentrations over time, the…

统计方法学 · 统计学 2026-04-15 Yong See Foo , Adriana Zanca , Jennifer A. Flegg , Ivo Siekmann

Finite-Sample Risk Approximation and Risk-Consistent Tuning for Generalized Ridge Estimation in Nonlinear Models: Controlling Extreme Realizations

Maximum likelihood estimation in nonlinear models can exhibit substantial instability in finite samples when the data provide limited information about certain parameters. Such instability is driven by rare but extreme realizations of the…

统计方法学 · 统计学 2026-04-15 Masamune Iwasawa

Valid post-selection inference for penalized G-estimation

Understanding treatment effect heterogeneity is important for decision making in medical and clinical practices, or handling various engineering and marketing challenges. When dealing with high-dimensional covariates or when the effect…

统计方法学 · 统计学 2026-04-15 Ajmery Jaman , Ashkan Ertefaie , Michèle Bally , Renée Lévesque , Robert W. Platt , Mireille E. Schnitzer

Estimation of time-varying treatment effects using marginal structural models dependent on partial treatment history

Inverse probability (IP) weighting of marginal structural models (MSMs) can provide consistent estimators of time-varying treatment effects under correct model specifications and identifiability assumptions, even in the presence of…

统计方法学 · 统计学 2026-04-15 Nodoka Seya , Masataka Taguri , Takeo Ishii

Graphical lasso for extremes

In this paper, we estimate the sparse dependence structure in the tail region of a multivariate random vector, potentially of high dimension. The tail dependence is modeled via a graphical model for extremes embedded in the H\"usler-Reiss…

统计方法学 · 统计学 2026-04-15 Phyllis Wan , Chen Zhou

Penalized Likelihood Methods for Modeling Count Data

The paper considers parameter estimation in count data models using penalized likelihood methods. The motivating data consists of multiple independent count variables with a moderate sample size per variable. The data were collected during…

统计方法学 · 统计学 2026-04-15 Minh Thu Bui , Cornelis J. Potgieter , Akihito Kamata

Inferring Change Points in Regression via Sample Weighting

We study the problem of identifying change points in high-dimensional generalized linear models, and propose an approach based on sample-weighted empirical risk minimization. Our method, Weighted ERM, encodes priors on the change points via…

统计方法学 · 统计学 2026-04-14 Gabriel Arpino , Ramji Venkataramanan

Nested Atoms Model with Application to Clustering Big Population-Scale Single-Cell Data

We consider the problem of clustering nested or hierarchical data, where observations are grouped and there are both group-level and observation-level variables. In our motivating OneK1K dataset, observations consist of single-cell…

统计方法学 · 统计学 2026-04-14 Arhit Chakrabarti , Yang Ni , Yuchao Jiang , Bani K. Mallick

NetworkNet: A Deep Neural Network Approach for Random Networks with Sparse Nodal Attributes and Complex Nodal Heterogeneity

Heterogeneous network data with rich nodal information become increasingly prevalent across multidisciplinary research, yet accurately modeling complex nodal heterogeneity and simultaneously selecting influential nodal attributes remains an…

统计方法学 · 统计学 2026-04-14 Zhaoyu Xing , Xiufan Yu

A novel reference prior for Gaussian hierarchical models with intrinsic conditional autoregressive random effects

We develop a novel reference prior for Gaussian hierarchical models with intrinsic conditional autoregressive (ICAR) random effects. This is particularly important in the context of objective Bayes variable selection with sample size $n$…

统计方法学 · 统计学 2026-04-14 Marco A. R. Ferreira

Principled Inference in Dense High-Dimensional Linear Models via Local Conditional Sparsity

High-dimensional inference methods often rely on coefficient sparsity, an assumption that can be restrictive when signals are dense but individually weak. In such settings, valid inference may still be possible if the covariates exhibit…

统计方法学 · 统计学 2026-04-14 Wenjun Xiong , Yan Chen , Mingya Long , Qizhai Li

An Empirical Comparison of Methods for Quantifying the Similarity of Categorical Datasets

Quantifying the similarity of two or more datasets has widespread applications in statistics and machine learning. The method choice is, however, difficult due to the abundance of proposed methods and the lack of neutral comparison studies,…

统计方法学 · 统计学 2026-04-14 Marieke Stolte , Jörg Rahnenführer , Andrea Bommert

Optimized questionnaire item selection for tracking the progression of motor symptoms in Parkinson's disease

Long questionnaires increase the response burden for patients and healthcare workers. In the treatment of Parkinson's disease, the MDS-UPDRS questionnaire to track disease progression may be underutilized due to time requirements. While…

统计方法学 · 统计学 2026-04-14 Karl Sigfrid , Ellinor Fackle-Fornius , Frank Miller

Prediction decomposition for causal analysis

There is rising interest in using Machine Learning (ML) model predictions as outcomes in causal analysis. However, these methods have faced challenges in finding the true treatment effects. It is also challenging to make choices about which…

统计方法学 · 统计学 2026-04-14 Ofir Reich

Optimal multiple testing under family-wise error control: elementary symmetric polynomials and a scalable algorithm

Simultaneously testing $K$ hypotheses while controlling the family-wise error rate is a fundamental problem in statistics. Existing procedures (Bonferroni, Holm, Hochberg, Hommel) provide valid control but sacrifice power, increasingly so…

统计方法学 · 统计学 2026-04-14 Prasanjit Dubey , Xiaoming Huo

Restricted Search Space Graph MCMC via Birth-Death Processes

Inferring directed acyclic graphs (DAGs) from data via Markov chain Monte Carlo (MCMC) is computationally challenging in moderate-to-high dimensional settings because their discrete sampling space grows super-exponentially with the number…

统计方法学 · 统计学 2026-04-14 Morris Greenberg , Kieran R Campbell , Radu Craiu

Integrative learning of individualized treatment rules from multiple studies with partially overlapping treatments

An individualized treatment rule (ITR) tailors treatments to a patient's specific characteristics. However, randomized controlled trials (RCTs) are often underpowered to detect the treatment effect heterogeneity needed for reliable ITR…

统计方法学 · 统计学 2026-04-14 Yuan Bian , Donglin Zeng , Hyun-Joon Yang , Leanne M. Williams , Yuanjia Wang

Causal mediation in cluster-randomized trials with multiple mediators: spillover-aware decomposition, identification, and semiparametric efficient inference

Causal mediation analysis in cluster-randomized trials (CRTs) is complicated by the presence of multiple mediators, intracluster correlation, and within-cluster interference. Existing mediation methods often fall short in accommodating…

统计方法学 · 统计学 2026-04-14 Jiaqi Tong , Chao Cheng , Fan Li

Multiple Imputation Diagnostics when using Electronic Health Record Data in Observational Studies: A Case Study

Missing values in electronic health record (EHR) data pose a significant challenge for epidemiologic research. Traditional methods for handling missing data, like mean imputation, may introduce bias. Multiple imputation (MI) offers a…

统计方法学 · 统计学 2026-04-14 Nrupen A. Bhavsar , Lingyu Zhou , Samuel I. Berchuck , Matthew L. Maciejewski , Jerome P. Reiter