Related papers: Randomization Does Not Justify Logistic Regression

On regression adjustments in experiments with several treatments

Regression adjustments are often made to experimental data. Since randomization does not justify the models, bias is likely; nor are the usual variance calculations to be trusted. Here, we evaluate regression adjustments using Neyman's…

Applications · Statistics 2008-12-18 David A. Freedman

Estimating the logistic regression equation when the model is incorrect

Protesting mildly against the notion of an exactly correct parametric model the view is adopted that the logistic regression equation is merely an approximation to the underlying, true function. The behaviour of likelihood based estimators…

Statistics Theory · Mathematics 2026-05-27 Nils Lid Hjort

Identification of inferential parameters in the covariate-normalized linear conditional logit model

The conditional logit model is a standard workhorse approach to estimating customers' product feature preferences using choice data. Using these models at scale, however, can result in numerical imprecision and optimization failure due to a…

Econometrics · Economics 2020-12-16 Philip Erickson

The LOOP Estimator: Adjusting for Covariates in Randomized Experiments

When conducting a randomized controlled trial, it is common to specify in advance the statistical analyses that will be used to analyze the data. Typically these analyses will involve adjusting for small imbalances in baseline covariates.…

Applications · Statistics 2017-08-04 Edward Wu , Johann Gagnon-Bartsch

An efficient stochastic Newton algorithm for parameter estimation in logistic regressions

Logistic regression is a well-known statistical model which is commonly used in the situation where the output is a binary random variable. It has a wide range of applications including machine learning, public health, social sciences,…

Statistics Theory · Mathematics 2019-04-18 Bernard Bercu , Antoine Godichon-Baggioni , Bruno Portier

On doubly robust estimation for logistic partially linear models

Consider a logistic partially linear model, in which the logit of the mean of a binary response is related to a linear function of some covariates and a nonparametric function of other covariates. We derive simple, doubly robust estimators…

Methodology · Statistics 2019-01-29 Zhiqiang Tan

Robust Ranking of Happiness Outcomes: A Median Regression Perspective

Ordered probit and logit models have been frequently used to estimate the mean ranking of happiness outcomes (and other ordinal data) across groups. However, it has been recently highlighted that such ranking may not be identified in most…

Econometrics · Economics 2022-06-08 Le-Yu Chen , Ekaterina Oparina , Nattavudh Powdthavee , Sorawoot Srisuma

Robust estimation for functional logistic regression models

This paper addresses the problem of providing robust estimators under a functional logistic regression model. Logistic regression is a popular tool in classification problems with two populations. As in functional linear regression,…

Methodology · Statistics 2023-08-16 Graciela Boente , Marina Valdora

Evaluating LLMs When They Do Not Know the Answer: Statistical Evaluation of Mathematical Reasoning via Comparative Signals

Evaluating mathematical reasoning in LLMs is constrained by limited benchmark sizes and inherent model stochasticity, yielding high-variance accuracy estimates and unstable rankings across platforms. On difficult problems, an LLM may fail…

Machine Learning · Computer Science 2026-02-04 Zihan Dong , Zhixian Zhang , Yang Zhou , Can Jin , Ruijia Wu , Linjun Zhang

Generating Probabilities From Numerical Weather Forecasts by Logistic Regression

Logistic models are studied as a tool to convert output from numerical weather forecasting systems (deterministic and ensemble) into probability forecasts for binary events. A logistic model obtains by putting the logarithmic odds ratio…

Atmospheric and Oceanic Physics · Physics 2009-01-29 Jochen Bröcker

Regularized zero-inflated Bernoulli regression model

Logistic regression model is widely used in many studies to investigate the relationship between a binary response variable Y and a set of potential predictors $X_1,\ldots, X_p$ (for example: $Y = 1$ if the outcome occurred and $Y = 0$…

Methodology · Statistics 2025-02-25 Mouhamed Ndoye , Aba Diop

Robust and efficient estimation of nonparametric generalized linear models

Generalized linear models are flexible tools for the analysis of diverse datasets, but the classical formulation requires that the parametric component is correctly specified and the data contain no atypical observations. To address these…

Methodology · Statistics 2023-04-21 Ioannis Kalogridis , Gerda Claeskens , Stefan Van Aelst

Flexibility without foresight: the predictive limitations of mixture models

Models allowing for random heterogeneity, such as mixed logit and latent class, are generally observed to obtain superior model fit and yield detailed insights into unobserved preference heterogeneity. Using theoretical arguments and two…

Econometrics · Economics 2025-10-13 Stephane Hess , Sander van Cranenburgh

Bounding the Test Log-Likelihood of Generative Models

Several interesting generative learning algorithms involve a complex probability distribution over many random variables, involving intractable normalization constants or latent variable normalization. Some of them may even not have an…

Machine Learning · Computer Science 2014-05-13 Yoshua Bengio , Li Yao , Kyunghyun Cho

Logistic Regression Equivalence: A Framework for Comparing Logistic Regression Models Across Populations

In this paper we discuss how to evaluate the differences between fitted logistic regression models across sub-populations. Our motivating example is in studying computerized diagnosis for learning disabilities, where sub-populations based…

Methodology · Statistics 2023-03-24 Guy Ashiri-Prossner , Yuval Benjamini

Long-term Averages of the Stochastic Logistic Map

The logistic map is a nonlinear difference equation well studied in the literature, used to model self-limiting growth in certain populations. It is known that, under certain regularity conditions, the stochastic logistic map, where the…

Dynamical Systems · Mathematics 2023-10-05 Maricela Cruz , Austin Wei , Johanna Hardin , Ami Radunskaya

A Benchmark and Comparison of Active Learning for Logistic Regression

Logistic regression is by far the most widely used classifier in real-world applications. In this paper, we benchmark the state-of-the-art active learning methods for logistic regression and discuss and illustrate their underlying…

Machine Learning · Statistics 2018-07-04 Yazhou Yang , Marco Loog

A Compound Logistic Regression Model for Binary Responses

Logistic regression is the most commonly used method for constructing predictive models for binary responses. One significant drawback to this approach, however, is that the asymptotes of the logistic response function are fixed at 0 and 1,…

Methodology · Statistics 2026-02-09 Anthony Almudevar , Jacob Almudevar

This human study did not involve human subjects: Validating LLM simulations as behavioral evidence

A growing literature uses large language models (LLMs) as synthetic participants to generate cost-effective and nearly instantaneous responses in social science experiments. However, there is limited guidance on when such simulations…

Artificial Intelligence · Computer Science 2026-02-18 Jessica Hullman , David Broska , Huaman Sun , Aaron Shaw

Are you using test log-likelihood correctly?

Test log-likelihood is commonly used to compare different models of the same data or different approximate inference algorithms for fitting the same probabilistic model. We present simple examples demonstrating how comparisons based on test…

Machine Learning · Statistics 2024-01-22 Sameer K. Deshpande , Soumya Ghosh , Tin D. Nguyen , Tamara Broderick