Related papers: Generalized Linear Models for Aggregated Data

Regression with Label Permutation in Generalized Linear Model

The assumption that response and predictor belong to the same statistical unit may be violated in practice. Unbiased estimation and recovery of true label ordering based on unlabeled data are challenging tasks and have attracted increasing…

Methodology · Statistics 2022-06-24 Guanhua Fang , Ping Li

Linear mixed modelling of federated data when only the mean, covariance, and sample size are available

In medical research, individual-level patient data provide invaluable information, but the patients' right to confidentiality remains of utmost priority. This poses a huge challenge when estimating statistical models such as linear mixed…

Methodology · Statistics 2025-01-24 Marie Analiz April Limpoco , Christel Faes , Niel Hens

A Data Analytics Framework for Aggregate Data Analysis

In many contexts, we have access to aggregate data, but individual level data is unavailable. For example, medical studies sometimes report only aggregate statistics about disease prevalence because of privacy concerns. Even so, many a time…

Machine Learning · Computer Science 2018-09-18 Sanket Tavarageri , Nag Mani , Anand Ramasubramanian , Jaskiran Kalsi

Federated generalized linear mixed models based on one-time shared summary statistics

Data privacy has increasingly become a daunting challenge because it limits data availability, which is essential in estimating statistical models such as generalized linear mixed models. Access to personal data often involves considerable…

Methodology · Statistics 2026-05-05 Marie Analiz April Limpoco , Christel Faes , Niel Hens

Assumption-lean inference for generalised linear model parameters

Inference for the parameters indexing generalised linear models is routinely based on the assumption that the model is correct and a priori specified. This is unsatisfactory because the chosen model is usually the result of a data-adaptive…

Methodology · Statistics 2020-06-16 Stijn Vansteelandt , Oliver Dukes

An Extension of Generalized Linear Models to Finite Mixture Outcome Distributions

Finite mixture distributions arise in sampling a heterogeneous population. Data drawn from such a population will exhibit extra variability relative to any single subpopulation. Statistical models based on finite mixtures can assist in the…

Methodology · Statistics 2024-01-19 Andrew M. Raim , Nagaraj K. Neerchal , Jorge G. Morel

A connection between the pattern classification problem and the General Linear Model for statistical inference

A connection between the General Linear Model (GLM) in combination with classical statistical inference and the machine learning (MLE)-based inference is described in this paper. Firstly, the estimation of the GLM parameters is expressed as…

Machine Learning · Statistics 2022-02-10 Juan Manuel Gorriz , SIPBA group , John Suckling

Generalized Linear Rule Models

This paper considers generalized linear models using rule-based features, also referred to as rule ensembles, for regression and probabilistic classification. Rules facilitate model interpretation while also capturing nonlinear dependences…

Machine Learning · Computer Science 2019-06-06 Dennis Wei , Sanjeeb Dash , Tian Gao , Oktay Günlük

Generalised Linear Models for Dependent Binary Outcomes with Applications to Household Stratified Pandemic Influenza Data

Much traditional statistical modelling assumes that the outcome variables of interest are independent of each other when conditioned on the explanatory variables. This assumption is strongly violated in the case of infectious diseases,…

Populations and Evolution · Quantitative Biology 2019-11-28 Timothy Kinyanjui , Thomas House

A Simple Correction Procedure for High-Dimensional Generalized Linear Models with Measurement Error

We consider high-dimensional generalized linear models when the covariates are contaminated by measurement error. Estimates from errors-in-variables regression models are well-known to be biased in traditional low-dimensional settings if…

Computation · Statistics 2020-01-06 Michael Byrd , Monnie McGee

Generalized Linear Models for Longitudinal Data with Biased Sampling Designs: A Sequential Offsetted Regressions Approach

Biased sampling designs can be highly efficient when studying rare (binary) or low variability (continuous) endpoints. We consider longitudinal data settings in which the probability of being sampled depends on a repeatedly measured…

Methodology · Statistics 2020-01-14 Lee S. McDaniel , Jonathan S. Schildcrout , Enrique F. Schisterman , Paul J. Rathouz

Towards Data-Algorithm Dependent Generalization: a Case Study on Overparameterized Linear Regression

One of the major open problems in machine learning is to characterize generalization in the overparameterized regime, where most traditional generalization bounds become inconsistent even for overparameterized linear regression. In many…

Machine Learning · Computer Science 2023-11-22 Jing Xu , Jiaye Teng , Yang Yuan , Andrew Chi-Chih Yao

Generalized Regression with Conditional GANs

Regression is typically treated as a curve-fitting process where the goal is to fit a prediction function to data. With the help of conditional generative adversarial networks, we propose to solve this age-old problem in a different way; we…

Machine Learning · Computer Science 2024-04-23 Deddy Jobson , Eddy Hudson

Marginally Interpretable Generalized Linear Mixed Models

Two popular approaches for relating correlated measurements of a non-Gaussian response variable to a set of predictors are to fit a marginal model using generalized estimating equations and to fit a generalized linear mixed model by…

Methodology · Statistics 2017-02-23 Jeffrey J. Gory , Peter F. Craigmile , Steven N. MacEachern

Design Issues for Generalized Linear Models: A Review

Generalized linear models (GLMs) have been used quite effectively in the modeling of a mean response under nonstandard conditions, where discrete as well as continuous data distributions can be accommodated. The choice of design for a GLM…

Statistics Theory · Mathematics 2016-08-14 André I. Khuri , Bhramar Mukherjee , Bikas K. Sinha , Malay Ghosh

A General Latent Embedding Approach for Modeling Non-uniform High-dimensional Sparse Hypergraphs with Multiplicity

Recent research has shown growing interest in modeling hypergraphs, which capture polyadic interactions among entities beyond traditional dyadic relations. However, most existing methodologies for hypergraphs face significant limitations,…

Methodology · Statistics 2025-11-04 Shihao Wu , Gongjun Xu , Ji Zhu

Adaptive Inference on General Graphical Models

Many algorithms and applications involve repeatedly solving variations of the same inference problem; for example we may want to introduce new evidence to the model or perform updates to conditional dependencies. The goal of adaptive…

Data Structures and Algorithms · Computer Science 2012-06-18 Umut A. Acar , Alexander T. Ihler , Ramgopal Mettu , Ozgur Sumer

Are Gaussian data all you need? Extents and limits of universality in high-dimensional generalized linear estimation

In this manuscript we consider the problem of generalized linear estimation on Gaussian mixture data with labels given by a single-index model. Our first result is a sharp asymptotic expression for the test and training errors in the…

Statistics Theory · Mathematics 2023-02-20 Luca Pesce , Florent Krzakala , Bruno Loureiro , Ludovic Stephan

Permutation-based multiple testing when fitting many generalized linear models

In many applied sciences a popular analysis strategy for high-dimensional data is to fit many multivariate generalized linear models in parallel. This paper presents a novel approach to address the resulting multiple testing problem by…

Statistics Theory · Mathematics 2024-10-07 Riccardo De Santis , Jelle J. Goeman , Samuel Davenport , Jesse Hemerik , Livio Finos

Additive Partially Linear Models for Massive Heterogeneous Data

We consider an additive partially linear framework for modelling massive heterogeneous data. The major goal is to extract multiple common features simultaneously across all sub-populations while exploring heterogeneity of each…

Methodology · Statistics 2019-01-01 Binhuan Wang , Yixin Fang , Heng Lian , Hua Liang