English
Related papers

Related papers: Generalized Linear Models for Aggregated Data

200 papers

The assumption that response and predictor belong to the same statistical unit may be violated in practice. Unbiased estimation and recovery of true label ordering based on unlabeled data are challenging tasks and have attracted increasing…

Methodology · Statistics 2022-06-24 Guanhua Fang , Ping Li

In medical research, individual-level patient data provide invaluable information, but the patients' right to confidentiality remains of utmost priority. This poses a huge challenge when estimating statistical models such as linear mixed…

Methodology · Statistics 2025-01-24 Marie Analiz April Limpoco , Christel Faes , Niel Hens

In many contexts, we have access to aggregate data, but individual level data is unavailable. For example, medical studies sometimes report only aggregate statistics about disease prevalence because of privacy concerns. Even so, many a time…

Machine Learning · Computer Science 2018-09-18 Sanket Tavarageri , Nag Mani , Anand Ramasubramanian , Jaskiran Kalsi

Data privacy has increasingly become a daunting challenge because it limits data availability, which is essential in estimating statistical models such as generalized linear mixed models. Access to personal data often involves considerable…

Methodology · Statistics 2026-05-05 Marie Analiz April Limpoco , Christel Faes , Niel Hens

Inference for the parameters indexing generalised linear models is routinely based on the assumption that the model is correct and a priori specified. This is unsatisfactory because the chosen model is usually the result of a data-adaptive…

Methodology · Statistics 2020-06-16 Stijn Vansteelandt , Oliver Dukes

Finite mixture distributions arise in sampling a heterogeneous population. Data drawn from such a population will exhibit extra variability relative to any single subpopulation. Statistical models based on finite mixtures can assist in the…

Methodology · Statistics 2024-01-19 Andrew M. Raim , Nagaraj K. Neerchal , Jorge G. Morel

A connection between the General Linear Model (GLM) in combination with classical statistical inference and the machine learning (MLE)-based inference is described in this paper. Firstly, the estimation of the GLM parameters is expressed as…

Machine Learning · Statistics 2022-02-10 Juan Manuel Gorriz , SIPBA group , John Suckling

This paper considers generalized linear models using rule-based features, also referred to as rule ensembles, for regression and probabilistic classification. Rules facilitate model interpretation while also capturing nonlinear dependences…

Machine Learning · Computer Science 2019-06-06 Dennis Wei , Sanjeeb Dash , Tian Gao , Oktay Günlük

Much traditional statistical modelling assumes that the outcome variables of interest are independent of each other when conditioned on the explanatory variables. This assumption is strongly violated in the case of infectious diseases,…

Populations and Evolution · Quantitative Biology 2019-11-28 Timothy Kinyanjui , Thomas House

We consider high-dimensional generalized linear models when the covariates are contaminated by measurement error. Estimates from errors-in-variables regression models are well-known to be biased in traditional low-dimensional settings if…

Computation · Statistics 2020-01-06 Michael Byrd , Monnie McGee

Biased sampling designs can be highly efficient when studying rare (binary) or low variability (continuous) endpoints. We consider longitudinal data settings in which the probability of being sampled depends on a repeatedly measured…

One of the major open problems in machine learning is to characterize generalization in the overparameterized regime, where most traditional generalization bounds become inconsistent even for overparameterized linear regression. In many…

Machine Learning · Computer Science 2023-11-22 Jing Xu , Jiaye Teng , Yang Yuan , Andrew Chi-Chih Yao

Regression is typically treated as a curve-fitting process where the goal is to fit a prediction function to data. With the help of conditional generative adversarial networks, we propose to solve this age-old problem in a different way; we…

Machine Learning · Computer Science 2024-04-23 Deddy Jobson , Eddy Hudson

Two popular approaches for relating correlated measurements of a non-Gaussian response variable to a set of predictors are to fit a marginal model using generalized estimating equations and to fit a generalized linear mixed model by…

Methodology · Statistics 2017-02-23 Jeffrey J. Gory , Peter F. Craigmile , Steven N. MacEachern

Generalized linear models (GLMs) have been used quite effectively in the modeling of a mean response under nonstandard conditions, where discrete as well as continuous data distributions can be accommodated. The choice of design for a GLM…

Statistics Theory · Mathematics 2016-08-14 André I. Khuri , Bhramar Mukherjee , Bikas K. Sinha , Malay Ghosh

Recent research has shown growing interest in modeling hypergraphs, which capture polyadic interactions among entities beyond traditional dyadic relations. However, most existing methodologies for hypergraphs face significant limitations,…

Methodology · Statistics 2025-11-04 Shihao Wu , Gongjun Xu , Ji Zhu

Many algorithms and applications involve repeatedly solving variations of the same inference problem; for example we may want to introduce new evidence to the model or perform updates to conditional dependencies. The goal of adaptive…

Data Structures and Algorithms · Computer Science 2012-06-18 Umut A. Acar , Alexander T. Ihler , Ramgopal Mettu , Ozgur Sumer

In this manuscript we consider the problem of generalized linear estimation on Gaussian mixture data with labels given by a single-index model. Our first result is a sharp asymptotic expression for the test and training errors in the…

Statistics Theory · Mathematics 2023-02-20 Luca Pesce , Florent Krzakala , Bruno Loureiro , Ludovic Stephan

In many applied sciences a popular analysis strategy for high-dimensional data is to fit many multivariate generalized linear models in parallel. This paper presents a novel approach to address the resulting multiple testing problem by…

Statistics Theory · Mathematics 2024-10-07 Riccardo De Santis , Jelle J. Goeman , Samuel Davenport , Jesse Hemerik , Livio Finos

We consider an additive partially linear framework for modelling massive heterogeneous data. The major goal is to extract multiple common features simultaneously across all sub-populations while exploring heterogeneity of each…

Methodology · Statistics 2019-01-01 Binhuan Wang , Yixin Fang , Heng Lian , Hua Liang
‹ Prev 1 2 3 10 Next ›