Related papers: Distributed multinomial regression

Multinomial Inverse Regression for Text Analysis

Text data, including speeches, stories, and other document forms, are often connected to sentiment variables that are of interest for research in marketing, economics, and elsewhere. It is also very high dimensional and difficult to…

Methodology · Statistics 2015-03-17 Matt Taddy

Review of Probability Distributions for Modeling Count Data

Count data take on non-negative integer values and are challenging to properly analyze using standard linear-Gaussian methods such as linear regression and principal components analysis. Generalized linear models enable direct modeling of…

Methodology · Statistics 2020-01-14 F. William Townes

A new multivariate Poisson model

Multi-dimensional data frequently occur in many different fields, including risk management, insurance, biology, environmental sciences, and many more. In analyzing multivariate data, it is imperative that the underlying modelling…

Methodology · Statistics 2025-06-23 Orla A. Murphy, Juliana Schulz

Iterative Distributed Multinomial Regression

This article introduces an iterative distributed computing estimator for the multinomial logistic regression model with large choice sets. Compared to the maximum likelihood estimator, the proposed iterative distributed estimator achieves…

Econometrics · Economics 2024-12-03 Yanqin Fan, Yigit Okar, Xuetao Shi

High-Dimensional Covariate-Augmented Overdispersed Multi-Study Poisson Factor Model

Factor analysis for high-dimensional data is a canonical problem in statistics and has a wide range of applications. However, there is currently no factor model tailored to effectively analyze high-dimensional count responses with…

Methodology · Statistics 2024-08-21 Wei Liu, Qingzhi Zhong

A Review of Multivariate Distributions for Count Data Derived from the Poisson Distribution

The Poisson distribution has been widely studied and used for modeling univariate count-valued data. Multivariate generalizations of the Poisson distribution that permit dependencies, however, have been far less popular. Yet, real-world…

Methodology · Statistics 2016-12-28 David I. Inouye, Eunho Yang, Genevera I. Allen, Pradeep Ravikumar

A Generalized Multinomial Distribution from Dependent Categorical Random Variables

Categorical random variables are a common staple in machine learning methods and other applications across disciplines. Many times, correlation within categorical predictors exists, and has been noted to have an effect on various algorithm…

Probability · Mathematics 2017-01-25 Rachel Traylor

A flexible regression model for count data

Poisson regression is a popular tool for modeling count data and is applied in a vast array of applications from the social to the physical sciences and beyond. Real data, however, are often over- or under-dispersed and, thus, not conducive…

Applications · Statistics 2010-11-10 Kimberly F. Sellers , Galit Shmueli

Evaluating High-Order Predictive Distributions in Deep Learning

Most work on supervised learning research has focused on marginal predictions. In decision problems, joint predictive distributions are essential for good performance. Previous work has developed methods for assessing low-order predictive…

Machine Learning · Statistics 2022-03-01 Ian Osband, Zheng Wen, Seyed Mohammad Asghari, Vikranth Dwaracherla, Xiuyuan Lu, Benjamin Van Roy

DMAP: A Distribution Map for Text

Large Language Models (LLMs) are a powerful tool for statistical text analysis, with derived sequences of next-token probability distributions offering a wealth of information. Extracting this signal typically relies on metrics such as…

Computation and Language · Computer Science 2026-05-15 Tom Kempton, Julia Rozanova, Parameswaran Kamalaruban, Maeve Madigan, Karolina Wresilo, Yoann L. Launay, David Sutton, Stuart Burrell

Transition Models for Count Data: a Flexible Alternative to Fixed Distribution Models

A flexible semiparametric class of models is introduced that offers an alternative to classical regression models for count data as the Poisson and negative binomial model, as well as to more general models accounting for excess zeros that…

Methodology · Statistics 2020-03-30 Moritz Berger, Gerhard Tutz

Distributed Parameter Estimation via Pseudo-likelihood

Estimating statistical models within sensor networks requires distributed algorithms, in which both data and computation are distributed across the nodes of the network. We propose a general approach for distributed learning based on…

Machine Learning · Computer Science 2012-07-03 Qiang Liu, Alexander Ihler

Multivariate Count Autoregression

We are studying the problems of modeling and inference for multivariate count time series data with Poisson marginals. The focus is on linear and log-linear models. For studying the properties of such processes we develop a novel conceptual…

Methodology · Statistics 2017-04-10 Paul Doukhan, Konstantinos Fokianos, Bård Støve, Dag Tjøstheim

Multivariate generalized linear mixed models for underdispersed count data

Researchers are often interested in understanding the relationship between a set of covariates and a set of response variables. To achieve this goal, the use of regression analysis, either linear or generalized linear models, is largely…

Methodology · Statistics 2023-03-08 Guilherme Parreira da Silva, Henrique Aparecido Laureano, Ricardo Rasmussen Petterle, Paulo Justiniano Ribeiro Júnior, Wagner Hugo Bonat

Distributional Models and Deep Learning Embeddings: Combining the Best of Both Worlds

There are two main approaches to the distributed representation of words: low-dimensional deep learning embeddings and high-dimensional distributional models, in which each dimension corresponds to a context word. In this paper, we combine…

Computation and Language · Computer Science 2014-02-19 Irina Sergienya, Hinrich Schütze

A Simple and Adaptive Dispersion Regression Model for Count Data

Regression for count data is widely performed by models such as Poisson, negative binomial (NB) and zero-inflated regression. A challenge often faced by practitioners is the selection of the right model to take into account dispersion,…

Methodology · Statistics 2018-08-02 Hadeel S. Klakattawi, Veronica Vinciotti, Keming Yu

Boosting Distributional Copula Regression

Capturing complex dependence structures between outcome variables (e.g., study endpoints) is of high relevance in contemporary biomedical data problems and medical research. Distributional copula regression provides a flexible tool to model…

Methodology · Statistics 2022-02-28 Nicolai Hans, Nadja Klein, Florian Faschingbauer, Michael Schneider, Andreas Mayr

Bayesian Multinomial Logistic Regression for Numerous Categories

Bayesian multinomial logistic regression provides a principled, interpretable approach to multiclass classification, but posterior sampling becomes increasingly expensive as the model dimension grows. Prior work has studied scalability in…

Computation · Statistics 2026-02-27 Jared D. Fisher, Kyle R. McEvoy

Multilingual Distributed Representations without Word Alignment

Distributed representations of meaning are a natural way to encode covariance relationships between words and phrases in NLP. By overcoming data sparsity problems, as well as providing information about semantic relatedness which is not…

Computation and Language · Computer Science 2014-03-21 Karl Moritz Hermann, Phil Blunsom

Adapting Deep Learning for Sentiment Classification of Code-Switched Informal Short Text

Nowadays, an abundance of short text is being generated that uses nonstandard writing styles influenced by regional languages. Such informal and code-switched content are under-resourced in terms of labeled datasets and language models even…

Computation and Language · Computer Science 2020-04-07 Muhammad Haroon Shakeel, Asim Karim