Related papers: Generalized Matrix Factorization: efficient algori…

Bayesian Non-linear Latent Variable Modeling via Random Fourier Features

The Gaussian process latent variable model (GPLVM) is a popular probabilistic method used for nonlinear dimension reduction, matrix factorization, and state-space modeling. Inference for GPLVMs is computationally tractable only when the…

Machine Learning · Statistics 2023-06-16 Michael Minyi Zhang , Gregory W. Gundersen , Barbara E. Engelhardt

GLMMLasso: An Algorithm for High-Dimensional Generalized Linear Mixed Models Using L1-Penalization

We propose an L1-penalized algorithm for fitting high-dimensional generalized linear mixed models. Generalized linear mixed models (GLMMs) can be viewed as an extension of generalized linear models for clustered observations. This…

Computation · Statistics 2014-06-03 Jürg Schelldorfer , Lukas Meier , Peter Bühlmann

Generalized Matrix Factor Model

This article introduces a nonlinear generalized matrix factor model (GMFM) that allows for mixed-type variables, extending the scope of linear matrix factor models (LMFM) that are so far limited to handling continuous variables. We…

Methodology · Statistics 2024-09-17 Xinbing Kong , Tong Zhang

Efficient Computation of High-Dimensional Penalized Generalized Linear Mixed Models by Latent Factor Modeling of the Random Effects

Modern biomedical datasets are increasingly high dimensional and exhibit complex correlation structures. Generalized Linear Mixed Models (GLMMs) have long been employed to account for such dependencies. However, proper specification of the…

Methodology · Statistics 2024-04-18 Hillary M. Heiling , Naim U. Rashid , Quefeng Li , Xianlu L. Peng , Jen Jen Yeh , Joseph G. Ibrahim

Learning Latent Features with Pairwise Penalties in Low-Rank Matrix Completion

Low-rank matrix completion has achieved great success in many real-world data applications. A matrix factorization model that learns latent features is usually employed and, to improve prediction performance, the similarities between latent…

Machine Learning · Statistics 2020-01-28 Kaiyi Ji , Jian Tan , Jinfeng Xu , Yuejie Chi

Variational Inference for Generalized Linear Mixed Models Using Partially Noncentered Parametrizations

The effects of different parametrizations on the convergence of Bayesian computational algorithms for hierarchical models are well explored. Techniques such as centering, noncentering and partial noncentering can be used to accelerate…

Computation · Statistics 2015-03-20 Linda S. L. Tan , David J. Nott

Multivariate Covariance Generalized Linear Models

We propose a general framework for non-normal multivariate data analysis called multivariate covariance generalized linear models (McGLMs), designed to handle multivariate response variables, along with a wide range of temporal and spatial…

Methodology · Statistics 2017-04-25 Wagner Hugo Bonat , Bent Jørgensen

Detection of Interacting Variables for Generalized Linear Models via Neural Networks

The quality of generalized linear models (GLMs), frequently used by insurance companies, depends on the choice of interacting variables. The search for interactions is time-consuming, especially for data sets with a large number of…

Machine Learning · Statistics 2025-05-21 Yevhen Havrylenko , Julia Heger

Gaussian Process Latent Variable Model Factorization for Context-aware Recommender Systems

Context-aware recommender systems (CARS) have gained increasing attention due to their ability to utilize contextual information. Compared to traditional recommender systems, CARS are, in general, able to generate more accurate…

Machine Learning · Computer Science 2019-12-23 Wei Huang , Richard Yi Da Xu

Latent variable modeling with random features

Gaussian process-based latent variable models are flexible and theoretically grounded tools for nonlinear dimension reduction, but generalizing to non-Gaussian data likelihoods within this nonlinear framework is statistically challenging.…

Machine Learning · Statistics 2020-06-22 Gregory W. Gundersen , Michael Minyi Zhang , Barbara E. Engelhardt

LMLFM: Longitudinal Multi-Level Factorization Machine

We consider the problem of learning predictive models from longitudinal data, consisting of irregularly repeated, sparse observations from a set of individuals over time. Such data often exhibit {\em longitudinal correlation} (LC)…

Machine Learning · Statistics 2019-11-25 Junjie Liang , Dongkuan Xu , Yiwei Sun , Vasant Honavar

Generic Unsupervised Optimization for a Latent Variable Model With Exponential Family Observables

Latent variable models (LVMs) represent observed variables by parameterized functions of latent variables. Prominent examples of LVMs for unsupervised learning are probabilistic PCA or probabilistic SC which both assume a weighted linear…

Machine Learning · Computer Science 2023-12-18 Hamid Mousavi , Jakob Drefs , Florian Hirschberger , Jörg Lücke

Statistical Inference for Covariate-Adjusted and Interpretable Generalized Factor Model with Application to Testing Fairness

Latent variable models are popularly used to measure latent factors (e.g., abilities and personalities) from large-scale assessment data. Beyond understanding these latent factors, the covariate effect on responses controlling for latent…

Methodology · Statistics 2026-01-12 Jing Ouyang , Chengyu Cui , Kean Ming Tan , Gongjun Xu

LR-GLM: High-Dimensional Bayesian Inference Using Low-Rank Data Approximations

Due to the ease of modern data collection, applied statisticians often have access to a large set of covariates that they wish to relate to some observed outcome. Generalized linear models (GLMs) offer a particularly interpretable framework…

Computation · Statistics 2019-05-21 Brian L. Trippe , Jonathan H. Huggins , Raj Agrawal , Tamara Broderick

Fast Algorithms for Learning Latent Variables in Graphical Models

We study the problem of learning latent variables in Gaussian graphical models. Existing methods for this problem assume that the precision matrix of the observed variables is the superposition of a sparse and a low-rank component. In this…

Machine Learning · Statistics 2017-07-12 Mohammadreza Soltani , Chinmay Hegde

Efficient Penalized Generalized Linear Mixed Models for Variable Selection and Genetic Risk Prediction in High-Dimensional Data

Sparse regularized regression methods are now widely used in genome-wide association studies (GWAS) to address the multiple testing burden that limits discovery of potentially important predictors. Linear mixed models (LMMs) have become an…

Methodology · Statistics 2022-06-27 Julien St-Pierre , Karim Oualkacha , Sahir Rai Bhatnagar

G-HIVE: Parameter Estimation and Approximate Inference for Multivariate Response Generalized Linear Models with Hidden Variables

In practice, there often exist unobserved variables, also termed hidden variables, associated with both the response and covariates. Existing works in the literature mostly focus on linear regression with hidden variables. However, when the…

Methodology · Statistics 2025-09-03 Inbeom Lee , Yang Ning

Deviance Matrix Factorization

We investigate a general matrix factorization for deviance-based data losses, extending the ubiquitous singular value decomposition beyond squared error loss. While similar approaches have been explored before, our method leverages…

Machine Learning · Statistics 2023-07-04 Liang Wang , Luis Carvalho

Scalable Bayesian Non-linear Matrix Completion

Matrix completion aims to predict missing elements in a partially observed data matrix which in typical applications, such as collaborative filtering, is large and extremely sparsely observed. A standard solution is matrix factorization,…

Machine Learning · Computer Science 2019-08-06 Xiangju Qin , Paul Blomstedt , Samuel Kaski

Sublinear Variational Optimization of Gaussian Mixture Models with Millions to Billions of Parameters

Gaussian Mixture Models (GMMs) range among the most frequently used models in machine learning. However, training large, general GMMs becomes computationally prohibitive for datasets that have many data points $N$ of high-dimensionality…

Machine Learning · Statistics 2025-12-12 Sebastian Salwig , Till Kahlke , Florian Hirschberger , Dennis Forster , Jörg Lücke