Related papers: Blocked Clusterwise Regression

Inference in High-Dimensional Panel Models: Two-Way Dependence and Unobserved Heterogeneity

Panel data allows for the modeling of unobserved heterogeneity, significantly raising the number of nuisance parameters and making high dimensionality a practical issue. Meanwhile, temporal and cross-sectional dependence in panel data…

Econometrics · Economics 2025-12-23 Kaicheng Chen

Panel data models with randomly generated groups

We develop a structural framework for modeling and inferring unobserved heterogeneity in dynamic panel-data models. Unlike methods treating clustering as a descriptive device, we model heterogeneity as arising from a latent clustering…

Econometrics · Economics 2025-10-29 Jean-Pierre Florens , Anna Simoni

Improved Clustering with Augmented k-means

Identifying a set of homogeneous clusters in a heterogeneous dataset is one of the most important classes of problems in statistical modeling. In the realm of unsupervised partitional clustering, k-means is a very important algorithm for…

Machine Learning · Statistics 2017-05-23 J. Andrew Howe

Hierarchical and Density-based Causal Clustering

Understanding treatment effect heterogeneity is vital for scientific and policy research. However, identifying and evaluating heterogeneous treatment effects pose significant challenges due to the typically unknown subgroup structure.…

Methodology · Statistics 2024-11-05 Kwangho Kim , Jisu Kim , Larry A. Wasserman , Edward H. Kennedy

A unified framework for model-based clustering, linear regression and multiple cluster structure detection

A general framework for dealing with both linear regression and clustering problems is described. It includes Gaussian clusterwise linear regression analysis with random covariates and cluster analysis via Gaussian mixture models with…

Methodology · Statistics 2015-10-13 Giuliano Galimberti , Annamaria Manisi , Gabriele Soffritti

Inference after discretizing time-varying unobserved heterogeneity

Approximating time-varying unobserved heterogeneity by discrete types has become increasingly popular in economics. Yet, provably valid post-clustering inference for target parameters in models that do not impose an exact group structure is…

Econometrics · Economics 2025-10-20 Jad Beyhum , Martin Mugnier

Testing Clustered Equal Predictive Ability with Unknown Clusters

This paper proposes a selective inference procedure for testing equal predictive ability in panel data settings with unknown heterogeneity. The framework allows predictive performance to vary across unobserved clusters and accounts for the…

Econometrics · Economics 2025-07-29 Oguzhan Akgun , Alain Pirotte , Giovanni Urga , Zhenlin Yang

Discretizing Unobserved Heterogeneity

We study discrete panel data methods where unobserved heterogeneity is revealed in a first step, in environments where population heterogeneity is not discrete. We focus on two-step grouped fixed-effects (GFE) estimators, where individuals…

Econometrics · Economics 2021-02-04 Stéphane Bonhomme , Thibaut Lamadon , Elena Manresa

Nested hidden Markov chains for modeling dynamic unobserved heterogeneity in multilevel longitudinal data

In the context of multilevel longitudinal data, where sample units are collected in clusters, an important aspect that should be accounted for is the unobserved heterogeneity between sample units and between clusters. For this aim we…

Statistics Theory · Mathematics 2012-08-10 F. Bartolucci , M. Lupparelli

Homogeneity Pursuit in Single Index Models based Panel Data Analysis

Panel data analysis is an important topic in statistics and econometrics. Traditionally, in panel data analysis, all individuals are assumed to share the same unknown parameters, e.g. the same coefficients of covariates when the linear…

Statistics Theory · Mathematics 2017-06-09 Heng Lian , Xinghao Qiao , Wenyang Zhang

An introduction and tutorial to model-based clustering in education via Gaussian mixture modelling

Heterogeneity has been a hot topic in recent educational literature. Several calls have been voiced to adopt methods that capture different patterns or subgroups within students' behavior or functioning. Assuming that there is an average…

Methodology · Statistics 2023-06-13 Luca Scrucca , Mohammed Saqr , Sonsoles López-Pernas , Keefe Murphy

Identification and Estimation of Discrete Choice Models with Unobserved Choice Sets

We propose a framework for nonparametric identification and estimation of discrete choice models with unobserved choice sets. We recover the joint distribution of choice sets and preferences from a panel dataset on choices. We assume that…

Econometrics · Economics 2021-06-22 Victor H. Aguiar , Nail Kashaev

Clustered Covariate Regression

High covariate dimensionality is increasingly occurrent in model estimation, and existing techniques to address this issue typically require sparsity or discrete heterogeneity of the unobservable parameter vector. However, neither…

Econometrics · Economics 2025-07-31 Abdul-Nasah Soale , Emmanuel Selorm Tsyawo

Deep Unsupervised Clustering with Clustered Generator Model

This paper addresses the problem of unsupervised clustering which remains one of the most fundamental challenges in machine learning and artificial intelligence. We propose the clustered generator model for clustering which contains both…

Machine Learning · Statistics 2019-11-20 Dandan Zhu , Tian Han , Linqi Zhou , Xiaokang Yang , Ying Nian Wu

Unsupervised Learning in a General Semiparametric Clusterwise Index Distribution Model

This study introduces a general semiparametric clusterwise index distribution model to analyze how latent clusters affect the covariate-response relationships. By employing sufficient dimension reduction to account for the effects of…

Methodology · Statistics 2025-09-30 Jen-Chieh Teng , Chin-Tsang Chiang

Scalable Regularised Joint Mixture Models

In many applications, data can be heterogeneous in the sense of spanning latent groups with different underlying distributions. When predictive models are applied to such data the heterogeneity can affect both predictive performance and…

Machine Learning · Statistics 2022-05-04 Thomas Lartigue , Sach Mukherjee

Supervised Convex Clustering

Clustering has long been a popular unsupervised learning approach to identify groups of similar objects and discover patterns from unlabeled data in many applications. Yet, coming up with meaningful interpretations of the estimated clusters…

Methodology · Statistics 2020-05-26 Minjie Wang , Tianyi Yao , Genevera I. Allen

Group-Average and Convex Clustering for Partially Heterogeneous Linear Regression

In this paper, a subgroup least squares and a convex clustering are introduced for inferring a partially heterogeneous linear regression that has potential application in the areas of precision marketing and precision medicine. The…

Methodology · Statistics 2017-11-02 Lu Lin , Jun Lu , Chen Lin

Irregular Identification of Structural Models with Nonparametric Unobserved Heterogeneity

One of the most important empirical findings in microeconometrics is the pervasiveness of heterogeneity in economic behaviour (cf. Heckman 2001). This paper shows that cumulative distribution functions and quantiles of the nonparametric…

Econometrics · Economics 2020-05-19 Juan Carlos Escanciano

Benchmark and application of unsupervised classification approaches for univariate data

Unsupervised machine learning, and in particular data clustering, is a powerful approach for the analysis of datasets and identification of characteristic features occurring throughout a dataset. It is gaining popularity across scientific…

Mesoscale and Nanoscale Physics · Physics 2021-03-23 Maria El Abbassi , Jan Overbeck , Oliver Braun , Michel Calame , Herre S. J. van der Zant , Mickael L. Perrin