Related papers: Penalized Clustering of Large Scale Functional Dat…

Penalized model-based clustering with cluster-specific diagonal covariance matrices and grouped variables

Clustering analysis is one of the most widely used statistical tools in many emerging areas such as microarray data analysis. For microarray and other high-dimensional data, the presence of many noise variables may mask underlying…

Machine Learning · Statistics 2008-03-26 Benhuai Xie , Wei Pan , Xiaotong Shen

Multivariate Functional Clustering with Variable Selection and Application to Sensor Data from Engineering Systems

Multi-sensor data that track system operating behaviors are widely available nowadays from various engineering systems. Measurements from each sensor over time form a curve and can be viewed as functional data. Clustering of these…

Methodology · Statistics 2024-01-08 Zhongnan Jin , Jie Min , Yili Hong , Pang Du , Qingyu Yang

A Cluster Elastic Net for Multivariate Regression

We propose a method for estimating coefficients in multivariate regression when there is a clustering structure to the response variables. The proposed method includes a fusion penalty, to shrink the difference in fitted values from…

Machine Learning · Statistics 2018-03-28 Bradley S. Price , Ben Sherwood

Clustering of longitudinal curves via a penalized method and EM algorithm

In this article, a new method, called FWP, is proposed for clustering longitudinal curves. In the proposed method, clusters of mean functions are identified through a weighted concave pairwise fusion method. The EM algorithm and the…

Methodology · Statistics 2023-06-14 Xin Wang

Model-based Clustering with Sparse Covariance Matrices

Finite Gaussian mixture models are widely used for model-based clustering of continuous data. Nevertheless, since the number of model parameters scales quadratically with the number of variables, these models can be easily…

Methodology · Statistics 2018-09-25 Michael Fop , Thomas Brendan Murphy , Luca Scrucca

Clustering and variable selection for categorical multivariate data

This article investigates unsupervised classification techniques for categorical multivariate data. The study employs multivariate multinomial mixture modeling, which is a type of model particularly applicable to multilocus genotypic data.…

Statistics Theory · Mathematics 2014-03-11 Dominique Bontemps , Wilson Toussile

Cluster weighted models with multivariate skewed distributions for functional data

We propose a clustering method, funWeightClustSkew, based on mixtures of functional linear regression models and three skewed multivariate distributions: the variance-gamma distribution, the skew-t distribution, and the normal-inverse…

Methodology · Statistics 2025-04-18 Cristina Anton , Roy Shivam Ram Shreshtth

Cluster weighted models for functional data

We propose a method, funWeightClust, based on a family of parsimonious models for clustering heterogeneous functional linear regression data. These models extend cluster weighted models to functional data, and they allow for multivariate…

Methodology · Statistics 2025-03-10 Cristina Anton , Iain Smith

Clustering functional data with measurement errors: a simulation-based approach

Clustering analysis of functional data, which comprises observations that evolve continuously over time or space, has gained increasing attention across various scientific disciplines. Practical applications often involve functional data…

Methodology · Statistics 2024-06-19 Tingyu Zhu , Lan Xue , Carmen Tekwe , Keith Diaz , Mark Benden , Roger Zoh

L1-Penalization for Mixture Regression Models

We consider a finite mixture of regressions (FMR) model for high-dimensional inhomogeneous data where the number of covariates may be much larger than sample size. We propose an l1-penalized maximum likelihood estimator in an appropriate…

Methodology · Statistics 2012-02-28 Nicolas Städler , Peter Bühlmann , Sara van de Geer

Clustering of functional data prone to complex heteroscedastic measurement error

Several factors make clustering of functional data challenging, including the infinite-dimensional space to which observations belong and the lack of a defined probability density function for the functional random variable. To overcome…

Methodology · Statistics 2025-02-03 Andi Mai , Lan Xue , Roger Zoh , Carmen Tekwe

Inference for Multivariate Normal Mixtures

Multivariate normal mixtures provide a flexible model for high-dimensional data. They are widely used in statistical genetics, statistical finance, and other disciplines. Due to the unboundedness of the likelihood function, classical…

Statistics Theory · Mathematics 2008-05-27 Jiahua Chen , Xianming Tan

Flexible Clustering with a Sparse Mixture of Generalized Hyperbolic Distributions

Robust clustering of high-dimensional data is an important topic because clusters in real datasets are often heavy-tailed and/or asymmetric. Traditional approaches to model-based clustering often fail for high dimensional data, e.g., due to…

Methodology · Statistics 2024-06-07 Alexa A. Sochaniwsky , Michael P. B. Gallaugher , Yang Tang , Paul D. McNicholas

Probability Weighted Clustered Coefficients Regression Models in Complex Survey Sampling

Regression analysis is commonly conducted in survey sampling. However, existing methods fail when the relationships vary across different areas or domains. In this paper, we propose a unified framework to study the group-wise covariate…

Methodology · Statistics 2024-09-25 Mingjun Gang , Xin Wang , Zhonglei Wang , Wei Zhong

Fast Penalized Generalized Estimating Equations for Large Longitudinal Functional Datasets

Longitudinal binary or count functional data are common in neuroscience, but are often too large to analyze with existing functional regression methods. We propose one-step penalized generalized estimating equations that supports…

Methodology · Statistics 2026-03-31 Gabriel Loewinger , Alex W. Levis , Erjia Cui , Francisco Pereira

Model-Based Clustering and Classification of Functional Data

The problem of complex data analysis is a central topic of modern statistical science and learning systems and is becoming of broader interest with the increasing prevalence of high-dimensional data. The challenge is to develop statistical…

Machine Learning · Statistics 2018-03-05 Faicel Chamroukhi , Hien D. Nguyen

For data with high-dimensional covariates but small to moderate sample sizes, the analysis of single datasets often generates unsatisfactory results. The integrative analysis of multiple independent datasets provides an effective way of…

Methodology · Statistics 2015-01-19 Yuan Huang , Qingzhao Zhang , Sanguo Zhang , Jian Huang , Shuangge Ma

Model-Based Clustering of Functional Data Via Random Projection Ensembles

Clustering functional data is a challenging task due to intrinsic infinite-dimensionality and the need for stable, data-adaptive partitioning. In this work, we propose a clustering framework based on Random Projections, which simultaneously…

Methodology · Statistics 2025-12-18 Matteo Mori , Laura Anderlucci

Hierarchical Total Variations and Doubly Penalized ANOVA Modeling for Multivariate Nonparametric Regression

For multivariate nonparametric regression, functional analysis-of-variance (ANOVA) modeling aims to capture the relationship between a response and covariates by decomposing the unknown function into various components, representing main…

Methodology · Statistics 2019-06-20 Ting Yang , Zhiqiang Tan

Clustering multivariate functional data using unsupervised binary trees

We propose a model-based clustering algorithm for a general class of functional data for which the components could be curves or images. The random functional data realizations could be measured with error at discrete, and possibly random,…

Machine Learning · Statistics 2022-03-14 Steven Golovkine , Nicolas Klutchnikoff , Valentin Patilea