Related papers: Large-dimensional Robust Factor Analysis with Grou…

Penalized Principal Component Analysis for Large-dimension Factor Model with Group Pursuit

This paper investigates the intrinsic group structures within the framework of large-dimensional approximate factor models, which portrays homogeneous effects of the common factors on the individuals that fall into the same group. To this…

Methodology · Statistics 2025-03-18 Yong He , Dong Liu , Guangming Pan , Yiming Wang

Factor Modelling for Biclustering Large-dimensional Matrix-valued Time Series

A novel unsupervised learning method is proposed in this paper for biclustering large-dimensional matrix-valued time series based on an entirely new latent two-way factor structure. Each block cluster is characterized by its own row and…

Methodology · Statistics 2025-02-11 Yong He , Xiaoyang Ma , Xingheng Wang , Yalin Wang

Robust Clustered Federated Learning for Heterogeneous High-dimensional Data

Federated learning has attracted significant attention as a privacy-preserving framework for training personalised models on multi-source heterogeneous data. However, most existing approaches are unable to handle scenarios where subgroup…

Methodology · Statistics 2025-10-14 Changxin Yang , Zhongyi Zhu , Heng Lian

Detection of latent heteroscedasticity and group-based regression effects in linear models via Bayesian model selection

Standard linear modeling approaches make potentially simplistic assumptions regarding the structure of categorical effects that may obfuscate more complex relationships governing data. For example, recent work focused on the two-way…

Methodology · Statistics 2019-03-05 Thomas A. Metzger , Christopher T. Franck

Sparse-Group Factor Analysis for High-Dimensional Time Series

Factor analysis is a widely used technique for dimension reduction in high-dimensional data. However, a key challenge in factor models lies in the interpretability of the latent factors. One intuitive way to interpret these factors is…

Methodology · Statistics 2025-10-08 Xin Wang , Xialu Liu

Large-dimensional Factor Analysis without Moment Constraints

Large-dimensional factor model has drawn much attention in the big-data era, in order to reduce the dimensionality and extract underlying features using a few latent common factors. Conventional methods for estimating the factor model…

Methodology · Statistics 2020-06-02 Yong He , Xinbing Kong , Long Yu , Xinsheng Zhang

A time resolved clustering method revealing longterm structures and their short-term internal dynamics

The last decades have not only been characterized by an explosive growth of data, but also an increasing appreciation of data as a valuable resource. Their value comes with the ability to extract meaningful patterns that are of economic,…

Machine Learning · Statistics 2020-02-27 Jonas I. Liechti , Sebastian Bonhoeffer

A Unified Probabilistic Model for Learning Latent Factors and Their Connectivities from High-Dimensional Data

Connectivity estimation is challenging in the context of high-dimensional data. A useful preprocessing step is to group variables into clusters, however, it is not always clear how to do so from the perspective of connectivity estimation.…

Machine Learning · Statistics 2018-05-25 Ricardo Pio Monti , Aapo Hyvärinen

Tests for Group-Specific Heterogeneity in High-Dimensional Factor Models

Standard high-dimensional factor models assume that the comovements in a large set of variables could be modeled using a small number of latent factors that affect all variables. In many relevant applications in economics and finance,…

Econometrics · Economics 2022-02-08 Antoine Djogbenou , Razvan Sufana

High-dimensional Factor Analysis for Network-linked Data

Factor analysis is a widely used statistical tool in many scientific disciplines, such as psychology, economics, and sociology. As observations linked by networks become increasingly common, incorporating network structures into factor…

Methodology · Statistics 2024-03-27 Jinming Li , Gongjun Xu , Ji Zhu

Robust Statistical Inference for Large-dimensional Matrix-valued Time Series via Iterative Huber Regression

Matrix factor model is drawing growing attention for simultaneous two-way dimension reduction of well-structured matrix-valued observations. This paper focuses on robust statistical inference for matrix factor model in the ``diverging…

Methodology · Statistics 2023-06-07 Yong He , Xin-Bing Kong , Dong Liu , Ran Zhao

Robust Bayesian Cluster Enumeration Based on the $t$ Distribution

A major challenge in cluster analysis is that the number of data clusters is mostly unknown and it must be estimated prior to clustering the observed data. In real-world applications, the observed data is often subject to heavy tailed noise…

Machine Learning · Statistics 2020-05-06 Freweyni K. Teklehaymanot , Michael Muma , Abdelhak M. Zoubir

A Sparse Factor Model for Clustering High-Dimensional Longitudinal Data

Recent advances in engineering technologies have enabled the collection of a large number of longitudinal features. This wealth of information presents unique opportunities for researchers to investigate the complex nature of diseases and…

Methodology · Statistics 2023-11-27 Zihang Lu , Noirrit Kiran Chandra

Group Factor Analysis

Factor analysis provides linear factors that describe relationships between individual variables of a data set. We extend this classical formulation into linear factors that describe relationships between groups of variables, where each…

Machine Learning · Statistics 2014-12-03 Arto Klami , Seppo Virtanen , Eemeli Leppäaho , Samuel Kaski

Factor Models with Real Data: a Robust Estimation of the Number of Factors

Factor models are a very efficient way to describe high dimensional vectors of data in terms of a small number of common relevant factors. This problem, which is of fundamental importance in many disciplines, is usually reformulated in…

Optimization and Control · Mathematics 2018-06-13 Valentina Ciccone , Augusto Ferrante , Mattia Zorzi

Factor Modelling for Clustering High-dimensional Time Series

We propose a new unsupervised learning method for clustering a large number of time series based on a latent factor structure. Each cluster is characterized by its own cluster-specific factors in addition to some common factors which impact…

Statistics Theory · Mathematics 2022-09-09 Bo Zhang , Guangming Pan , Qiwei Yao , Wang Zhou

Robust estimation for number of factors in high dimensional factor modeling via Spearman correlation matrix

Determining the number of factors in high-dimensional factor modeling is essential but challenging, especially when the data are heavy-tailed. In this paper, we introduce a new estimator based on the spectral properties of Spearman sample…

Methodology · Statistics 2024-08-29 Jiaxin Qiu , Zeng Li , Jianfeng Yao

Improving Group Lasso for high-dimensional categorical data

Sparse modelling or model selection with categorical data is challenging even for a moderate number of variables, because one parameter is roughly needed to encode one category or level. The Group Lasso is a well known efficient algorithm…

Methodology · Statistics 2022-11-14 Szymon Nowakowski , Piotr Pokarowski , Wojciech Rejchel , Agnieszka Sołtys

Network-Assisted Estimation for Large-dimensional Factor Model with Guaranteed Convergence Rate Improvement

Network structure is growing popular for capturing the intrinsic relationship between large-scale variables. In the paper we propose to improve the estimation accuracy for large-dimensional factor model when a network structure between…

Methodology · Statistics 2020-01-30 Long Yu , Yong He , Xinsheng Zhang , Ji Zhu

Random matrix approach to estimation of high-dimensional factor models

In dealing with high-dimensional data sets, factor models are often useful for dimension reduction. The estimation of factor models has been actively studied in various fields. In the first part of this paper, we present a new approach to…

Statistical Finance · Quantitative Finance 2017-11-27 Joongyeub Yeo , George Papanicolaou