Related papers: Distributed Matrix Completion and Robust Factoriza…

200 papers

We propose a distributed computing framework, based on a divide and conquer strategy and hierarchical modeling, to accelerate posterior inference for high-dimensional Bayesian factor models. Our approach distributes the task of…

Methodology · Statistics 2016-12-30 Gautam Sabnis , Debdeep Pati , Barbara Engelhardt , Natesh Pillai

The sparse factorization of a large matrix is fundamental in modern statistical learning. In particular, the sparse singular value decomposition and its variants have been utilized in multivariate regression, factor analysis, biclustering,…

Machine Learning · Statistics 2020-03-19 Kun Chen , Ruipeng Dong , Wanwan Xu , Zemin Zheng

In this paper, we propose a distributed framework for reducing the dimensionality of high-dimensional, large-scale, heterogeneous matrix-variate time series data using a factor model. The data are first partitioned column-wise (or row-wise)…

Machine Learning · Statistics 2026-01-19 Hangjin Jiang , Yuzhou Li , Zhaoxing Gao

We present a matrix-factorization algorithm that scales to input matrices with a huge number of both rows and columns. Learned factors may be sparse or dense and/or non-negative, which makes our algorithm suitable for dictionary learning,…

Machine Learning · Statistics 2017-11-15 Arthur Mensch , Julien Mairal , Bertrand Thirion , Gaël Varoquaux

Matrix factorization is a common machine learning technique for recommender systems. Despite its high prediction accuracy, the Bayesian Probabilistic Matrix Factorization algorithm (BPMF) has not been widely used on large scale data because…

Distributed, Parallel, and Cluster Computing · Computer Science 2017-05-12 Tom Vander Aa , Imen Chakroun , Tom Haber

Matrix mechanisms are often used to provide unbiased differentially private query answers when publishing statistics or creating synthetic data. Recent work has developed matrix mechanisms, such as ResidualPlanner and Weighted Fourier…

Databases · Computer Science 2026-04-02 Guanlin He , Yingtai Xiao , Jiamu Bai , Xin Gu , Zeyu Ding , Wenpeng Yin , Daniel Kifer

Sparse matrix factorization is a popular tool to obtain interpretable data decompositions, which are also effective to perform data completion or denoising. Its applicability to large datasets has been addressed with online and randomized…

Machine Learning · Statistics 2017-11-15 Arthur Mensch , Julien Mairal , Bertrand Thirion , Gaël Varoquaux

This paper examines a general class of noisy matrix completion tasks where the goal is to estimate a matrix from observations obtained at a subset of its entries, each of which is subject to random noise or corruption. Our specific focus is…

Machine Learning · Statistics 2016-11-18 Akshay Soni , Swayambhoo Jain , Jarvis Haupt , Stefano Gonella

Matrix completion is one of the key problems in signal processing and machine learning. In recent years, deep-learning-based models have achieved state-of-the-art results in matrix completion. Nevertheless, they suffer from two drawbacks:…

Machine Learning · Computer Science 2018-12-05 Duc Minh Nguyen , Evaggelia Tsiligianni , Nikos Deligiannis

Noisy linear problems have been studied in various science and engineering disciplines. A class of "hard" noisy linear problems can be formulated as follows: Given a matrix $\hat{A}$ and a vector $\mathbf{b}$ constructed using a finite set…

Nonnegative matrix factorization (NMF) is a powerful technique for dimension reduction, extracting latent factors and learning part-based representation. For large datasets, NMF performance depends on some major issues: fast algorithms,…

Optimization and Control · Mathematics 2015-07-01 Duy-Khuong Nguyen , Tu-Bao Ho

We develop a message-passing algorithm for noisy matrix completion problems based on matrix factorization. The algorithm is derived by approximating message distributions of belief propagation with Gaussian distributions that share the same…

Machine Learning · Statistics 2021-10-27 Koki Okajima , Yoshiyuki Kabashima

This paper presents a class of new algorithms for distributed statistical estimation that exploit a divide-and-conquer approach. We show that one of the key benefits of the divide-and-conquer strategy is robustness, an important…

Statistics Theory · Mathematics 2018-08-29 Stanislav Minsker , Nate Strawn

Matrix decomposition is one of the fundamental tools to discover knowledge from big data generated by modern applications. However, it is still inefficient or infeasible to process very big data using such a method in a single machine.…

Machine Learning · Computer Science 2020-02-11 Chihao Zhang , Yang Yang , Wei Zhang , Shihua Zhang

Bayesian matrix factorization (BMF) is a powerful tool for producing low-rank representations of matrices and for predicting missing values and providing confidence intervals. Scaling up the posterior inference for massive-scale matrices is…

Machine Learning · Statistics 2019-02-28 Xiangju Qin , Paul Blomstedt , Eemeli Leppäaho , Pekka Parviainen , Samuel Kaski

Dynamic graph clustering aims to detect and track time-varying clusters in dynamic graphs, revealing the evolutionary mechanisms of complex real-world dynamic systems. Matrix factorization-based methods are promising approaches for this…

Machine Learning · Computer Science 2025-02-11 Dongyuan Li , Satoshi Kosugi , Ying Zhang , Manabu Okumura , Feng Xia , Renhe Jiang

This paper introduces RankMap, a platform-aware end-to-end framework for efficient execution of a broad class of iterative learning algorithms for massive and dense datasets. Our framework exploits data structure to factorize it into an…

Distributed, Parallel, and Cluster Computing · Computer Science 2016-10-28 Azalia Mirhoseini , Eva L. Dyer , Ebrahim M. Songhori , Richard G. Baraniuk , Farinaz Koushanfar

Matrix factorization exploits the idea that, in complex high-dimensional data, the actual signal typically lies in lower-dimensional structures. These lower dimensional objects provide useful insight, with interpretability favored by sparse…

Methodology · Statistics 2022-12-14 Lorenzo Schiavon , Bernardo Nipoti , Antonio Canale

We present a matrix factorization algorithm that scales to input matrices that are large in both dimensions (i.e., that contain more than 1TB of data). The algorithm streams the matrix columns while subsampling them, resulting in low…

Optimization and Control · Mathematics 2016-12-04 Arthur Mensch , Julien Mairal , Gaël Varoquaux , Bertrand Thirion

Advances in information technology have led to extremely large datasets that are often kept in different storage centers. Existing statistical methods must be adapted to overcome the resulting computational obstacles while retaining…

Methodology · Statistics 2021-11-12 Qiong Zhang , Jiahua Chen