English
Related papers

Related papers: Linear Regression with Distributed Learning: A Gen…

200 papers

Distributed learning facilitates the scaling-up of data processing by distributing the computational burden over several nodes. Despite the vast interest in distributed learning, generalization performance of such approaches is not well…

Machine Learning · Statistics 2020-05-05 Martin Hellkvist , Ayça Özçelikkale , Anders Ahlén

Distributed learning is an effective way to analyze big data. In distributed regression, a typical approach is to divide the big data into multiple blocks, apply a base regression algorithm on each of them, and then simply average the…

Machine Learning · Computer Science 2017-08-08 Zhengchu Guo , Lei Shi , Qiang Wu

Distributed statistical learning problems arise commonly when dealing with large datasets. In this setup, datasets are partitioned over machines, which compute locally, and communicate short messages. Communication is often the bottleneck.…

Statistics Theory · Mathematics 2022-10-25 Edgar Dobriban , Yue Sheng

We consider estimation under scenarios where the signals of interest exhibit change of characteristics over time. In particular, we consider the continual learning problem where different tasks, e.g., data with different distributions,…

Machine Learning · Computer Science 2023-12-05 Martin Hellkvist , Ayça Özçelikkale , Anders Ahlén

At the heart of machine learning lies the question of generalizability of learned rules over previously unseen data. While over-parameterized models based on neural networks are now ubiquitous in machine learning applications, our…

Machine Learning · Computer Science 2020-05-04 Melikasadat Emami , Mojtaba Sahraee-Ardakan , Parthe Pandit , Sundeep Rangan , Alyson K. Fletcher

The generalization of machine learning models has a complex dependence on the data, model and learning algorithm. We study train and test performance, as well as the generalization gap given by the mean of their difference over different…

Machine Learning · Statistics 2022-06-29 Carlos A. Gomez-Uribe

In this paper, we investigate the effect of data heterogeneity across clients on the performance of distributed learning systems, i.e., one-round Federated Learning, as measured by the associated generalization error. Specifically, $K$…

Machine Learning · Computer Science 2025-05-21 Masoud Kavian , Romain Chor , Milad Sefidgaran , Abdellatif Zaidi

Most of today's distributed machine learning systems assume {\em reliable networks}: whenever two machines exchange information (e.g., gradients or models), the network should guarantee the delivery of the message. At the same time, recent…

Distributed, Parallel, and Cluster Computing · Computer Science 2019-05-17 Chen Yu , Hanlin Tang , Cedric Renggli , Simon Kassing , Ankit Singla , Dan Alistarh , Ce Zhang , Ji Liu

Deep learning models have lately shown great performance in various fields such as computer vision, speech recognition, speech translation, and natural language processing. However, alongside their state-of-the-art performance, it is still…

Machine Learning · Computer Science 2019-04-09 Daniel Jakubovitz , Raja Giryes , Miguel R. D. Rodrigues

We study generalised linear regression and classification for a synthetically generated dataset encompassing different problems of interest, such as learning with random features, neural networks in the lazy training regime, and the hidden…

Statistics Theory · Mathematics 2022-03-28 Federica Gerace , Bruno Loureiro , Florent Krzakala , Marc Mézard , Lenka Zdeborová

A major challenge in understanding the generalization of deep learning is to explain why (stochastic) gradient descent can exploit the network architecture to find solutions that have good generalization performance when using high capacity…

Machine Learning · Computer Science 2019-02-12 Yifan Wu , Barnabas Poczos , Aarti Singh

Symbolic regression algorithms search a space of mathematical expressions for formulas that explain given data. Transformer-based models have emerged as a promising, scalable approach shifting the expensive combinatorial search to a…

Machine Learning · Computer Science 2025-09-25 Henrik Voigt , Paul Kahlmeyer , Kai Lawonn , Michael Habeck , Joachim Giesen

Distributed learning is the problem of inferring a function in the case where training data is distributed among multiple geographically separated sources. Particularly, the focus is on designing learning strategies with low computational…

Machine Learning · Statistics 2016-07-22 Simone Scardapane

There is emerging interest in performing regression between distributions. In contrast to prediction on single instances, these machine learning methods can be useful for population-based studies or on problems that are inherently…

Machine Learning · Computer Science 2019-06-03 Connie Kou , Hwee Kuan Lee , Jorge Sanz , Teck Khim Ng

We consider a distributed estimation method in a setting with heterogeneous streams of correlated data distributed across nodes in a network. In the considered approach, linear models are estimated locally (i.e., with only local data)…

Machine Learning · Computer Science 2021-02-11 Lingzhou Hong , Alfredo Garcia , Ceyhun Eksin

Machine learning has achieved tremendous success in a variety of domains in recent years. However, a lot of these success stories have been in places where the training and the testing distributions are extremely similar to each other. In…

Machine Learning · Statistics 2021-03-05 Martin Arjovsky

This paper focuses on generalization performance analysis for distributed algorithms in the framework of learning theory. Taking distributed kernel ridge regression (DKRR) for example, we succeed in deriving its optimal learning rates in…

Machine Learning · Computer Science 2020-03-30 Shao-Bo Lin , Di Wang , Ding-Xuan Zhou

In domains such as health care and finance, shortage of labeled data and computational resources is a critical issue while developing machine learning algorithms. To address the issue of labeled data scarcity in training and deployment of…

Machine Learning · Computer Science 2018-10-16 Otkrist Gupta , Ramesh Raskar

Domain generalization is the problem of machine learning when the training data and the test data come from different data domains. We present a simple theoretical model of learning to generalize across domains in which there is a…

Machine Learning · Computer Science 2020-02-14 Vikas K. Garg , Adam Kalai , Katrina Ligett , Zhiwei Steven Wu

Generalized linear model with $L_1$ and $L_2$ regularization is a widely used technique for solving classification, class probability estimation and regression problems. With the numbers of both features and examples growing rapidly in the…

Machine Learning · Statistics 2017-06-28 Ilya Trofimov , Alexander Genkin
‹ Prev 1 2 3 10 Next ›