Related papers: Divide and Conquer Local Average Regression

Divide-and-conquer methods for big data analysis

In the context of big data analysis, the divide-and-conquer methodology refers to a multiple-step process: first splitting a data set into several smaller ones; then analyzing each set separately; finally combining results from each…

Machine Learning · Statistics 2021-02-23 Xueying Chen , Jerry Q. Cheng , Min-ge Xie

Distributed Statistical Estimation and Rates of Convergence in Normal Approximation

This paper presents a class of new algorithms for distributed statistical estimation that exploit divide-and-conquer approach. We show that one of the key benefits of the divide-and-conquer strategy is robustness, an important…

Statistics Theory · Mathematics 2018-08-29 Stanislav Minsker , Nate Strawn

$DC^2$: A Divide-and-conquer Algorithm for Large-scale Kernel Learning with Application to Clustering

Divide-and-conquer is a general strategy to deal with large scale problems. It is typically applied to generate ensemble instances, which potentially limits the problem size it can handle. Additionally, the data are often divided by random…

Machine Learning · Computer Science 2019-11-19 Ke Alexander Wang , Xinran Bian , Pan Liu , Donghui Yan

Divide and Conquer Networks

We consider the learning of algorithmic tasks by mere observation of input-output pairs. Rather than studying this as a black-box discrete regression problem with no assumption whatsoever on the input-output mapping, we concentrate on tasks…

Machine Learning · Computer Science 2018-10-16 Alex Nowak-Vila , David Folqué , Joan Bruna

Max-Diversity Distributed Learning: Theory and Algorithms

We study the risk performance of distributed learning for the regularization empirical risk minimization with fast convergence rate, substantially improving the error analysis of the existing divide-and-conquer based distributed learning.…

Machine Learning · Computer Science 2019-01-21 Yong Liu , Jian Li , Weiping Wang

Two provably consistent divide and conquer clustering algorithms for large networks

In this article, we advance divide-and-conquer strategies for solving the community detection problem in networks. We propose two algorithms which perform clustering on a number of small subgraphs and finally patches the results into a…

Machine Learning · Statistics 2017-08-21 Soumendu Sundar Mukherjee , Purnamrita Sarkar , Peter J. Bickel

Divide-and-Conquer Reinforcement Learning

Standard model-free deep reinforcement learning (RL) algorithms sample a new initial state for each trial, allowing them to optimize policies that can perform well even in highly stochastic environments. However, problems that exhibit…

Machine Learning · Computer Science 2018-04-30 Dibya Ghosh , Avi Singh , Aravind Rajeswaran , Vikash Kumar , Sergey Levine

On the Feasibility of Distributed Kernel Regression for Big Data

In modern scientific research, massive datasets with huge numbers of observations are frequently encountered. To facilitate the computational process, a divide-and-conquer scheme is often used for the analysis of big data. In such a…

Machine Learning · Statistics 2015-05-06 Chen Xu , Yongquan Zhang , Runze Li

A Massive Data Framework for M-Estimators with Cubic-Rate

The divide and conquer method is a common strategy for handling massive data. In this article, we study the divide and conquer method for cubic-rate estimators under the massive data framework. We develop a general theory for establishing…

Statistics Theory · Mathematics 2017-04-06 Chengchun Shi , Wenbin Lu , Rui Song

Divide and Conquer Kernel Ridge Regression: A Distributed Algorithm with Minimax Optimal Rates

We establish optimal convergence rates for a decomposition-based scalable approach to kernel ridge regression. The method is simple to describe: it randomly partitions a dataset of size N into m subsets of equal size, computes an…

Statistics Theory · Mathematics 2014-05-01 Yuchen Zhang , John C. Duchi , Martin J. Wainwright

Robust and Explainable Divide-and-Conquer Learning for Intrusion Detection

Machine learning-based intrusion detection requires complex models to capture patterns in high-dimensional, noisy, and class-imbalanced raw network traffic, yet deploying such models remains impractical on resource-constrained devices with…

Machine Learning · Computer Science 2026-05-05 Yan Zhou , Kevin Hamlen , Michael De Lucia , Murat Kantarcioglu , Latifur Khan , Sharad Mehrotra , Ananthram Swami , Bhavani Thuraisingham

Uncertainty quantification for distributed regression

The ever-growing size of the datasets renders well-studied learning techniques, such as Kernel Ridge Regression, inapplicable, posing a serious computational challenge. Divide-and-conquer is a common remedy, suggesting to split the dataset…

Machine Learning · Statistics 2021-05-25 Valeriy Avanesov

Distributed sequential method for analyzing massive data

To analyse a very large data set containing lengthy variables, we adopt a sequential estimation idea and propose a parallel divide-and-conquer method. We conduct several conventional sequential estimation procedures separately, and properly…

Methodology · Statistics 2018-12-27 Zhanfeng Wang , Yuan-chin Ivan Chang

Divide and Learn: A Divide and Conquer Approach for Predict+Optimize

The predict+optimize problem combines machine learning ofproblem coefficients with a combinatorial optimization prob-lem that uses the predicted coefficients. While this problemcan be solved in two separate stages, it is better to…

Machine Learning · Computer Science 2020-12-07 Ali Ugur Guler , Emir Demirovic , Jeffrey Chan , James Bailey , Christopher Leckie , Peter J. Stuckey

Optimal Rates of Distributed Regression with Imperfect Kernels

Distributed machine learning systems have been receiving increasing attentions for their efficiency to process large scale data. Many distributed frameworks have been proposed for different machine learning tasks. In this paper, we study…

Machine Learning · Computer Science 2020-07-01 Hongwei Sun , Qiang Wu

A divide and conquer method for symbolic regression

Symbolic regression aims to find a function that best explains the relationship between independent variables and the objective value based on a given set of sample data. Genetic programming (GP) is usually considered as an appropriate…

Neural and Evolutionary Computing · Computer Science 2022-09-26 Changtong Luo , Chen Chen , Zonglin Jiang

Divide and Conquer in Non-standard Problems and the Super-efficiency Phenomenon

We study how the divide and conquer principle --- partition the available data into subsamples, compute an estimate from each subsample and combine these appropriately to form the final estimator --- works in non-standard problems where…

Statistics Theory · Mathematics 2016-11-18 Moulinath Banerjee , Cecile Durot , Bodhisattva Sen

Learning Theory of Distributed Regression with Bias Corrected Regularization Kernel Network

Distributed learning is an effective way to analyze big data. In distributed regression, a typical approach is to divide the big data into multiple blocks, apply a base regression algorithm on each of them, and then simply average the…

Machine Learning · Computer Science 2017-08-08 Zhengchu Guo , Lei Shi , Qiang Wu

Divide-and-conquer based Large-Scale Spectral Clustering

Spectral clustering is one of the most popular clustering methods. However, how to balance the efficiency and effectiveness of the large-scale spectral clustering with limited computing resources has not been properly solved for a long…

Machine Learning · Computer Science 2022-07-12 Hongmin Li , Xiucai Ye , Akira Imakura , Tetsuya Sakurai

Kernel Ridge Regression via Partitioning

In this paper, we investigate a divide and conquer approach to Kernel Ridge Regression (KRR). Given n samples, the division step involves separating the points based on some underlying disjoint partition of the input space (possibly via…

Machine Learning · Statistics 2016-08-08 Rashish Tandon , Si Si , Pradeep Ravikumar , Inderjit Dhillon