Related papers: Memory-efficient training with streaming dimension…

Efficient Principal Subspace Projection of Streaming Data Through Fast Similarity Matching

Big data problems frequently require processing datasets in a streaming fashion, either because all data are available at once but collectively are larger than available memory or because the data intrinsically arrive one data point at a…

Computation · Statistics 2018-08-08 Andrea Giovannucci , Victor Minden , Cengiz Pehlevan , Dmitri B. Chklovskii

On the performance of deep learning models for time series classification in streaming

Processing data streams arriving at high speed requires the development of models that can provide fast and accurate predictions. Although deep neural networks are the state-of-the-art for many machine learning tasks, their performance in…

Machine Learning · Computer Science 2020-04-07 Pedro Lara-Benítez , Manuel Carranza-García , Francisco Martínez-Álvarez , José C. Riquelme

History PCA: A New Algorithm for Streaming PCA

In this paper we propose a new algorithm for streaming principal component analysis. With limited memory, small devices cannot store all the samples in the high-dimensional regime. Streaming principal component analysis aims to find the…

Machine Learning · Statistics 2018-02-16 Puyudi Yang , Cho-Jui Hsieh , Jane-Ling Wang

Temporal Streaming Batch Principal Component Analysis for Time Series Classification

In multivariate time series classification, although current sequence analysis models have excellent classification capabilities, they show significant shortcomings when dealing with long sequence multivariate data, such as prolonged…

Machine Learning · Computer Science 2024-10-29 Enshuo Yan , Huachuan Wang , Weihao Xia

Spiking Neural Networks Through the Lens of Streaming Algorithms

We initiate the study of biological neural networks from the perspective of streaming algorithms. Like computers, human brains suffer from memory limitations which pose a significant obstacle when processing large scale and dynamically…

Distributed, Parallel, and Cluster Computing · Computer Science 2020-10-06 Yael Hitron , Cameron Musco , Merav Parter

Revisiting Small Batch Training for Deep Neural Networks

Modern deep neural network training is typically based on mini-batch stochastic gradient optimization. While the use of large mini-batches increases the available computational parallelism, small batch training has been shown to provide…

Machine Learning · Computer Science 2018-04-23 Dominic Masters , Carlo Luschi

Scaling-up Distributed Processing of Data Streams for Machine Learning

Emerging applications of machine learning in numerous areas involve continuous gathering of and learning from streams of data. Real-time incorporation of streaming data into the learned models is essential for improved inference in these…

Machine Learning · Computer Science 2020-12-01 Matthew Nokleby , Haroon Raja , Waheed U. Bajwa

Continual Learning with Deep Streaming Regularized Discriminant Analysis

Continual learning is increasingly sought after in real world machine learning applications, as it enables learning in a more human-like manner. Conventional machine learning approaches fail to achieve this, as incrementally updating the…

Computer Vision and Pattern Recognition · Computer Science 2023-10-31 Joe Khawand , Peter Hanappe , David Colliaux

Non-Asymptotic Analysis of Stochastic Approximation Algorithms for Streaming Data

We introduce a streaming framework for analyzing stochastic approximation/optimization problems. This streaming framework is analogous to solving optimization problems using time-varying mini-batches that arrive sequentially. We provide…

Machine Learning · Computer Science 2023-04-25 Antoine Godichon-Baggioni , Nicklas Werge , Olivier Wintenberger

Intentional Updates for Streaming Reinforcement Learning

In gradient-based learning, a step size chosen in parameter units does not produce a predictable per-step change in function output. This often leads to instability in the streaming setting (i.e., batch size=1), where stochasticity is not…

Machine Learning · Computer Science 2026-04-22 Arsalan Sharifnassab , Mohamed Elsayed , Kris De Asis , A. Rupam Mahmood , Richard S. Sutton

Efficient Neural Network Training via Subset Pretraining

In training neural networks, it is common practice to use partial gradients computed over batches, mostly very small subsets of the training set. This approach is motivated by the argument that such a partial gradient is close to the true…

Machine Learning · Computer Science 2024-11-25 Jan Spörer , Bernhard Bermeitinger , Tomas Hrycej , Niklas Limacher , Siegfried Handschuh

Online Functional Principal Component Analysis on a Multidimensional Domain

Multidimensional functional data streams arise in diverse scientific fields, yet their analysis poses significant challenges. We propose a novel online framework for functional principal component analysis that enables efficient and…

Methodology · Statistics 2025-05-06 Muye Nanshan , Nan Zhang , Jiguo Cao

Gradient Boosting on Stochastic Data Streams

Boosting is a popular ensemble algorithm that generates more powerful learners by linearly combining base models from a simpler hypothesis class. In this work, we investigate the problem of adapting batch gradient boosting for minimizing…

Machine Learning · Computer Science 2017-03-02 Hanzhang Hu , Wen Sun , Arun Venkatraman , Martial Hebert , J. Andrew Bagnell

Distributed Networked Real-time Learning

Many machine learning algorithms have been developed under the assumption that data sets are already available in batch form. Yet in many application domains data is only available sequentially overtime via compute nodes in different…

Optimization and Control · Mathematics 2020-09-10 Alfredo Garcia , Luochao Wang , Jeff Huang , Lingzhou Hong

Streaming regularization parameter selection via stochastic gradient descent

We propose a framework to perform streaming covariance selection. Our approach employs regularization constraints where a time-varying sparsity parameter is iteratively estimated via stochastic gradient descent. This allows for the…

Machine Learning · Statistics 2016-11-03 Ricardo Pio Monti , Romy Lorenz , Robert Leech , Christoforos Anagnostopoulos , Giovanni Montana

Streaming Graph Algorithms in the Massively Parallel Computation Model

We initiate the study of graph algorithms in the streaming setting on massive distributed and parallel systems inspired by practical data processing systems. The objective is to design algorithms that can efficiently process evolving graphs…

Data Structures and Algorithms · Computer Science 2025-01-20 Artur Czumaj , Gopinath Mishra , Anish Mukherjee

Lifelong Machine Learning with Deep Streaming Linear Discriminant Analysis

When an agent acquires new information, ideally it would immediately be capable of using that information to understand its environment. This is not possible using conventional deep neural networks, which suffer from catastrophic forgetting…

Machine Learning · Computer Science 2020-04-20 Tyler L. Hayes , Christopher Kanan

BADM: Batch ADMM for Deep Learning

Stochastic gradient descent-based algorithms are widely used for training deep neural networks but often suffer from slow convergence. To address the challenge, we leverage the framework of the alternating direction method of multipliers…

Machine Learning · Computer Science 2025-02-03 Ouya Wang , Shenglong Zhou , Geoffrey Ye Li

Streaming Deep Reinforcement Learning Finally Works

Natural intelligence processes experience as a continuous stream, sensing, acting, and learning moment-by-moment in real time. Streaming learning, the modus operandi of classic reinforcement learning (RL) algorithms like Q-learning and TD,…

Machine Learning · Computer Science 2024-12-09 Mohamed Elsayed , Gautham Vasan , A. Rupam Mahmood

Adversarial Robustness of Streaming Algorithms through Importance Sampling

In this paper, we introduce adversarially robust streaming algorithms for central machine learning and algorithmic tasks, such as regression and clustering, as well as their more general counterparts, subspace embedding, low-rank…

Machine Learning · Computer Science 2021-10-27 Vladimir Braverman , Avinatan Hassidim , Yossi Matias , Mariano Schain , Sandeep Silwal , Samson Zhou