Related papers: Sparse Online Learning via Truncated Gradient

Stabilized Sparse Online Learning for Sparse Data

Stochastic gradient descent (SGD) is commonly used for optimization in large-scale machine learning problems. Langford et al. (2009) introduce a sparse online learning method to induce sparsity via truncated gradient. With high-dimensional…

Machine Learning · Statistics 2017-05-10 Yuting Ma, Tian Zheng

Quantum Algorithm for Sparse Online Learning with Truncated Gradient Descent

Logistic regression, the Support Vector Machine (SVM), and least squares are well-studied methods in the statistical and computer science community, with various practical applications. High-dimensional data arriving on a real-time basis…

Machine Learning · Computer Science 2024-11-07 Debbie Lim, Yixian Qiu, Patrick Rebentrost, Qisheng Wang

Learning where to learn: Gradient sparsity in meta and continual learning

Finding neural network weights that generalize well from small datasets is difficult. A promising approach is to learn a weight initialization such that a small number of weight changes results in low generalization error. We show that this…

Machine Learning · Computer Science 2021-10-28 Johannes von Oswald, Dominic Zhao, Seijin Kobayashi, Simon Schug, Massimo Caccia, Nicolas Zucchet, João Sacramento

Adaptive Gradient Sparsification for Efficient Federated Learning: An Online Learning Approach

Federated learning (FL) is an emerging technique for training machine learning models using geographically dispersed data collected by local entities. It includes local computation and synchronization steps. To reduce the communication…

Machine Learning · Computer Science 2020-03-23 Pengchao Han, Shiqiang Wang, Kin K. Leung

Learning Large Scale Sparse Models

In this work, we consider learning sparse models in large scale settings, where the number of samples and the feature dimension can grow as large as millions or billions. Two immediate issues occur under such a challenging scenario: (i)…

Machine Learning · Statistics 2023-01-31 Atul Dhingra, Jie Shen, Nicholas Kleene

CONGO: Compressive Online Gradient Optimization

We address the challenge of zeroth-order online convex optimization where the objective function's gradient exhibits sparsity, indicating that only a small number of dimensions possess non-zero gradients. Our aim is to leverage this…

Machine Learning · Computer Science 2025-05-19 Jeremy Carleton, Prathik Vijaykumar, Divyanshu Saxena, Dheeraj Narasimha, Srinivas Shakkottai, Aditya Akella

Sparse learning enabled by constraints on connectivity and function

Sparse connectivity is a hallmark of the brain and a desired property of artificial neural networks. It promotes energy efficiency, simplifies training, and enhances the robustness of network function. Thus, a detailed understanding of how…

Disordered Systems and Neural Networks · Physics 2024-09-10 Mirza M. Junaid Baig, Armen Stepanyants

The Convergence of Sparsified Gradient Methods

Distributed training of massive machine learning models, in particular deep neural networks, via Stochastic Gradient Descent (SGD) is becoming commonplace. Several families of communication-reduction methods, such as quantization,…

Machine Learning · Computer Science 2018-09-28 Dan Alistarh, Torsten Hoefler, Mikael Johansson, Sarit Khirirat, Nikola Konstantinov, Cédric Renggli

Learning Model-Based Sparsity via Projected Gradient Descent

Several convex formulation methods have been proposed previously for statistical estimation with structured sparsity as the prior. These methods often require a carefully tuned regularization parameter, often a cumbersome or heuristic…

Machine Learning · Statistics 2016-03-23 Sohail Bahmani, Petros T. Boufounos, Bhiksha Raj

Online Learning Under A Separable Stochastic Approximation Framework

We propose an online learning algorithm for a class of machine learning models under a separable stochastic approximation framework. The essence of our idea lies in the observation that certain parameters in the models are easier to…

Machine Learning · Computer Science 2023-05-23 Min Gan, Xiang-xiang Su, Guang-yong Chen, Jing Chen

Gradient Sparsification for Communication-Efficient Distributed Optimization

Modern large scale machine learning applications require stochastic optimization algorithms to be implemented on distributed computational architectures. A key bottleneck is the communication overhead for exchanging information such as…

Machine Learning · Computer Science 2017-10-31 Jianqiao Wangni, Jialei Wang, Ji Liu, Tong Zhang

A generalization of regularized dual averaging and its dynamics

Excessive computational cost for learning large data and streaming data can be alleviated by using stochastic algorithms, such as stochastic gradient descent and its variants. Recent advances improve stochastic algorithms on convergence…

Machine Learning · Statistics 2019-09-24 Shih-Kang Chao, Guang Cheng

Communication-efficient Algorithm for Distributed Sparse Learning via Two-way Truncation

We propose a communicationally and computationally efficient algorithm for high-dimensional distributed sparse learning. At each iteration, local machines compute the gradient on local data and the master machine solves one shifted $l_1$…

Machine Learning · Statistics 2017-09-12 Jineng Ren, Jarvis Haupt

Efficient online algorithms for fast-rate regret bounds under sparsity

We consider the online convex optimization problem. In the setting of arbitrary sequences and finite set of parameters, we establish a new fast-rate quantile regret bound. Then we investigate the optimization into the L1-ball by…

Statistics Theory · Mathematics 2018-05-24 Pierre Gaillard, Olivier Wintenberger

Low Complexity Approximate Bayesian Logistic Regression for Sparse Online Learning

Theoretical results show that Bayesian methods can achieve lower bounds on regret for online logistic regression. In practice, however, such techniques may not be feasible especially for very large feature sets. Various approximations that,…

Machine Learning · Computer Science 2021-01-29 Gil I. Shamir, Wojciech Szpankowski

On the Generalization Ability of Online Learning Algorithms for Pairwise Loss Functions

In this paper, we study the generalization properties of online learning based stochastic methods for supervised learning problems where the loss function is dependent on more than one training sample (e.g., metric learning, ranking). We…

Machine Learning · Computer Science 2013-05-14 Purushottam Kar, Bharath K Sriperumbudur, Prateek Jain, Harish C Karnick

Gradient-Variation Online Learning under Generalized Smoothness

Gradient-variation online learning aims to achieve regret guarantees that scale with variations in the gradients of online functions, which has been shown to be crucial for attaining fast convergence in games and robustness in stochastic…

Machine Learning · Computer Science 2024-11-05 Yan-Feng Xie, Peng Zhao, Zhi-Hua Zhou

Grad Queue : A probabilistic framework to reinforce sparse gradients

Informative gradients are often lost in large batch updates. We propose a robust mechanism to reinforce the sparse components within a random batch of data points. A finite queue of online gradients is used to determine their expected…

Machine Learning · Computer Science 2024-04-29 Irfan Mohammad Al Hasib

Centralized and distributed online learning for sparse time-varying optimization

The development of online algorithms to track time-varying systems has drawn a lot of attention in the last years, in particular in the framework of online convex optimization. Meanwhile, sparse time-varying optimization has emerged as a…

Optimization and Control · Mathematics 2020-02-03 Sophie M. Fosson

Learning Sparse Latent Representations for Generator Model

Sparsity is a desirable attribute. It can lead to more efficient and more effective representations compared to the dense model. Meanwhile, learning sparse latent representations has been a challenging problem in the field of computer…

Computer Vision and Pattern Recognition · Computer Science 2022-09-22 Hanao Li, Tian Han