Related papers: How to sketch a learning algorithm

Descent-to-Delete: Gradient-Based Methods for Machine Unlearning

We study the data deletion problem for convex models. By leveraging techniques from convex optimization and reservoir sampling, we give the first data deletion algorithms that are able to handle an arbitrarily long sequence of adversarial…

Machine Learning · Statistics 2020-07-07 Seth Neel, Aaron Roth, Saeed Sharifi-Malvajerdi

Sketching for First Order Method: Efficient Algorithm for Low-Bandwidth Channel and Vulnerability

Sketching is one of the most fundamental tools in large-scale machine learning. It enables runtime and memory saving via randomly compressing the original large problem into lower dimensions. In this paper, we propose a novel sketching…

Machine Learning · Computer Science 2023-06-08 Zhao Song, Yitan Wang, Zheng Yu, Lichen Zhang

Compressive Learning for Semi-Parametric Models

In the compressive learning theory, instead of solving a statistical learning problem from the input data, a so-called sketch is computed from the data prior to learning. The sketch has to capture enough information to solve the problem…

Machine Learning · Statistics 2019-10-23 Michael P. Sheehan, Antoine Gonon, Mike E. Davies

SketchOGD: Memory-Efficient Continual Learning

When machine learning models are trained continually on a sequence of tasks, they are often liable to forget what they learned on previous tasks--a phenomenon known as catastrophic forgetting. Proposed solutions to catastrophic forgetting…

Machine Learning · Computer Science 2025-12-18 Youngjae Min, Benjamin Wright, Jeremy Bernstein, Navid Azizan

Sketching Datasets for Large-Scale Learning (long version)

This article considers "compressive learning," an approach to large-scale machine learning where datasets are massively compressed before learning (e.g., clustering, classification, or regression) is performed. In particular, a "sketch" is…

Machine Learning · Statistics 2021-06-28 Rémi Gribonval, Antoine Chatalic, Nicolas Keriven, Vincent Schellekens, Laurent Jacques, Philip Schniter

Dynamic Latent Separation for Deep Learning

A core problem in machine learning is to learn expressive latent variables for model prediction on complex data that involves multiple sub-components in a flexible and interpretable fashion. Here, we develop an approach that improves…

Machine Learning · Computer Science 2024-02-13 Yi-Lin Tuan, Zih-Yun Chiu, William Yang Wang

Forget Unlearning: Towards True Data-Deletion in Machine Learning

Unlearning algorithms aim to remove deleted data's influence from trained models at a cost lower than full retraining. However, prior guarantees of unlearning in literature are flawed and don't protect the privacy of deleted records. We…

Machine Learning · Statistics 2023-02-15 Rishav Chourasia, Neil Shah

Accelerating Deep Learning with Dynamic Data Pruning

Deep learning's success has been attributed to the training of large, overparameterized models on massive amounts of data. As this trend continues, model training has become prohibitively costly, requiring access to powerful computing…

Machine Learning · Computer Science 2021-11-25 Ravi S Raju, Kyle Daruwalla, Mikko Lipasti

Sketching Curvature for Efficient Out-of-Distribution Detection for Deep Neural Networks

In order to safely deploy Deep Neural Networks (DNNs) within the perception pipelines of real-time decision making systems, there is a need for safeguards that can detect out-of-training-distribution (OoD) inputs both efficiently and…

Machine Learning · Computer Science 2021-02-26 Apoorva Sharma, Navid Azizan, Marco Pavone

From Adaptive Query Release to Machine Unlearning

We formalize the problem of machine unlearning as design of efficient unlearning algorithms corresponding to learning algorithms which perform a selection of adaptive queries from structured query classes. We give efficient unlearning…

Machine Learning · Computer Science 2023-07-24 Enayat Ullah, Raman Arora

Statistical properties of sketching algorithms

Sketching is a probabilistic data compression technique that has been largely developed in the computer science community. Numerical operations on big datasets can be intolerably slow; sketching algorithms address this issue by generating a…

Methodology · Statistics 2019-04-04 Daniel Ahfock, William J. Astle, Sylvia Richardson

Fast AI Model Partition for Split Learning over Edge Networks

Split learning (SL) is a distributed learning paradigm that can enable computation-intensive artificial intelligence (AI) applications by partitioning AI models between mobile devices and edge servers…

Machine Learning · Computer Science 2026-04-15 Zuguang Li, Wen Wu, Shaohua Wu, Xuemin Shen

Sketching for Large-Scale Learning of Mixture Models

Learning parameters from voluminous data can be prohibitive in terms of memory and computational requirements. We propose a "compressive learning" framework where we estimate model parameters from a sketch of the training data. This sketch…

Machine Learning · Computer Science 2017-05-08 Nicolas Keriven, Anthony Bourrier, Rémi Gribonval, Patrick Pérez

On Dynamic Graph Algorithms with Predictions

We study dynamic algorithms in the model of algorithms with predictions. We assume the algorithm is given imperfect predictions regarding future updates, and we ask how such predictions can be used to improve the running time. This can be…

Data Structures and Algorithms · Computer Science 2023-12-11 Jan van den Brand, Sebastian Forster, Yasamin Nazari, Adam Polak

Online Algorithms with Limited Data Retention

We introduce a model of online algorithms subject to strict constraints on data retention. An online learning algorithm encounters a stream of data points, one per round, generated by some stationary process. Crucially, each data point can…

Machine Learning · Computer Science 2024-04-18 Nicole Immorlica, Brendan Lucier, Markus Mobius, James Siderius

Large-scale Dataset Pruning with Dynamic Uncertainty

The state of the art of many learning tasks, e.g., image classification, is advanced by collecting larger datasets and then training larger models on them. As the outcome, the increasing computational cost is becoming unaffordable. In this…

Machine Learning · Computer Science 2024-06-17 Muyang He, Shuo Yang, Tiejun Huang, Bo Zhao

A Communication-Efficient Distributed Gradient Clipping Algorithm for Training Deep Neural Networks

In distributed training of deep neural networks, people usually run Stochastic Gradient Descent (SGD) or its variants on each machine and communicate with other machines periodically. However, SGD might converge slowly in training some deep…

Machine Learning · Computer Science 2022-10-14 Mingrui Liu, Zhenxun Zhuang, Yunwei Lei, Chunyang Liao

A One-Pass Private Sketch for Most Machine Learning Tasks

Differential privacy (DP) is a compelling privacy definition that explains the privacy-utility tradeoff via formal, provable guarantees. Inspired by recent progress toward general-purpose data release algorithms, we propose a private…

Data Structures and Algorithms · Computer Science 2020-06-17 Benjamin Coleman, Anshumali Shrivastava

Adaptive Machine Unlearning

Data deletion algorithms aim to remove the influence of deleted data points from trained models at a cheaper computational cost than fully retraining those models. However, for sequences of deletions, most prior work in the non-convex…

Machine Learning · Computer Science 2021-06-09 Varun Gupta, Christopher Jung, Seth Neel, Aaron Roth, Saeed Sharifi-Malvajerdi, Chris Waites

Predicting Training Time Without Training

We tackle the problem of predicting the number of optimization steps that a pre-trained deep network needs to converge to a given value of the loss function. To do so, we leverage the fact that the training dynamics of a deep network during…

Machine Learning · Computer Science 2020-08-31 Luca Zancato, Alessandro Achille, Avinash Ravichandran, Rahul Bhotika, Stefano Soatto