Related papers: Extractor-Based Time-Space Lower Bounds for Learni…

Memory-Sample Lower Bounds for Learning Parity with Noise

In this work, we show, for the well-studied problem of learning parity under noise, where a learner tries to learn $x=(x_1,\ldots,x_n) \in \{0,1\}^n$ from a stream of random linear equations over $\mathrm{F}_2$ that are correct with…

Machine Learning · Computer Science 2021-07-07 Sumegha Garg , Pravesh K. Kothari , Pengda Liu , Ran Raz

Fast Learning Requires Good Memory: A Time-Space Lower Bound for Parity Learning

We prove that any algorithm for learning parities requires either a memory of quadratic size or an exponential number of samples. This proves a recent conjecture of Steinhardt, Valiant and Wager and shows that for some learning problems a…

Machine Learning · Computer Science 2016-02-17 Ran Raz

Memory-Sample Tradeoffs for Linear Regression with Small Error

We consider the problem of performing linear regression over a stream of $d$-dimensional examples, and show that any algorithm that uses a subquadratic amount of memory exhibits a slower rate of convergence than can be achieved without…

Machine Learning · Computer Science 2020-10-13 Vatsal Sharan , Aaron Sidford , Gregory Valiant

Strong Memory Lower Bounds for Learning Natural Models

We give lower bounds on the amount of memory required by one-pass streaming algorithms for solving several natural learning problems. In a setting where examples lie in $\{0,1\}^d$ and the optimal classifier can be encoded using $\kappa$…

Machine Learning · Computer Science 2022-06-13 Gavin Brown , Mark Bun , Adam Smith

Tight Time-Space Lower Bounds for Constant-Pass Learning

In his breakthrough paper, Raz showed that any parity learning algorithm requires either quadratic memory or an exponential number of samples [FOCS'16, JACM'19]. A line of work that followed extended this result to a large class of learning…

Machine Learning · Computer Science 2023-10-13 Xin Lyu , Avishay Tal , Hongxun Wu , Junzhao Yang

Time-Space Tradeoffs for Learning from Small Test Spaces: Learning Low Degree Polynomial Functions

We develop an extension of recently developed methods for obtaining time-space tradeoff lower bounds for problems of learning from random test samples to handle the situation where the space of tests is signficantly smaller than the space…

Machine Learning · Computer Science 2017-08-10 Paul Beame , Shayan Oveis Gharan , Xin Yang

Trade-offs in Data Memorization via Strong Data Processing Inequalities

Recent research demonstrated that training large language models involves memorization of a significant fraction of training data. Such memorization can lead to privacy violations when training on sensitive user data and thus motivates the…

Machine Learning · Computer Science 2025-10-29 Vitaly Feldman , Guy Kornowski , Xin Lyu

Learning Internal Representations (COLT 1995)

Probably the most important problem in machine learning is the preliminary biasing of a learner's hypothesis space so that it is small enough to ensure good generalisation from reasonable training sets, yet large enough that it contains a…

Machine Learning · Computer Science 2019-12-20 Jonathan Baxter

Optimal Learners for Multiclass Problems

The fundamental theorem of statistical learning states that for binary classification problems, any Empirical Risk Minimization (ERM) learning rule has close to optimal sample complexity. In this paper we seek for a generic optimal learner…

Machine Learning · Computer Science 2014-05-13 Amit Daniely , Shai Shalev-Shwartz

Memory-Sample Lower Bounds for Learning with Classical-Quantum Hybrid Memory

In a work by Raz (J. ACM and FOCS 16), it was proved that any algorithm for parity learning on $n$ bits requires either $\Omega(n^2)$ bits of classical memory or an exponential number (in~$n$) of random samples. A line of recent works…

Quantum Physics · Physics 2023-03-02 Qipeng Liu , Ran Raz , Wei Zhan

Episodic memory for continual model learning

Both the human brain and artificial learning agents operating in real-world or comparably complex environments are faced with the challenge of online model selection. In principle this challenge can be overcome: hierarchical Bayesian…

Machine Learning · Computer Science 2017-12-05 David G. Nagy , Gergő Orbán

Toward Attribute Efficient Learning Algorithms

We make progress on two important problems regarding attribute efficient learnability. First, we give an algorithm for learning decision lists of length $k$ over $n$ variables using $2^{\tilde{O}(k^{1/3})} \log n$ examples and time…

Machine Learning · Computer Science 2007-05-23 Adam R. Klivans , Rocco A. Servedio

Linear-Sample Learning of Low-Rank Distributions

Many latent-variable applications, including community detection, collaborative filtering, genomic analysis, and NLP, model data as generated by low-rank matrices. Yet despite considerable research, except for very special cases, the number…

Machine Learning · Computer Science 2020-10-02 Ayush Jain , Alon Orlitsky

Improve Long-term Memory Learning Through Rescaling the Error Temporally

This paper studies the error metric selection for long-term memory learning in sequence modelling. We examine the bias towards short-term memory in commonly used errors, including mean absolute/squared error. Our findings show that all…

Machine Learning · Computer Science 2023-07-24 Shida Wang , Zhanglu Yan

Learning Model Bias

In this paper the problem of {\em learning} appropriate domain-specific bias is addressed. It is shown that this can be achieved by learning many related tasks from the same domain, and a theorem is given bounding the number tasks that must…

Machine Learning · Computer Science 2019-11-15 Jonathan Baxter

Theoretical bounds on estimation error for meta-learning

Machine learning models have traditionally been developed under the assumption that the training and test distributions match exactly. However, recent success in few-shot learning and related problems are encouraging signs that these models…

Machine Learning · Statistics 2020-10-15 James Lucas , Mengye Ren , Irene Kameni , Toniann Pitassi , Richard Zemel

On the Complexity of Learning with Kernels

A well-recognized limitation of kernel learning is the requirement to handle a kernel matrix, whose size is quadratic in the number of training examples. Many methods have been proposed to reduce this computational cost, mostly by using a…

Machine Learning · Computer Science 2014-11-06 Nicolò Cesa-Bianchi , Yishay Mansour , Ohad Shamir

The Element Extraction Problem and the Cost of Determinism and Limited Adaptivity in Linear Queries

Two widely-used computational paradigms for sublinear algorithms are using linear measurements to perform computations on a high dimensional input and using structured queries to access a massive input. Typically, algorithms in the former…

Computational Complexity · Computer Science 2021-07-14 Amit Chakrabarti , Manuel Stoeckl

Learning Halfspaces and Neural Networks with Random Initialization

We study non-convex empirical risk minimization for learning halfspaces and neural networks. For loss functions that are $L$-Lipschitz continuous, we present algorithms to learn halfspaces and multi-layer neural networks that achieve…

Machine Learning · Computer Science 2015-11-26 Yuchen Zhang , Jason D. Lee , Martin J. Wainwright , Michael I. Jordan

A Non-Binary Associative Memory with Exponential Pattern Retrieval Capacity and Iterative Learning: Extended Results

We consider the problem of neural association for a network of non-binary neurons. Here, the task is to first memorize a set of patterns using a network of neurons whose states assume values from a finite number of integer levels. Later,…

Neural and Evolutionary Computing · Computer Science 2013-02-18 Amir Hesam Salavati , K. Raj Kumar , Amin Shokrollahi