Related papers: Overparameterization from Computational Constraint…

200 papers

Thanks to their extensive capacity, over-parameterized neural networks exhibit superior predictive capabilities and generalization. However, having a large parameter space is considered one of the main suspects of the neural networks'…

Deep learning models are yielding increasingly better performance thanks to multiple factors. To be successful, a model may have a large number of parameters or a complex architecture and be trained on a large dataset. This leads to large…

Machine Learning · Computer Science 2022-12-20 Jean-Roch Vlimant , Junqi Yin

Over recent years, devising classification algorithms that are robust to adversarial perturbations has emerged as a challenging problem. In particular, deep neural nets (DNNs) seem to be susceptible to small imperceptible changes over test…

Machine Learning · Computer Science 2019-12-20 Sanjam Garg , Somesh Jha , Saeed Mahloujifar , Mohammad Mahmoody

One of the arguments to explain the success of deep learning is the powerful approximation capacity of deep neural networks. Such capacity is generally accompanied by the explosive growth of the number of parameters, which, in turn, leads…

Machine Learning · Computer Science 2022-09-15 Zuowei Shen , Haizhao Yang , Shijun Zhang

Successful deep learning models often involve training neural network architectures that contain more parameters than the number of training samples. Such overparametrized models have been extensively studied in recent years, and the…

Machine Learning · Computer Science 2024-02-02 Hamed Hassani , Adel Javanmard

The robustness of neural networks to intended perturbations has recently attracted significant attention. In this paper, we propose a new method, "learning with a strong adversary", that learns robust classifiers from supervised data.…

Machine Learning · Computer Science 2016-01-19 Ruitong Huang , Bing Xu , Dale Schuurmans , Csaba Szepesvari

How many parameters are required for a model to execute a given task? It has been argued that large language models, pre-trained via self-supervised learning, exhibit emergent capabilities such as multi-step reasoning as their number of…

Machine Learning · Computer Science 2024-09-23 Ingvar Ziemann , Nikolai Matni , George J. Pappas

Overparameterization, the condition where models have more parameters than necessary to fit their training loss, is a crucial factor for the success of deep learning. However, the characteristics of the features learned by overparameterized…

Machine Learning · Computer Science 2024-07-02 Ahmet Cagri Duzgun , Samy Jelassi , Yuanzhi Li

Increasing model size has unlocked a dazzling array of capabilities in modern language models. At the same time, even frontier models remain vulnerable to jailbreaks and prompt injections, despite concerted efforts to make them robust. As…

Machine Learning · Computer Science 2025-06-06 Nikolaus Howe , Ian McKenzie , Oskar Hollinsworth , Michał Zajac , Tom Tseng , Aaron Tucker , Pierre-Luc Bacon , Adam Gleave

We investigate a new structure for machine learning classifiers applied to problems in high-energy physics by expanding the inputs to include not only measured features but also physics parameters. The physics parameters represent a…

High Energy Physics - Experiment · Physics 2016-05-25 Pierre Baldi , Kyle Cranmer , Taylor Faucett , Peter Sadowski , Daniel Whiteson

While deep learning has led to remarkable results on a number of challenging problems, researchers have discovered a vulnerability of neural networks in adversarial settings, where small but carefully chosen perturbations to the input can…

Neural and Evolutionary Computing · Computer Science 2018-11-26 Edward Grefenstette , Robert Stanforth , Brendan O'Donoghue , Jonathan Uesato , Grzegorz Swirszcz , Pushmeet Kohli

Why are classifiers in high dimension vulnerable to "adversarial" perturbations? We show that it is likely not due to information theoretic limitations, but rather it could be due to computational constraints. First we prove that, for a…

Machine Learning · Statistics 2018-05-28 Sébastien Bubeck , Eric Price , Ilya Razenshteyn

The scaling laws have become the de facto guidelines for designing large language models (LLMs), but they were studied under the assumption of unlimited computing resources for both training and inference. As LLMs are increasingly used as…

Overparametrization has become a de facto standard in machine learning. Despite numerous efforts, our understanding of how and where overparametrization helps model accuracy and robustness is still limited. To this end, here we conduct an…

Machine Learning · Computer Science 2022-02-22 Ali Borji

Compact neural networks offer many benefits for real-world applications. However, it is usually challenging to train compact neural networks with small parameter sizes and low computational costs to achieve the same or better model…

Machine Learning · Computer Science 2023-08-28 Shen Ren , Haosen Shi

Constraining the parameters of physical models with $>5-10$ parameters is a widespread problem in fields like particle physics and astronomy. The generation of data to explore this parameter space often requires large amounts of…

Machine Learning · Computer Science 2019-11-26 Sascha Caron , Tom Heskes , Sydney Otten , Bob Stienen

An overarching goal in machine learning is to build a generalizable model with few samples. To this end, overparameterization has been the subject of immense interest to explain the generalization ability of deep nets even when the size of…

Machine Learning · Computer Science 2022-01-19 Yue Sun , Adhyyan Narang , Halil Ibrahim Gulluk , Samet Oymak , Maryam Fazel

One of the most surprising and exciting discoveries in supervised learning was the benefit of overparameterization (i.e. training a very large model) to improving the optimization landscape of a problem, with minimal effect on statistical…

Machine Learning · Statistics 2020-07-17 Rares-Darius Buhai , Yoni Halpern , Yoon Kim , Andrej Risteski , David Sontag

We continue the study of statistical/computational tradeoffs in learning robust classifiers, following the recent work of Bubeck, Lee, Price and Razenshteyn who showed examples of classification tasks where (a) an efficient robust…

Machine Learning · Statistics 2019-06-06 Akshay Degwekar , Preetum Nakkiran , Vinod Vaikuntanathan

Automating algorithm configuration is growing increasingly necessary as algorithms come with more and more tunable parameters. It is common to tune parameters using machine learning, optimizing performance metrics such as runtime and…

Artificial Intelligence · Computer Science 2020-12-25 Maria-Florina Balcan , Tuomas Sandholm , Ellen Vitercik