Related papers: Overparameterization from Computational Constraint…

Over-parameterization and Adversarial Robustness in Neural Networks: An Overview and Empirical Analysis

Thanks to their extensive capacity, over-parameterized neural networks exhibit superior predictive capabilities and generalization. However, having a large parameter space is considered one of the main suspects of the neural networks'…

Machine Learning · Computer Science 2025-12-18 Srishti Gupta, Zhang Chen, Luca Demetrio, Xiaoyi Feng, Zhaoqiang Xia, Antonio Emanuele Cinà, Maura Pintor, Luca Oneto, Ambra Demontis, Battista Biggio, Fabio Roli

Distributed Training and Optimization Of Neural Networks

Deep learning models are yielding increasingly better performance thanks to multiple factors. To be successful, a model may have a large number of parameters or a complex architecture and be trained on a large dataset. This leads to large…

Machine Learning · Computer Science 2022-12-20 Jean-Roch Vlimant, Junqi Yin

Adversarially Robust Learning Could Leverage Computational Hardness

Over recent years, devising classification algorithms that are robust to adversarial perturbations has emerged as a challenging problem. In particular, deep neural nets (DNNs) seem to be susceptible to small imperceptible changes over test…

Machine Learning · Computer Science 2019-12-20 Sanjam Garg, Somesh Jha, Saeed Mahloujifar, Mohammad Mahmoody

Deep Network Approximation in Terms of Intrinsic Parameters

One of the arguments to explain the success of deep learning is the powerful approximation capacity of deep neural networks. Such capacity is generally accompanied by the explosive growth of the number of parameters, which, in turn, leads…

Machine Learning · Computer Science 2022-09-15 Zuowei Shen, Haizhao Yang, Shijun Zhang

The curse of overparametrization in adversarial training: Precise analysis of robust generalization for random features regression

Successful deep learning models often involve training neural network architectures that contain more parameters than the number of training samples. Such overparametrized models have been extensively studied in recent years, and the…

Machine Learning · Computer Science 2024-02-02 Hamed Hassani, Adel Javanmard

Learning with a Strong Adversary

The robustness of neural networks to intended perturbations has recently attracted significant attention. In this paper, we propose a new method, learning with a strong adversary, that learns robust classifiers from supervised data.…

Machine Learning · Computer Science 2016-01-19 Ruitong Huang, Bing Xu, Dale Schuurmans, Csaba Szepesvari

State space models, emergence, and ergodicity: How many parameters are needed for stable predictions?

How many parameters are required for a model to execute a given task? It has been argued that large language models, pre-trained via self-supervised learning, exhibit emergent capabilities such as multi-step reasoning as their number of…

Machine Learning · Computer Science 2024-09-23 Ingvar Ziemann, Nikolai Matni, George J. Pappas

How Does Overparameterization Affect Features?

Overparameterization, the condition where models have more parameters than necessary to fit their training loss, is a crucial factor for the success of deep learning. However, the characteristics of the features learned by overparameterized…

Machine Learning · Computer Science 2024-07-02 Ahmet Cagri Duzgun, Samy Jelassi, Yuanzhi Li

Scaling Trends in Language Model Robustness

Increasing model size has unlocked a dazzling array of capabilities in modern language models. At the same time, even frontier models remain vulnerable to jailbreaks and prompt injections, despite concerted efforts to make them robust. As…

Machine Learning · Computer Science 2025-06-06 Nikolaus Howe, Ian McKenzie, Oskar Hollinsworth, Michał Zajac, Tom Tseng, Aaron Tucker, Pierre-Luc Bacon, Adam Gleave

Parameterized Machine Learning for High-Energy Physics

We investigate a new structure for machine learning classifiers applied to problems in high-energy physics by expanding the inputs to include not only measured features but also physics parameters. The physics parameters represent a…

High Energy Physics - Experiment · Physics 2016-05-25 Pierre Baldi, Kyle Cranmer, Taylor Faucett, Peter Sadowski, Daniel Whiteson

Strength in Numbers: Trading-off Robustness and Computation via Adversarially-Trained Ensembles

While deep learning has led to remarkable results on a number of challenging problems, researchers have discovered a vulnerability of neural networks in adversarial settings, where small but carefully chosen perturbations to the input can…

Neural and Evolutionary Computing · Computer Science 2018-11-26 Edward Grefenstette, Robert Stanforth, Brendan O'Donoghue, Jonathan Uesato, Grzegorz Swirszcz, Pushmeet Kohli

Adversarial examples from computational constraints

Why are classifiers in high dimension vulnerable to "adversarial" perturbations? We show that it is likely not due to information theoretic limitations, but rather it could be due to computational constraints. First we prove that, for a…

Machine Learning · Statistics 2018-05-28 Sébastien Bubeck, Eric Price, Ilya Razenshteyn

Empirical Guidelines for Deploying LLMs onto Resource-constrained Edge Devices

The scaling laws have become the de facto guidelines for designing large language models (LLMs), but they were studied under the assumption of unlimited computing resources for both training and inference. As LLMs are increasingly used as…

Machine Learning · Computer Science 2024-10-03 Ruiyang Qin, Dancheng Liu, Chenhui Xu, Zheyu Yan, Zhaoxuan Tan, Zhenge Jia, Amir Nassereldine, Jiajie Li, Meng Jiang, Ahmed Abbasi, Jinjun Xiong, Yiyu Shi

Overparametrization improves robustness against adversarial attacks: A replication study

Overparametrization has become a de facto standard in machine learning. Despite numerous efforts, our understanding of how and where overparametrization helps model accuracy and robustness is still limited. To this end, here we conduct an…

Machine Learning · Computer Science 2022-02-22 Ali Borji

Learning Compact Neural Networks with Deep Overparameterised Multitask Learning

Compact neural network offers many benefits for real-world applications. However, it is usually challenging to train the compact neural networks with small parameter sizes and low computational costs to achieve the same or better model…

Machine Learning · Computer Science 2023-08-28 Shen Ren, Haosen Shi

Constraining the Parameters of High-Dimensional Models with Active Learning

Constraining the parameters of physical models with $>5-10$ parameters is a widespread problem in fields like particle physics and astronomy. The generation of data to explore this parameter space often requires large amounts of…

Machine Learning · Computer Science 2019-11-26 Sascha Caron, Tom Heskes, Sydney Otten, Bob Stienen

Towards Sample-efficient Overparameterized Meta-learning

An overarching goal in machine learning is to build a generalizable model with few samples. To this end, overparameterization has been the subject of immense interest to explain the generalization ability of deep nets even when the size of…

Machine Learning · Computer Science 2022-01-19 Yue Sun, Adhyyan Narang, Halil Ibrahim Gulluk, Samet Oymak, Maryam Fazel

Empirical Study of the Benefits of Overparameterization in Learning Latent Variable Models

One of the most surprising and exciting discoveries in supervised learning was the benefit of overparameterization (i.e. training a very large model) to improving the optimization landscape of a problem, with minimal effect on statistical…

Machine Learning · Statistics 2020-07-17 Rares-Darius Buhai, Yoni Halpern, Yoon Kim, Andrej Risteski, David Sontag

Computational Limitations in Robust Classification and Win-Win Results

We continue the study of statistical/computational tradeoffs in learning robust classifiers, following the recent work of Bubeck, Lee, Price and Razenshteyn who showed examples of classification tasks where (a) an efficient robust…

Machine Learning · Statistics 2019-06-06 Akshay Degwekar, Preetum Nakkiran, Vinod Vaikuntanathan

Refined bounds for algorithm configuration: The knife-edge of dual class approximability

Automating algorithm configuration is growing increasingly necessary as algorithms come with more and more tunable parameters. It is common to tune parameters using machine learning, optimizing performance metrics such as runtime and…

Artificial Intelligence · Computer Science 2020-12-25 Maria-Florina Balcan, Tuomas Sandholm, Ellen Vitercik