English
Related papers

Related papers: Stop memorizing: A data-dependent regularization f…

200 papers

We examine the role of memorization in deep learning, drawing connections to capacity, generalization, and adversarial robustness. While deep networks are capable of memorizing noise data, our results suggest that they tend to prioritize…

Deep neural networks (DNNs) experience significant performance degradation when processing noisy labels, primarily due to overfitting on mislabeled data. Current mainstream approaches attempt to mitigate this issue by passively filtering…

Computer Vision and Pattern Recognition · Computer Science 2026-05-21 Ningkang Peng , Jingyang Mao , Xiaoqian Peng , Peirong Ma , Xichen Yang , Weiguang Qu , Yanhui Gu

Deep learning (DL) methods have emerged as a powerful tool for the inversion of geophysical data. When applied to field data, these models often struggle without additional fine-tuning of the network. This is because they are built on the…

Machine Learning · Computer Science 2024-10-29 Koustav Ghosal , Arun Singh , Samir Malakar , Shalivahan Srivastava , Deepak Gupta

We investigate the effect of the dimensionality of the representations learned in Deep Neural Networks (DNNs) on their robustness to input perturbations, both adversarial and random. To achieve low dimensionality of learned representations,…

Machine Learning · Computer Science 2020-02-20 Amartya Sanyal , Varun Kanade , Philip H. S. Torr , Puneet K. Dokania

Deep neural networks (DNNs) trained for image denoising are able to generate high-quality samples with score-based reverse diffusion algorithms. These impressive capabilities seem to imply an escape from the curse of dimensionality, but…

Computer Vision and Pattern Recognition · Computer Science 2025-03-03 Zahra Kadkhodaie , Florentin Guth , Eero P. Simoncelli , Stéphane Mallat

We consider learning deep neural networks (DNNs) that consist of low-precision weights and activations for efficient inference of fixed-point operations. In training low-precision networks, gradient descent in the backward pass is performed…

Computer Vision and Pattern Recognition · Computer Science 2020-05-26 Yoojin Choi , Mostafa El-Khamy , Jungwon Lee

Geometry-aware optimization algorithms, such as Muon, have achieved remarkable success in training deep neural networks (DNNs). These methods leverage the underlying geometry of DNNs by selecting appropriate norms for different layers and…

Machine Learning · Computer Science 2026-02-04 Jie Hao , Xiaochuan Gong , Jie Xu , Zhengdao Wang , Mingrui Liu

Deep Neural Networks (DNNs) are widely used for their ability to effectively approximate large classes of functions. This flexibility, however, makes the strict enforcement of constraints on DNNs an open problem. Here we present a framework…

Machine Learning · Computer Science 2023-02-10 Eric Marcus , Ray Sheombarsing , Jan-Jakob Sonke , Jonas Teuwen

The classical statistical learning theory implies that fitting too many parameters leads to overfitting and poor performance. That modern deep neural networks generalize well despite a large number of parameters contradicts this finding and…

Machine Learning · Statistics 2022-10-18 Masaaki Imaizumi , Johannes Schmidt-Hieber

Many recent efforts have been devoted to designing sophisticated deep learning structures, obtaining revolutionary results on benchmark datasets. The success of these deep learning methods mostly relies on an enormous volume of labeled…

Computer Vision and Pattern Recognition · Computer Science 2015-10-20 Jiaji Huang , Qiang Qiu , Robert Calderbank , Guillermo Sapiro

Graph Neural Networks (GNN) can capture the geometric properties of neural representations in EEG data. Here we utilise those to study how reinforcement-based motor learning affects neural activity patterns during motor planning, leveraging…

Machine Learning · Computer Science 2024-11-01 Federico Nardi , Jinpei Han , Shlomi Haar , A. Aldo Faisal

Deep neural networks (DNNs) are known to be vulnerable to adversarial geometric transformation. This paper aims to verify the robustness of large-scale DNNs against the combination of multiple geometric transformations with a provable…

Machine Learning · Computer Science 2023-04-03 Fu Wang , Peipei Xu , Wenjie Ruan , Xiaowei Huang

Embedding high-dimensional data onto a low-dimensional manifold is of both theoretical and practical value. In this paper, we propose to combine deep neural networks (DNN) with mathematics-guided embedding rules for high-dimensional data…

Machine Learning · Computer Science 2022-08-19 Zixia Zhou , Xinrui Zu , Yuanyuan Wang , Boudewijn P. F. Lelieveldt , Qian Tao

In this paper, we introduce the algorithms of Orthogonal Deep Neural Networks (OrthDNNs) to connect with recent interest of spectrally regularized deep learning methods. OrthDNNs are theoretically motivated by generalization analysis of…

Machine Learning · Computer Science 2019-10-16 Kui Jia , Shuai Li , Yuxin Wen , Tongliang Liu , Dacheng Tao

Deep neural networks often require copious amount of labeled-data to train their scads of parameters. Training larger and deeper networks is hard without appropriate regularization, particularly while using a small dataset. Laterally,…

Computer Vision and Pattern Recognition · Computer Science 2019-05-31 Xiang Xu , Xiong Zhou , Ragav Venkatesan , Gurumurthy Swaminathan , Orchid Majumder

Despite their impressive performance, Deep Neural Networks (DNNs) typically underperform Gradient Boosting Trees (GBTs) on many tabular-dataset learning tasks. We propose that applying a different regularization coefficient to each weight…

Machine Learning · Statistics 2018-10-25 Ira Shavitt , Eran Segal

In this work, we describe a new approach that uses deep neural networks (DNN) to obtain regularization parameters for solving inverse problems. We consider a supervised learning approach, where a network is trained to approximate the…

Numerical Analysis · Mathematics 2021-04-15 Babak Maboudi Afkham , Julianne Chung , Matthias Chung

The Deep Material Network (DMN) has emerged as a powerful framework for multiscale materials modeling, enabling efficient and accurate prediction of material behavior across different length scales. Unlike conventional data-driven…

Computational Engineering, Finance, and Science · Computer Science 2026-03-23 Ting-Ju Wei , Wen-Ning Wan , Chuin-Shan Chen

Recent studies have shown that deep neural networks (DNNs) perform significantly better than shallow networks and Gaussian mixture models (GMMs) on large vocabulary speech recognition tasks. In this paper, we argue that the improved…

Machine Learning · Computer Science 2018-12-06 Dong Yu , Michael L. Seltzer , Jinyu Li , Jui-Ting Huang , Frank Seide

Deep neural networks (DNNs) have set benchmarks on a wide array of supervised learning tasks. Trained DNNs, however, often lack robustness to minor adversarial perturbations to the input, which undermines their true practicality. Recent…

Machine Learning · Computer Science 2018-11-20 Farzan Farnia , Jesse M. Zhang , David Tse
‹ Prev 1 2 3 10 Next ›