Related papers: Stop memorizing: A data-dependent regularization f…

A Closer Look at Memorization in Deep Networks

We examine the role of memorization in deep learning, drawing connections to capacity, generalization, and adversarial robustness. While deep networks are capable of memorizing noise data, our results suggest that they tend to prioritize…

Machine Learning · Statistics 2017-07-04 Devansh Arpit , Stanisław Jastrzębski , Nicolas Ballas , David Krueger , Emmanuel Bengio , Maxinder S. Kanwal , Tegan Maharaj , Asja Fischer , Aaron Courville , Yoshua Bengio , Simon Lacoste-Julien

GAMR: Geometric-Aware Manifold Regularization with Virtual Outlier Synthesis for Learning with Noisy Labels

Deep neural networks (DNNs) experience significant performance degradation when processing noisy labels, primarily due to overfitting on mislabeled data. Current mainstream approaches attempt to mitigate this issue by passively filtering…

Computer Vision and Pattern Recognition · Computer Science 2026-05-21 Ningkang Peng , Jingyang Mao , Xiaoqian Peng , Peirong Ma , Xichen Yang , Weiguang Qu , Yanhui Gu

Enhancing Deep Learning based RMT Data Inversion using Gaussian Random Field

Deep learning (DL) methods have emerged as a powerful tool for the inversion of geophysical data. When applied to field data, these models often struggle without additional fine-tuning of the network. This is because they are built on the…

Machine Learning · Computer Science 2024-10-29 Koustav Ghosal , Arun Singh , Samir Malakar , Shalivahan Srivastava , Deepak Gupta

Robustness via Deep Low-Rank Representations

We investigate the effect of the dimensionality of the representations learned in Deep Neural Networks (DNNs) on their robustness to input perturbations, both adversarial and random. To achieve low dimensionality of learned representations,…

Machine Learning · Computer Science 2020-02-20 Amartya Sanyal , Varun Kanade , Philip H. S. Torr , Puneet K. Dokania

Generalization in diffusion models arises from geometry-adaptive harmonic representations

Deep neural networks (DNNs) trained for image denoising are able to generate high-quality samples with score-based reverse diffusion algorithms. These impressive capabilities seem to imply an escape from the curse of dimensionality, but…

Computer Vision and Pattern Recognition · Computer Science 2025-03-03 Zahra Kadkhodaie , Florentin Guth , Eero P. Simoncelli , Stéphane Mallat

Learning Sparse Low-Precision Neural Networks With Learnable Regularization

We consider learning deep neural networks (DNNs) that consist of low-precision weights and activations for efficient inference of fixed-point operations. In training low-precision networks, gradient descent in the backward pass is performed…

Computer Vision and Pattern Recognition · Computer Science 2020-05-26 Yoojin Choi , Mostafa El-Khamy , Jungwon Lee

Noise-Adaptive Layerwise Learning Rates: Accelerating Geometry-Aware Optimization for Deep Neural Network Training

Geometry-aware optimization algorithms, such as Muon, have achieved remarkable success in training deep neural networks (DNNs). These methods leverage the underlying geometry of DNNs by selecting appropriate norms for different layers and…

Machine Learning · Computer Science 2026-02-04 Jie Hao , Xiaochuan Gong , Jie Xu , Zhengdao Wang , Mingrui Liu

Constrained Empirical Risk Minimization: Theory and Practice

Deep Neural Networks (DNNs) are widely used for their ability to effectively approximate large classes of functions. This flexibility, however, makes the strict enforcement of constraints on DNNs an open problem. Here we present a framework…

Machine Learning · Computer Science 2023-02-10 Eric Marcus , Ray Sheombarsing , Jan-Jakob Sonke , Jonas Teuwen

On generalization bounds for deep networks based on loss surface implicit regularization

The classical statistical learning theory implies that fitting too many parameters leads to overfitting and poor performance. That modern deep neural networks generalize well despite a large number of parameters contradicts this finding and…

Machine Learning · Statistics 2022-10-18 Masaaki Imaizumi , Johannes Schmidt-Hieber

Geometry-aware Deep Transform

Many recent efforts have been devoted to designing sophisticated deep learning structures, obtaining revolutionary results on benchmark datasets. The success of these deep learning methods mostly relies on an enormous volume of labeled…

Computer Vision and Pattern Recognition · Computer Science 2015-10-20 Jiaji Huang , Qiang Qiu , Robert Calderbank , Guillermo Sapiro

Graph Neural Networks Uncover Geometric Neural Representations in Reinforcement-Based Motor Learning

Graph Neural Networks (GNN) can capture the geometric properties of neural representations in EEG data. Here we utilise those to study how reinforcement-based motor learning affects neural activity patterns during motor planning, leveraging…

Machine Learning · Computer Science 2024-11-01 Federico Nardi , Jinpei Han , Shlomi Haar , A. Aldo Faisal

Towards Verifying the Geometric Robustness of Large-scale Neural Networks

Deep neural networks (DNNs) are known to be vulnerable to adversarial geometric transformation. This paper aims to verify the robustness of large-scale DNNs against the combination of multiple geometric transformations with a provable…

Machine Learning · Computer Science 2023-04-03 Fu Wang , Peipei Xu , Wenjie Ruan , Xiaowei Huang

Deep Recursive Embedding for High-Dimensional Data

Embedding high-dimensional data onto a low-dimensional manifold is of both theoretical and practical value. In this paper, we propose to combine deep neural networks (DNN) with mathematics-guided embedding rules for high-dimensional data…

Machine Learning · Computer Science 2022-08-19 Zixia Zhou , Xinrui Zu , Yuanyuan Wang , Boudewijn P. F. Lelieveldt , Qian Tao

Orthogonal Deep Neural Networks

In this paper, we introduce the algorithms of Orthogonal Deep Neural Networks (OrthDNNs) to connect with recent interest of spectrally regularized deep learning methods. OrthDNNs are theoretically motivated by generalization analysis of…

Machine Learning · Computer Science 2019-10-16 Kui Jia , Shuai Li , Yuxin Wen , Tongliang Liu , Dacheng Tao

$d$-SNE: Domain Adaptation using Stochastic Neighborhood Embedding

Deep neural networks often require copious amount of labeled-data to train their scads of parameters. Training larger and deeper networks is hard without appropriate regularization, particularly while using a small dataset. Laterally,…

Computer Vision and Pattern Recognition · Computer Science 2019-05-31 Xiang Xu , Xiong Zhou , Ragav Venkatesan , Gurumurthy Swaminathan , Orchid Majumder

Regularization Learning Networks: Deep Learning for Tabular Datasets

Despite their impressive performance, Deep Neural Networks (DNNs) typically underperform Gradient Boosting Trees (GBTs) on many tabular-dataset learning tasks. We propose that applying a different regularization coefficient to each weight…

Machine Learning · Statistics 2018-10-25 Ira Shavitt , Eran Segal

Learning Regularization Parameters of Inverse Problems via Deep Neural Networks

In this work, we describe a new approach that uses deep neural networks (DNN) to obtain regularization parameters for solving inverse problems. We consider a supervised learning approach, where a network is trained to approximate the…

Numerical Analysis · Mathematics 2021-04-15 Babak Maboudi Afkham , Julianne Chung , Matthias Chung

Deep Material Network: Overview, applications and current directions

The Deep Material Network (DMN) has emerged as a powerful framework for multiscale materials modeling, enabling efficient and accurate prediction of material behavior across different length scales. Unlike conventional data-driven…

Computational Engineering, Finance, and Science · Computer Science 2026-03-23 Ting-Ju Wei , Wen-Ning Wan , Chuin-Shan Chen

Feature Learning in Deep Neural Networks - Studies on Speech Recognition Tasks

Recent studies have shown that deep neural networks (DNNs) perform significantly better than shallow networks and Gaussian mixture models (GMMs) on large vocabulary speech recognition tasks. In this paper, we argue that the improved…

Machine Learning · Computer Science 2018-12-06 Dong Yu , Michael L. Seltzer , Jinyu Li , Jui-Ting Huang , Frank Seide

Generalizable Adversarial Training via Spectral Normalization

Deep neural networks (DNNs) have set benchmarks on a wide array of supervised learning tasks. Trained DNNs, however, often lack robustness to minor adversarial perturbations to the input, which undermines their true practicality. Recent…

Machine Learning · Computer Science 2018-11-20 Farzan Farnia , Jesse M. Zhang , David Tse