Related papers: Understanding Generalization through Visualization…

200 papers

Neural network training relies on our ability to find "good" minimizers of highly non-convex loss functions. It is well-known that certain network architecture designs (e.g., skip connections) produce loss functions that train easier, and…

Machine Learning · Computer Science 2018-11-08 Hao Li, Zheng Xu, Gavin Taylor, Christoph Studer, Tom Goldstein

It is widely observed that deep learning models with learned parameters generalize well, even with much more model parameters than the number of training samples. We systematically investigate the underlying reasons why deep neural networks…

Machine Learning · Computer Science 2017-11-29 Lei Wu, Zhanxing Zhu, Weinan E

Despite the huge empirical success of deep learning, theoretical understanding of neural networks learning process is still lacking. This is the reason, why some of its features seem "mysterious". We emphasize two mysteries of deep…

Machine Learning · Computer Science 2019-05-20 Eugene Golikov

The primary objective of learning methods is generalization. Classic uniform generalization bounds, which rely on VC-dimension or Rademacher complexity, fail to explain the significant attribute that over-parameterized models in deep…

Machine Learning · Computer Science 2025-03-07 Lijia Yu, Yibo Miao, Yifan Zhu, Xiao-Shan Gao, Lijun Zhang

How do neural networks extract patterns from pixels? Feature visualizations attempt to answer this important question by visualizing highly activating patterns through optimization. Today, visualization methods form the foundation of our…

Computer Vision and Pattern Recognition · Computer Science 2024-06-10 Robert Geirhos, Roland S. Zimmermann, Blair Bilodeau, Wieland Brendel, Been Kim

Generalization to unseen degradations remains a fundamental challenge for low-level vision models. This paper aims to investigate the underlying mechanism of this failure, using image deraining as a primary case study due to its…

Computer Vision and Pattern Recognition · Computer Science 2026-02-25 Jinfan Hu, Zhiyuan You, Jinjin Gu, Kaiwen Zhu, Tianfan Xue, Chao Dong

A core issue with learning to optimize neural networks has been the lack of generalization to real world problems. To address this, we describe a system designed from a generalization-first perspective, learning to update optimizer…

Machine Learning · Computer Science 2021-06-09 Diogo Almeida, Clemens Winter, Jie Tang, Wojciech Zaremba

Graph neural networks (GNNs) are the most widely adopted model in graph-structured data oriented learning and representation. Despite their extraordinary success in real-world applications, understanding their working mechanism by theory is…

Machine Learning · Computer Science 2023-05-16 Huayi Tang, Yong Liu

Generalization to unseen data remains poorly understood for deep learning classification and foundation models, especially in the open set scenario. How can one assess the ability of networks to adapt to new or extended versions of their…

Machine Learning · Computer Science 2024-11-05 Luciano Dyballa, Evan Gerritz, Steven W. Zucker

One of the major concerns for neural network training is that the non-convexity of the associated loss functions may cause bad landscape. The recent success of neural networks suggests that their loss landscape is not too bad, but what…

Machine Learning · Computer Science 2023-07-19 Ruoyu Sun, Dawei Li, Shiyu Liang, Tian Ding, R Srikant

Visualization as a discipline often grapples with generalization by reasoning about how study results on the efficacy of a tool in one context might apply to another context. This work offers an account of the logic of generalization in…

Human-Computer Interaction · Computer Science 2025-08-12 Alex Kale

This paper provides theoretical insights into why and how deep learning can generalize well, despite its large capacity, complexity, possible algorithmic instability, nonrobustness, and sharp minima, responding to an open question in the…

Machine Learning · Statistics 2023-08-29 Kenji Kawaguchi, Leslie Pack Kaelbling, Yoshua Bengio

Along with the rapid development of deep learning in practice, the theoretical explanations for its success become urgent. Generalization and expressivity are two widely used measurements to quantify theoretical behaviors of deep learning.…

Machine Learning · Computer Science 2018-03-26 Shao-Bo Lin

Deep learning models have lately shown great performance in various fields such as computer vision, speech recognition, speech translation, and natural language processing. However, alongside their state-of-the-art performance, it is still…

Machine Learning · Computer Science 2019-04-09 Daniel Jakubovitz, Raja Giryes, Miguel R. D. Rodrigues

Network visualization is essential for many scientific, societal, technological and artistic domains. The primary goal is to highlight patterns out of nodes interconnected by edges that are easy to understand, facilitate communication and…

Physics and Society · Physics 2024-06-18 Fabrizio De Vico Fallani, Thibault Rolland

Overparameterized deep networks that generalize well have been key to the dramatic success of deep learning in recent years. The reasons for their remarkable ability to generalize are not well understood yet. When class labels in the…

Machine Learning · Computer Science 2026-02-03 Simran Ketha, Venkatakrishnan Ramaswamy

Data visualization is the process by which data of any size or dimensionality is processed to produce an understandable set of data in a lower dimensionality, allowing it to be manipulated and understood more easily by people. The goal of…

Graphics · Computer Science 2021-07-06 Alexander Kiefer, Md. Khaledur Rahman

Deep neural networks generalize well on unseen data though the number of parameters often far exceeds the number of training examples. Recently proposed complexity measures have provided insights to understanding the generalizability in…

Machine Learning · Computer Science 2020-05-12 Jingling Li, Yanchao Sun, Jiahao Su, Taiji Suzuki, Furong Huang

Artificial intelligence (AI) systems power the world we live in. Deep neural networks (DNNs) are able to solve tasks in an ever-expanding landscape of scenarios, but our eagerness to apply these powerful models leads us to focus on their…

Computer Vision and Pattern Recognition · Computer Science 2022-04-22 Loris Giulivi, Mark James Carman, Giacomo Boracchi

Neural networks can be powerful function approximators, which are able to model high-dimensional feature distributions from a subset of examples drawn from the target distribution. Naturally, they perform well at generalizing within the…

Machine Learning · Computer Science 2021-08-06 Aaron Eisermann, Jae Hee Lee, Cornelius Weber, Stefan Wermter