English
Related papers

Related papers: Gradient-based Competitive Learning: Theory

200 papers

Topological learning is a wide research area aiming at uncovering the mutual spatial relationships between the elements of a set. Some of the most common and oldest approaches involve the use of unsupervised competitive neural networks.…

Machine Learning · Statistics 2021-11-03 Pietro Barbiero , Gabriele Ciravegna , Vincenzo Randazzo , Giansalvo Cirrincione

We address the challenging problem of deep representation learning--the efficient adaption of a pre-trained deep network to different tasks. Specifically, we propose to explore gradient-based features. These features are gradients of the…

Machine Learning · Computer Science 2020-04-14 Fangzhou Mu , Yingyu Liang , Yin Li

In this work we address graph based semi-supervised learning using the theory of the spatial segregation of competitive systems. First, we define a discrete counterpart over connected graphs by using direct analogue of the corresponding…

Numerical Analysis · Mathematics 2022-11-30 Farid Bozorgnia , Morteza Fotouhi , Avetik Arakelyan , Abderrahim Elmoataz

We propose a new technique that boosts the convergence of training generative adversarial networks. Generally, the rate of training deep models reduces severely after multiple iterations. A key reason for this phenomenon is that a deep…

Machine Learning · Statistics 2018-06-15 Atsushi Nitanda , Taiji Suzuki

Intermediate features at different layers of a deep neural network are known to be discriminative for visual patterns of different complexities. However, most existing works ignore such cross-layer heterogeneities when classifying samples…

Computer Vision and Pattern Recognition · Computer Science 2016-07-20 Xiaojie Jin , Yunpeng Chen , Jian Dong , Jiashi Feng , Shuicheng Yan

Deep neural networks achieve state-of-the-art and sometimes super-human performance across various domains. However, when learning tasks sequentially, the networks easily forget the knowledge of previous tasks, known as "catastrophic…

Computer Vision and Pattern Recognition · Computer Science 2021-05-18 Shixiang Tang , Dapeng Chen , Jinguo Zhu , Shijie Yu , Wanli Ouyang

Admittedly, Graph Convolution Network (GCN) has achieved excellent results on graph datasets such as social networks, citation networks, etc. However, softmax used as the decision layer in these frameworks is generally optimized with…

Machine Learning · Computer Science 2022-07-19 Xuelong Li , Ziheng Jiao , Hongyuan Zhang , Rui Zhang

Topology applied to real world data using persistent homology has started to find applications within machine learning, including deep learning. We present a differentiable topology layer that computes persistent homology based on level set…

Deep learning has arguably achieved tremendous success in recent years. In simple words, deep learning uses the composition of many nonlinear functions to model the complex dependency between input features and labels. While neural networks…

Machine Learning · Statistics 2019-04-16 Jianqing Fan , Cong Ma , Yiqiao Zhong

Neural networks have achieved remarkable empirical performance, while the current theoretical analysis is not adequate for understanding their success, e.g., the Neural Tangent Kernel approach fails to capture their key feature learning…

Machine Learning · Computer Science 2023-10-20 Zhenmei Shi , Junyi Wei , Yingyu Liang

In the context of classification problems, Deep Learning (DL) approaches represent state of art. Many DL approaches are based on variations of standard multi-layer feed-forward neural networks. These are also referred to as deep networks.…

Machine Learning · Computer Science 2023-11-21 Andrea Apicella , Francesco Isgrò , Roberto Prevete

We propose gradient adversarial training, an auxiliary deep learning framework applicable to different machine learning problems. In gradient adversarial training, we leverage a prior belief that in many contexts, simultaneous gradient…

Machine Learning · Computer Science 2018-06-22 Ayan Sinha , Zhao Chen , Vijay Badrinarayanan , Andrew Rabinovich

Deep learning empirically achieves high performance in many applications, but its training dynamics has not been fully understood theoretically. In this paper, we explore theoretical analysis on training two-layer ReLU neural networks in a…

Machine Learning · Statistics 2021-06-30 Shunta Akiyama , Taiji Suzuki

Deep learning, a branch of artificial intelligence, is a data-driven method that uses multiple layers of interconnected units or neurons to learn intricate patterns and representations directly from raw input data. Empowered by this…

Machine Learning · Computer Science 2025-07-28 Mohd Halim Mohd Noor , Ayokunle Olalekan Ige

Deep learning has shown promising results in many machine learning applications. The hierarchical feature representation built by deep networks enable compact and precise encoding of the data. A kernel analysis of the trained deep networks…

Machine Learning · Computer Science 2017-03-22 Mandar Kulkarni , Shirish Karande

Recently, several studies have proven the global convergence and generalization abilities of the gradient descent method for two-layer ReLU networks. Most studies especially focused on the regression problems with the squared loss function,…

Machine Learning · Statistics 2020-03-19 Atsushi Nitanda , Geoffrey Chinot , Taiji Suzuki

Although statistical learning theory provides a robust framework to understand supervised learning, many theoretical aspects of deep learning remain unclear, in particular how different architectures may lead to inductive bias when trained…

Machine Learning · Computer Science 2024-03-27 Cédric Gerbelot , Avetik Karagulyan , Stefani Karp , Kavya Ravichandran , Menachem Stern , Nathan Srebro

Distributed deep learning systems (DDLS) train deep neural network models by utilizing the distributed resources of a cluster. Developers of DDLS are required to make many decisions to process their particular workloads in their chosen…

Distributed, Parallel, and Cluster Computing · Computer Science 2020-07-09 Matthias Langer , Zhen He , Wenny Rahayu , Yanbo Xue

In this study, we propose the integration of competitive learning into convolutional neural networks (CNNs) to improve the representation learning and efficiency of fine-tuning. Conventional CNNs use back propagation learning, and it…

Machine Learning · Computer Science 2018-04-27 Takashi Shinozaki

Despite the prevalence and significance of tabular data across numerous industries and fields, it has been relatively underexplored in the realm of deep learning. Even today, neural networks are often overshadowed by techniques such as…

Machine Learning · Computer Science 2024-07-19 Andreas Voskou , Charalambos Christoforou , Sotirios Chatzis
‹ Prev 1 2 3 10 Next ›