English
Related papers

Related papers: A resource-efficient model for deep kernel learnin…

200 papers

The computational complexity of kernel methods has often been a major barrier for applying them to large-scale learning problems. We argue that this barrier can be effectively overcome. In particular, we develop methods to scale up kernel…

Current Deep Learning approaches have been very successful using convolutional neural networks (CNN) trained on large graphical processing units (GPU)-based computers. Three limitations of this approach are: 1) they are based on a simple…

Neural and Evolutionary Computing · Computer Science 2017-07-17 Thomas E. Potok , Catherine Schuman , Steven R. Young , Robert M. Patton , Federico Spedalieri , Jeremy Liu , Ke-Thia Yao , Garrett Rose , Gangotree Chakma

Artificial intelligence, particularly the subfield of machine learning, has seen a paradigm shift towards data-driven models that learn from and adapt to data. This has resulted in unprecedented advancements in various domains such as…

Machine Learning · Computer Science 2023-10-26 Leonardo Petrini

Deep learning models are yielding increasingly better performances thanks to multiple factors. To be successful, model may have large number of parameters or complex architectures and be trained on large dataset. This leads to large…

Machine Learning · Computer Science 2022-12-20 Jean-Roch Vlimant , Junqi Yin

Random Fourier features provide a way to tackle large-scale machine learning problems with kernel methods. Their slow Monte Carlo convergence rate has motivated the research of deterministic Fourier features whose approximation error can…

Machine Learning · Computer Science 2021-10-20 Frederiek Wesel , Kim Batselier

This paper studies an intriguing phenomenon related to the good generalization performance of estimators obtained by using large learning rates within gradient descent algorithms. First observed in the deep learning literature, we show that…

Machine Learning · Statistics 2022-06-06 Gaspard Beugnot , Julien Mairal , Alessandro Rudi

Scaling model capacity has been vital in the success of deep learning. For a typical network, necessary compute resources and training time grow dramatically with model size. Conditional computation is a promising way to increase the number…

Machine Learning · Computer Science 2018-11-14 Louis Kirsch , Julius Kunze , David Barber

Deep learning methods have predominantly been applied to large artificial neural networks. Despite their state-of-the-art performance, these large networks typically do not generalize well to datasets with limited sample sizes. In this…

Machine Learning · Statistics 2016-11-17 Eric Strobl , Shyam Visweswaran

The curse of dimensionality in the realm of association rules is twofold. Firstly, we have the well known exponential increase in computational complexity with increasing item set size. Secondly, there is a \emph{related curse} concerned…

Artificial Intelligence · Computer Science 2018-05-16 Tom Hanika , Friedrich Martin Schneider , Gerd Stumme

Two pressing topics in the theory of deep learning are the interpretation of feature learning (FL) mechanisms and the determination of implicit bias of networks in the rich regime. Current theories of rich FL often appear in the form of…

Machine Learning · Computer Science 2026-03-25 Noa Rubin , Orit Davidovich , Zohar Ringel

We introduce a novel kernel-based framework for learning differential equations and their solution maps that is efficient in data requirements, in terms of solution examples and amount of measurements from each example, and computational…

Machine Learning · Statistics 2025-04-07 Yasamin Jalalian , Juan Felipe Osorio Ramirez , Alexander Hsu , Bamdad Hosseini , Houman Owhadi

Deep learning, a branch of artificial intelligence, is a data-driven method that uses multiple layers of interconnected units or neurons to learn intricate patterns and representations directly from raw input data. Empowered by this…

Machine Learning · Computer Science 2025-07-28 Mohd Halim Mohd Noor , Ayokunle Olalekan Ige

A novel energy-efficient edge computing paradigm is proposed for real-time deep learning-based image upsampling applications. State-of-the-art deep learning solutions for image upsampling are currently trained using either resize or…

Computer Vision and Pattern Recognition · Computer Science 2021-07-27 Ian Colbert , Ken Kreutz-Delgado , Srinjoy Das

Model complexity is a fundamental problem in deep learning. In this paper we conduct a systematic overview of the latest studies on model complexity in deep learning. Model complexity of deep learning can be categorized into expressive…

Machine Learning · Computer Science 2021-08-04 Xia Hu , Lingyang Chu , Jian Pei , Weiqing Liu , Jiang Bian

Exploring deep convolutional neural networks of high efficiency and low memory usage is very essential for a wide variety of machine learning tasks. Most of existing approaches used to accelerate deep models by manipulating parameters or…

Computer Vision and Pattern Recognition · Computer Science 2019-07-30 Chuanjian Liu , Yunhe Wang , Kai Han , Chunjing Xu , Chang Xu

The rising computational and energy demands of deep learning, particularly in large-scale architectures such as foundation models and large language models (LLMs), pose significant challenges to sustainability. Traditional gradient-based…

Machine Learning · Computer Science 2025-09-19 Mohammad Saleh Vahdatpour , Huaiyuan Chu , Yanqing Zhang

Recursive Neural Networks are non-linear adaptive models that are able to learn deep structured information. However, these models have not yet been broadly accepted. This fact is mainly due to its inherent complexity. In particular, not…

Neural and Evolutionary Computing · Computer Science 2009-11-18 Alejandro Chinea

We show that a variety of modern deep learning tasks exhibit a "double-descent" phenomenon where, as we increase model size, performance first gets worse and then gets better. Moreover, we show that double descent occurs not just as a…

Machine Learning · Computer Science 2019-12-06 Preetum Nakkiran , Gal Kaplun , Yamini Bansal , Tristan Yang , Boaz Barak , Ilya Sutskever

Large-scale deep neural networks consume expensive training costs, but the training results in less-interpretable weight matrices constructing the networks. Here, we propose a mode decomposition learning that can interpret the weight…

Machine Learning · Computer Science 2023-04-13 Chan Li , Haiping Huang

Deep neural networks have been extremely successful at various image, speech, video recognition tasks because of their ability to model deep structures within the data. However, they are still prohibitively expensive to train and apply for…

Neural and Evolutionary Computing · Computer Science 2015-04-13 Sudheendra Vijayanarasimhan , Jonathon Shlens , Rajat Monga , Jay Yagnik
‹ Prev 1 2 3 10 Next ›