English
Related papers

Related papers: Teaching with Commentaries

200 papers

Over the past decade, deep neural networks have demonstrated significant success using the training scheme that involves mini-batch stochastic gradient descent on extensive datasets. Expanding upon this accomplishment, there has been a…

Machine Learning · Computer Science 2024-11-11 Jaehyeon Son , Soochan Lee , Gunhee Kim

Deep neural networks can achieve great successes when presented with large data sets and sufficient computational resources. However, their ability to learn new concepts quickly is limited. Meta-learning is one approach to address this…

Machine Learning · Computer Science 2021-04-22 Mike Huisman , Jan N. van Rijn , Aske Plaat

We propose meta-curvature (MC), a framework to learn curvature information for better generalization and fast model adaptation. MC expands on the model-agnostic meta-learner (MAML) by learning to transform the gradients in the inner…

Machine Learning · Computer Science 2020-01-10 Eunbyung Park , Junier B. Oliva

Most efforts in interpretability in deep learning have focused on (1) extracting explanations of a specific downstream task in relation to the input features and (2) imposing constraints on the model, often at the expense of predictive…

Machine Learning · Computer Science 2022-02-22 Marco Bertolini , Djork-Arné Clevert , Floriane Montanari

Deep learning has arguably achieved tremendous success in recent years. In simple words, deep learning uses the composition of many nonlinear functions to model the complex dependency between input features and labels. While neural networks…

Machine Learning · Statistics 2019-04-16 Jianqing Fan , Cong Ma , Yiqiao Zhong

Deep Learning shows very good performance when trained on large labeled data sets. The problem of training a deep net on a few or one sample per class requires a different learning approach which can generalize to unseen classes using only…

Machine Learning · Computer Science 2018-08-23 Jinchao Liu , Stuart J. Gibson , Margarita Osadchy

Latent representation learned from multi-layered neural networks via hierarchical feature abstraction enables recent success of deep learning. Under the deep learning framework, generalization performance highly depends on the learned…

Machine Learning · Computer Science 2016-11-07 Hyo-Eun Kim , Sangheum Hwang , Kyunghyun Cho

Motivated by the growing amount of publicly available video data on online streaming services and an increased interest in applications that analyze continuous video streams such as autonomous driving, this technical report provides a…

Computer Vision and Pattern Recognition · Computer Science 2021-01-11 Becky Mashaido

Neural networks have been successfully applied in applications with a large amount of labeled data. However, the task of rapid generalization on new concepts with small training data while preserving performances on previously learned ones…

Machine Learning · Computer Science 2017-06-09 Tsendsuren Munkhdalai , Hong Yu

In this paper we present a technique to train neural network models on small amounts of data. Current methods for training neural networks on small amounts of rich data typically rely on strategies such as fine-tuning a pre-trained neural…

Machine Learning · Computer Science 2016-11-08 Ark Anderson , Kyle Shaffer , Artem Yankov , Court D. Corley , Nathan O. Hodas

The exponential growth of volume, variety and velocity of data is raising the need for investigations of automated or semi-automated ways to extract useful patterns from the data. It requires deep expert knowledge and extensive…

Machine Learning · Computer Science 2020-07-22 Abbas Raza Ali , Marcin Budka , Bogdan Gabrys

Saliency methods have been widely used to highlight important input features in model predictions. Most existing methods use backpropagation on a modified gradient function to generate saliency maps. Thus, noisy gradients can result in…

Computer Vision and Pattern Recognition · Computer Science 2021-11-30 Aya Abdelsalam Ismail , Héctor Corrada Bravo , Soheil Feizi

Compared with traditional deep learning techniques, continual learning enables deep neural networks to learn continually and adaptively. Deep neural networks have to learn new tasks and overcome forgetting the knowledge obtained from the…

Machine Learning · Computer Science 2022-02-08 Yujiang He

Teaching plays a very important role in our society, by spreading human knowledge and educating our next generations. A good teacher will select appropriate teaching materials, impact suitable methodologies, and set up targeted…

Machine Learning · Computer Science 2018-05-11 Yang Fan , Fei Tian , Tao Qin , Xiang-Yang Li , Tie-Yan Liu

Many prediction problems, such as those that arise in the context of robotics, have a simplifying underlying structure that, if known, could accelerate learning. In this paper, we present a strategy for learning a set of neural network…

Machine Learning · Computer Science 2019-05-06 Ferran Alet , Tomás Lozano-Pérez , Leslie P. Kaelbling

Scaling model capacity has been vital in the success of deep learning. For a typical network, necessary compute resources and training time grow dramatically with model size. Conditional computation is a promising way to increase the number…

Machine Learning · Computer Science 2018-11-14 Louis Kirsch , Julius Kunze , David Barber

Neural networks are among the most accurate supervised learning methods in use today. However, their opacity makes them difficult to trust in critical applications, especially when conditions in training may differ from those in practice.…

Machine Learning · Computer Science 2018-10-03 Andrew Slavin Ross

Deep Neural Networks are well known for efficiently fitting training data, yet experiencing poor generalization capabilities whenever some kind of bias dominates over the actual task labels, resulting in models learning "shortcuts". In…

Machine Learning · Computer Science 2024-08-12 Pietro Morerio , Ruggero Ragonesi , Vittorio Murino

Recently, the concept of teaching has been introduced into machine learning, in which a teacher model is used to guide the training of a student model (which will be used in real tasks) through data selection, loss function design, etc.…

Machine Learning · Computer Science 2021-01-13 Yang Fan , Yingce Xia , Lijun Wu , Shufang Xie , Weiqing Liu , Jiang Bian , Tao Qin , Xiang-Yang Li

Deep learning has become the dominant approach for creating high capacity, scalable models across diverse data modalities. However, because these models rely on a large number of learned parameters, tightly couple feature extraction with…

Artificial Intelligence · Computer Science 2026-05-12 Adam Gould , Francesca Toni
‹ Prev 1 2 3 10 Next ›