Related papers: Robust Student Network Learning

Learning Student Networks via Feature Embedding

Deep convolutional neural networks have been widely used in numerous applications, but their demanding storage and computational resource requirements prevent their applications on mobile devices. Knowledge distillation aims to optimize a…

Machine Learning · Computer Science 2018-12-18 Hanting Chen , Yunhe Wang , Chang Xu , Chao Xu , Dacheng Tao

On Learnability via Gradient Method for Two-Layer ReLU Neural Networks in Teacher-Student Setting

Deep learning empirically achieves high performance in many applications, but its training dynamics has not been fully understood theoretically. In this paper, we explore theoretical analysis on training two-layer ReLU neural networks in a…

Machine Learning · Statistics 2021-06-30 Shunta Akiyama , Taiji Suzuki

Knowledge Distillation in Generations: More Tolerant Teachers Educate Better Students

We focus on the problem of training a deep neural network in generations. The flowchart is that, in order to optimize the target network (student), another network (teacher) with the same architecture is first trained, and used to provide…

Computer Vision and Pattern Recognition · Computer Science 2018-09-10 Chenglin Yang , Lingxi Xie , Siyuan Qiao , Alan Yuille

Teacher-Class Network: A Neural Network Compression Mechanism

To reduce the overwhelming size of Deep Neural Networks (DNN) teacher-student methodology tries to transfer knowledge from a complex teacher network to a simple student network. We instead propose a novel method called the teacher-class…

Machine Learning · Computer Science 2021-11-02 Shaiq Munir Malik , Muhammad Umair Haider , Mohbat Tharani , Musab Rasheed , Murtaza Taj

How Robust are Deep Neural Networks?

Convolutional and Recurrent, deep neural networks have been successful in machine learning systems for computer vision, reinforcement learning, and other allied fields. However, the robustness of such neural networks is seldom apprised,…

Neural and Evolutionary Computing · Computer Science 2018-05-01 Biswa Sengupta , Karl J. Friston

Towards Robust Neural Networks with Lipschitz Continuity

Deep neural networks have shown remarkable performance across a wide range of vision-based tasks, particularly due to the availability of large-scale datasets for training and better architectures. However, data seen in the real world are…

Machine Learning · Computer Science 2018-11-26 Muhammad Usama , Dong Eui Chang

Excess Risk of Two-Layer ReLU Neural Networks in Teacher-Student Settings and its Superiority to Kernel Methods

While deep learning has outperformed other methods for various tasks, theoretical frameworks that explain its reason have not been fully established. To address this issue, we investigate the excess risk of two-layer ReLU neural networks in…

Machine Learning · Statistics 2022-06-07 Shunta Akiyama , Taiji Suzuki

Understanding Robustness in Teacher-Student Setting: A New Perspective

Adversarial examples have appeared as a ubiquitous property of machine learning models where bounded adversarial perturbation could mislead the models to make arbitrarily incorrect predictions. Such examples provide a way to assess the…

Machine Learning · Computer Science 2021-03-02 Zhuolin Yang , Zhaoxi Chen , Tiffany Cai , Xinyun Chen , Bo Li , Yuandong Tian

Adversarial Network Compression

Neural network compression has recently received much attention due to the computational requirements of modern deep models. In this work, our objective is to transfer knowledge from a deep and accurate model to a smaller one. Our…

Computer Vision and Pattern Recognition · Computer Science 2018-11-15 Vasileios Belagiannis , Azade Farshad , Fabio Galasso

Reachable Sets of Classifiers and Regression Models: (Non-)Robustness Analysis and Robust Training

Neural networks achieve outstanding accuracy in classification and regression tasks. However, understanding their behavior still remains an open challenge that requires questions to be addressed on the robustness, explainability and…

Machine Learning · Computer Science 2021-05-13 Anna-Kathrin Kopetzki , Stephan Günnemann

Provably robust learning of regression neural networks using $\beta$-divergences

Regression neural networks (NNs) are most commonly trained by minimizing the mean squared prediction error, which is highly sensitive to outliers and data contamination. Existing robust training methods for regression NNs are often limited…

Machine Learning · Statistics 2026-02-10 Abhik Ghosh , Suryasis Jana

Deep Knowledge Tracing

Knowledge tracing---where a machine models the knowledge of a student as they interact with coursework---is a well established problem in computer supported education. Though effectively modeling student knowledge would have high…

Artificial Intelligence · Computer Science 2015-06-22 Chris Piech , Jonathan Spencer , Jonathan Huang , Surya Ganguli , Mehran Sahami , Leonidas Guibas , Jascha Sohl-Dickstein

Improving Fast Segmentation With Teacher-student Learning

Recently, segmentation neural networks have been significantly improved by demonstrating very promising accuracies on public benchmarks. However, these models are very heavy and generally suffer from low inference speed, which limits their…

Computer Vision and Pattern Recognition · Computer Science 2018-10-22 Jiafeng Xie , Bing Shuai , Jian-Fang Hu , Jingyang Lin , Wei-Shi Zheng

Towards Compact and Robust Deep Neural Networks

Deep neural networks have achieved impressive performance in many applications but their large number of parameters lead to significant computational and storage overheads. Several recent works attempt to mitigate these overheads by…

Machine Learning · Computer Science 2019-06-17 Vikash Sehwag , Shiqi Wang , Prateek Mittal , Suman Jana

Ternary Neural Networks for Resource-Efficient AI Applications

The computation and storage requirements for Deep Neural Networks (DNNs) are usually high. This issue limits their deployability on ubiquitous computing devices such as smart phones, wearables and autonomous drones. In this paper, we…

Machine Learning · Computer Science 2017-02-28 Hande Alemdar , Vincent Leroy , Adrien Prost-Boucle , Frédéric Pétrot

FitNets: Hints for Thin Deep Nets

While depth tends to improve network performances, it also makes gradient-based training more difficult since deeper networks tend to be more non-linear. The recently proposed knowledge distillation approach is aimed at obtaining small and…

Machine Learning · Computer Science 2015-03-30 Adriana Romero , Nicolas Ballas , Samira Ebrahimi Kahou , Antoine Chassang , Carlo Gatta , Yoshua Bengio

Knowledge Projection for Deep Neural Networks

While deeper and wider neural networks are actively pushing the performance limits of various computer vision and machine learning tasks, they often require large sets of labeled data for effective training and suffer from extremely high…

Computer Vision and Pattern Recognition · Computer Science 2017-10-27 Zhi Zhang , Guanghan Ning , Zhihai He

Neural Network Robustness as a Verification Property: A Principled Case Study

Neural networks are very successful at detecting patterns in noisy data, and have become the technology of choice in many fields. However, their usefulness is hampered by their susceptibility to adversarial attacks. Recently, many methods…

Machine Learning · Computer Science 2022-07-14 Marco Casadio , Ekaterina Komendantskaya , Matthew L. Daggitt , Wen Kokke , Guy Katz , Guy Amir , Idan Refaeli

Adversarial Robustness in Multi-Task Learning: Promises and Illusions

Vulnerability to adversarial attacks is a well-known weakness of Deep Neural networks. While most of the studies focus on single-task neural networks with computer vision datasets, very little research has considered complex multi-task…

Machine Learning · Computer Science 2021-10-29 Salah Ghamizi , Maxime Cordy , Mike Papadakis , Yves Le Traon

Robustness of Neural Networks: A Probabilistic and Practical Approach

Neural networks are becoming increasingly prevalent in software, and it is therefore important to be able to verify their behavior. Because verifying the correctness of neural networks is extremely challenging, it is common to focus on the…

Machine Learning · Computer Science 2019-02-19 Ravi Mangal , Aditya V. Nori , Alessandro Orso