Related papers: Learning from Higher-Layer Feature Visualizations

Network-Agnostic Knowledge Transfer for Medical Image Segmentation

Conventional transfer learning leverages weights of pre-trained networks, but mandates the need for similar neural architectures. Alternatively, knowledge distillation can transfer knowledge between heterogeneous networks but often requires…

Computer Vision and Pattern Recognition · Computer Science 2021-01-26 Shuhang Wang, Vivek Kumar Singh, Alex Benjamin, Mercy Asiedu, Elham Yousef Kalafi, Eugene Cheah, Viksit Kumar, Anthony Samir

Dreaming to Distill: Data-free Knowledge Transfer via DeepInversion

We introduce DeepInversion, a new method for synthesizing images from the image distribution used to train a deep neural network. We 'invert' a trained network (teacher) to synthesize class-conditional input images starting from random…

Machine Learning · Computer Science 2020-06-17 Hongxu Yin, Pavlo Molchanov, Zhizhong Li, Jose M. Alvarez, Arun Mallya, Derek Hoiem, Niraj K. Jha, Jan Kautz

Effectiveness of Arbitrary Transfer Sets for Data-free Knowledge Distillation

Knowledge Distillation is an effective method to transfer the learning across deep neural networks. Typically, the dataset originally used for training the Teacher model is chosen as the "Transfer Set" to conduct the knowledge transfer to…

Machine Learning · Computer Science 2020-11-19 Gaurav Kumar Nayak, Konda Reddy Mopuri, Anirban Chakraborty

Learning Through Noise: Why Subliminal Learning Works and When It Fails

In the context of artificial neural networks, subliminal learning refers to the transfer of task-relevant knowledge or unintended biases from teacher to student models through distillation on task-unrelated input–output…

Machine Learning · Computer Science 2026-05-25 Vincent C. Brockers, Roman D. Ventzke, Valentin Neuhaus, Belén Hidalgo-Ogalde, Viola Priesemann

Analysis of Knowledge Transfer in Kernel Regime

Knowledge transfer is shown to be a very successful technique for training neural classifiers: together with the ground truth data, it uses the "privileged information" (PI) obtained by a "teacher" network to train a "student" network. It…

Machine Learning · Computer Science 2023-02-03 Arman Rahbar, Ashkan Panahi, Chiranjib Bhattacharyya, Devdatt Dubhashi, Morteza Haghir Chehreghani

Knowledge Squeezed Adversarial Network Compression

Deep network compression has achieved notable progress via knowledge distillation, where a teacher-student learning scheme is adopted with a predetermined loss. Recently, more focus has shifted to employing the adversarial…

Machine Learning · Computer Science 2019-04-26 Shu Changyong, Li Peng, Xie Yuan, Qu Yanyun, Dai Longquan, Ma Lizhuang

Distilling Image Classifiers in Object Detectors

Knowledge distillation constitutes a simple yet effective way to improve the performance of a compact student network by exploiting the knowledge of a more powerful teacher. Nevertheless, the knowledge distillation literature remains…

Computer Vision and Pattern Recognition · Computer Science 2022-02-11 Shuxuan Guo, Jose M. Alvarez, Mathieu Salzmann

Teacher-Class Network: A Neural Network Compression Mechanism

To reduce the overwhelming size of Deep Neural Networks (DNNs), the teacher-student methodology tries to transfer knowledge from a complex teacher network to a simple student network. We instead propose a novel method called the teacher-class…

Machine Learning · Computer Science 2021-11-02 Shaiq Munir Malik, Muhammad Umair Haider, Mohbat Tharani, Musab Rasheed, Murtaza Taj

Learning Student-Friendly Teacher Networks for Knowledge Distillation

We propose a novel knowledge distillation approach to facilitate the transfer of dark knowledge from a teacher to a student. Contrary to most of the existing methods that rely on effective training of student models given pretrained…

Machine Learning · Computer Science 2022-01-25 Dae Young Park, Moon-Hyun Cha, Changwook Jeong, Dae Sin Kim, Bohyung Han

UNIKD: UNcertainty-filtered Incremental Knowledge Distillation for Neural Implicit Representation

Recent neural implicit representations (NIRs) have achieved great success in the tasks of 3D reconstruction and novel view synthesis. However, they require the images of a scene from different camera views to be available for one-time…

Computer Vision and Pattern Recognition · Computer Science 2024-07-30 Mengqi Guo, Chen Li, Hanlin Chen, Gim Hee Lee

Representation Transfer by Optimal Transport

Learning generic representations with deep networks requires massive training samples and significant computer resources. To learn a new specific task, an important issue is to transfer the generic teacher's representation to a student…

Machine Learning · Computer Science 2021-03-01 Xuhong Li, Yves Grandvalet, Rémi Flamary, Nicolas Courty, Dejing Dou

Basic Level Categorization Facilitates Visual Object Recognition

Recent advances in deep learning have led to significant progress in the computer vision field, especially for visual object recognition tasks. The features useful for object classification are learned by feed-forward deep convolutional…

Computer Vision and Pattern Recognition · Computer Science 2016-01-08 Panqu Wang, Garrison W. Cottrell

SampleAhead: Online Classifier-Sampler Communication for Learning from Synthesized Data

State-of-the-art techniques of artificial intelligence, in particular deep learning, are mostly data-driven. However, collecting and manually labeling a large scale dataset is both difficult and expensive. A promising alternative is to…

Computer Vision and Pattern Recognition · Computer Science 2018-07-31 Qi Chen, Weichao Qiu, Yi Zhang, Lingxi Xie, Alan Yuille

QUEST: Quantized embedding space for transferring knowledge

Knowledge distillation refers to the process of training a compact student network to achieve better accuracy by learning from a high capacity teacher network. Most of the existing knowledge distillation methods direct the student to follow…

Computer Vision and Pattern Recognition · Computer Science 2020-07-21 Himalaya Jain, Spyros Gidaris, Nikos Komodakis, Patrick Pérez, Matthieu Cord

Adaptive Explicit Knowledge Transfer for Knowledge Distillation

Logit-based knowledge distillation (KD) for classification is cost-efficient compared to feature-based KD but often subject to inferior performance. Recently, it was shown that the performance of logit-based KD can be improved by…

Computer Vision and Pattern Recognition · Computer Science 2024-09-06 Hyungkeun Park, Jong-Seok Lee

Student Network Learning via Evolutionary Knowledge Distillation

Knowledge distillation provides an effective way to transfer knowledge via teacher-student learning, where most existing distillation approaches apply a fixed pre-trained model as teacher to supervise the learning of student network. This…

Machine Learning · Computer Science 2021-03-26 Kangkai Zhang, Chunhui Zhang, Shikun Li, Dan Zeng, Shiming Ge

Knowledge Distillation Meets Self-Supervision

Knowledge distillation, which involves extracting the "dark knowledge" from a teacher network to guide the learning of a student network, has emerged as an important technique for model compression and transfer learning. Unlike previous…

Computer Vision and Pattern Recognition · Computer Science 2020-07-14 Guodong Xu, Ziwei Liu, Xiaoxiao Li, Chen Change Loy

Exploring Knowledge Distillation of a Deep Neural Network for Multi-Script identification

Multi-lingual script identification is a difficult task involving different languages with complex backgrounds in scene text images. According to the current research scenario, deep neural networks are employed as teacher models to train…

Computer Vision and Pattern Recognition · Computer Science 2021-02-23 Shuvayan Ghosh Dastidar, Kalpita Dutta, Nibaran Das, Mahantapas Kundu, Mita Nasipuri

Visual Explanation by Interpretation: Improving Visual Feedback Capabilities of Deep Neural Networks

Interpretation and explanation of deep models is critical towards wide adoption of systems that rely on them. In this paper, we propose a novel scheme for both interpretation as well as explanation in which, given a pretrained model, we…

Computer Vision and Pattern Recognition · Computer Science 2019-03-11 Jose Oramas, Kaili Wang, Tinne Tuytelaars

Student-Teacher Learning from Clean Inputs to Noisy Inputs

Feature-based student-teacher learning, a training method that encourages the student's hidden features to mimic those of the teacher network, is empirically successful in transferring the knowledge from a pre-trained teacher network to the…

Machine Learning · Computer Science 2021-03-16 Guanzhe Hong, Zhiyuan Mao, Xiaojun Lin, Stanley H. Chan