Related papers: A Kernel Perspective on Distillation-based Collabo…

Data-Efficient Ranking Distillation for Image Retrieval

Recent advances in deep learning have led to rapid developments in the field of image retrieval. However, the best-performing architectures incur significant computational cost. Recent approaches tackle this issue using knowledge…

Computer Vision and Pattern Recognition · Computer Science 2020-07-14 Zakaria Laskar, Juho Kannala

Competitive Distillation: A Simple Learning Strategy for Improving Visual Classification

Deep Neural Networks (DNNs) have significantly advanced the field of computer vision. To improve the DNN training process, knowledge distillation methods demonstrate their effectiveness in accelerating network training by introducing a fixed…

Computer Vision and Pattern Recognition · Computer Science 2025-07-01 Daqian Shi, Xiaolei Diao, Xu Chen, Cédric M. John

Distributed Kernel Regression: An Algorithm for Training Collaboratively

This paper addresses the problem of distributed learning under communication constraints, motivated by distributed signal processing in wireless sensor networks and data mining with distributed databases. After formalizing a general model…

Machine Learning · Computer Science 2016-11-15 Joel B. Predd, Sanjeev R. Kulkarni, H. Vincent Poor

Towards Model Agnostic Federated Learning Using Knowledge Distillation

Is it possible to design a universal API for federated learning with which an ad-hoc group of data-holders (agents) can collaborate with each other and perform federated learning? Such an API would necessarily need to be model-agnostic, i.e.…

Machine Learning · Computer Science 2022-05-12 Andrei Afonin, Sai Praneeth Karimireddy

One-Shot Collaborative Data Distillation

Large machine-learning training datasets can be distilled into small collections of informative synthetic data samples. These synthetic sets support efficient model learning and reduce the communication cost of data sharing. Thus,…

Machine Learning · Computer Science 2024-08-13 William Holland, Chandra Thapa, Sarah Ali Siddiqui, Wei Shao, Seyit Camtepe

Decentralized Learning with Multi-Headed Distillation

Decentralized learning with private data is a central problem in machine learning. We propose a novel distillation-based decentralized learning technique that allows multiple agents with private non-iid data to learn from each other,…

Machine Learning · Computer Science 2022-11-30 Andrey Zhmoginov, Mark Sandler, Nolan Miller, Gus Kristiansen, Max Vladymyrov

Distilling Lightweight Domain Experts from Large ML Models by Identifying Relevant Subspaces

Knowledge distillation involves transferring the predictive capabilities of large, high-performing AI models (teachers) to smaller models (students) that can operate in environments with limited computing power. In this paper, we address…

Machine Learning · Computer Science 2026-01-12 Pattarawat Chormai, Ali Hashemi, Klaus-Robert Müller, Grégoire Montavon

Collaborative Learning via Prediction Consensus

We consider a collaborative learning setting where the goal of each agent is to improve their own model by leveraging the expertise of collaborators, in addition to their own training data. To facilitate the exchange of expertise among…

Machine Learning · Computer Science 2023-11-16 Dongyang Fan, Celestine Mendler-Dünner, Martin Jaggi

Improved Knowledge Distillation via Full Kernel Matrix Transfer

Knowledge distillation is an effective way for model compression in deep learning. Given a large model (i.e., teacher model), it aims to improve the performance of a compact model (i.e., student model) by transferring the information from…

Machine Learning · Computer Science 2022-03-31 Qi Qian, Hao Li, Juhua Hu

Interpretable Few-Shot Learning via Linear Distillation

It is important to develop mathematically tractable models that can interpret knowledge extracted from the data and provide reasonable predictions. In this paper, we present Linear Distillation Learning, a simple remedy to improve the…

Machine Learning · Computer Science 2019-10-14 Arip Asadulaev, Igor Kuznetsov, Andrey Filchenkov

A Comprehensive Study on Dataset Distillation: Performance, Privacy, Robustness and Fairness

The aim of dataset distillation is to encode the rich features of an original dataset into a tiny dataset. It is a promising approach to accelerate neural network training and related studies. Different approaches have been proposed to…

Machine Learning · Computer Science 2023-05-30 Zongxiong Chen, Jiahui Geng, Derui Zhu, Herbert Woisetschlaeger, Qing Li, Sonja Schimmler, Ruben Mayer, Chunming Rong

Clustering on Multiple Incomplete Datasets via Collective Kernel Learning

Multiple datasets containing different types of features may be available for a given task. For instance, users' profiles can be used to group users for recommendation systems. In addition, a model can also use users' historical behaviors…

Machine Learning · Computer Science 2016-05-10 Weixiang Shao, Xiaoxiao Shi, Philip S. Yu

A Diffusion Kernel LMS algorithm for nonlinear adaptive networks

This work presents a distributed algorithm for nonlinear adaptive learning. In particular, a set of nodes obtain measurements, sequentially one per time step, which are related via a nonlinear function; their goal is to collectively…

Information Theory · Computer Science 2016-02-09 Symeon Chouvardas, Moez Draief

Towards a theory of model distillation

Distillation is the task of replacing a complicated machine learning model with a simpler model that approximates the original [BCNM06,HVD15]. Despite many practical applications, basic questions about the extent to which models can be…

Machine Learning · Computer Science 2024-05-07 Enric Boix-Adsera

SimReg: Regression as a Simple Yet Effective Tool for Self-supervised Knowledge Distillation

Feature regression is a simple way to distill large neural network models to smaller ones. We show that with simple changes to the network architecture, regression can outperform more complex state-of-the-art approaches for knowledge…

Computer Vision and Pattern Recognition · Computer Science 2022-01-14 K L Navaneet, Soroush Abbasi Koohpayegani, Ajinkya Tejankar, Hamed Pirsiavash

Large scale distributed neural network training through online distillation

Techniques such as ensembling and distillation promise model quality improvements when paired with almost any base model. However, due to increased test-time cost (for ensembles) and increased complexity of the training pipeline (for…

Machine Learning · Computer Science 2020-08-24 Rohan Anil, Gabriel Pereyra, Alexandre Passos, Robert Ormandi, George E. Dahl, Geoffrey E. Hinton

Even your Teacher Needs Guidance: Ground-Truth Targets Dampen Regularization Imposed by Self-Distillation

Knowledge distillation is classically a procedure where a neural network is trained on the output of another network along with the original targets in order to transfer knowledge between the architectures. The special case of…

Machine Learning · Computer Science 2021-10-18 Kenneth Borup, Lars N. Andersen

Diversity-Driven Generative Dataset Distillation Based on Diffusion Model with Self-Adaptive Memory

Dataset distillation enables the training of deep neural networks with comparable performance in significantly reduced time by compressing large datasets into small and representative ones. Although the introduction of generative models has…

Machine Learning · Computer Science 2025-05-27 Mingzhuo Li, Guang Li, Jiafeng Mao, Takahiro Ogawa, Miki Haseyama

Kernels on Sample Sets via Nonparametric Divergence Estimates

Most machine learning algorithms, such as classification or regression, treat the individual data point as the object of interest. Here we consider extending machine learning algorithms to operate on groups of data points. We suggest…

Machine Learning · Computer Science 2021-01-15 Danica J. Sutherland, Liang Xiong, Barnabás Póczos, Jeff Schneider

Study on the simulation control of neural network algorithm in thermally coupled distillation

Thermally coupled distillation is a new energy-saving method, but the traditional thermally coupled distillation simulation calculation process is complicated, and the optimization method based on the traditional simulation process is…

Chemical Physics · Physics 2021-02-09 ZhaoLan Zheng, Yu Qi