English
Related papers

Related papers: A Kernel Perspective on Distillation-based Collabo…

200 papers

Recent advances in deep learning has lead to rapid developments in the field of image retrieval. However, the best performing architectures incur significant computational cost. Recent approaches tackle this issue using knowledge…

Computer Vision and Pattern Recognition · Computer Science 2020-07-14 Zakaria Laskar , Juho Kannala

Deep Neural Networks (DNNs) have significantly advanced the field of computer vision. To improve DNN training process, knowledge distillation methods demonstrate their effectiveness in accelerating network training by introducing a fixed…

Computer Vision and Pattern Recognition · Computer Science 2025-07-01 Daqian Shi , Xiaolei Diao , Xu Chen , Cédric M. John

This paper addresses the problem of distributed learning under communication constraints, motivated by distributed signal processing in wireless sensor networks and data mining with distributed databases. After formalizing a general model…

Machine Learning · Computer Science 2016-11-15 Joel B. Predd , Sanjeev R. Kulkarni , H. Vincent Poor

Is it possible to design an universal API for federated learning using which an ad-hoc group of data-holders (agents) collaborate with each other and perform federated learning? Such an API would necessarily need to be model-agnostic i.e.…

Machine Learning · Computer Science 2022-05-12 Andrei Afonin , Sai Praneeth Karimireddy

Large machine-learning training datasets can be distilled into small collections of informative synthetic data samples. These synthetic sets support efficient model learning and reduce the communication cost of data sharing. Thus,…

Machine Learning · Computer Science 2024-08-13 William Holland , Chandra Thapa , Sarah Ali Siddiqui , Wei Shao , Seyit Camtepe

Decentralized learning with private data is a central problem in machine learning. We propose a novel distillation-based decentralized learning technique that allows multiple agents with private non-iid data to learn from each other,…

Machine Learning · Computer Science 2022-11-30 Andrey Zhmoginov , Mark Sandler , Nolan Miller , Gus Kristiansen , Max Vladymyrov

Knowledge distillation involves transferring the predictive capabilities of large, high-performing AI models (teachers) to smaller models (students) that can operate in environments with limited computing power. In this paper, we address…

Machine Learning · Computer Science 2026-01-12 Pattarawat Chormai , Ali Hashemi , Klaus-Robert Müller , Grégoire Montavon

We consider a collaborative learning setting where the goal of each agent is to improve their own model by leveraging the expertise of collaborators, in addition to their own training data. To facilitate the exchange of expertise among…

Machine Learning · Computer Science 2023-11-16 Dongyang Fan , Celestine Mendler-Dünner , Martin Jaggi

Knowledge distillation is an effective way for model compression in deep learning. Given a large model (i.e., teacher model), it aims to improve the performance of a compact model (i.e., student model) by transferring the information from…

Machine Learning · Computer Science 2022-03-31 Qi Qian , Hao Li , Juhua Hu

It is important to develop mathematically tractable models than can interpret knowledge extracted from the data and provide reasonable predictions. In this paper, we present a Linear Distillation Learning, a simple remedy to improve the…

Machine Learning · Computer Science 2019-10-14 Arip Asadulaev , Igor Kuznetsov , Andrey Filchenkov

The aim of dataset distillation is to encode the rich features of an original dataset into a tiny dataset. It is a promising approach to accelerate neural network training and related studies. Different approaches have been proposed to…

Machine Learning · Computer Science 2023-05-30 Zongxiong Chen , Jiahui Geng , Derui Zhu , Herbert Woisetschlaeger , Qing Li , Sonja Schimmler , Ruben Mayer , Chunming Rong

Multiple datasets containing different types of features may be available for a given task. For instance, users' profiles can be used to group users for recommendation systems. In addition, a model can also use users' historical behaviors…

Machine Learning · Computer Science 2016-05-10 Weixiang Shao , Xiaoxiao Shi , Philip S. Yu

This work presents a distributed algorithm for nonlinear adaptive learning. In particular, a set of nodes obtain measurements, sequentially one per time step, which are related via a nonlinear function; their goal is to collectively…

Information Theory · Computer Science 2016-02-09 Symeon Chouvardas , Moez Draief

Distillation is the task of replacing a complicated machine learning model with a simpler model that approximates the original [BCNM06,HVD15]. Despite many practical applications, basic questions about the extent to which models can be…

Machine Learning · Computer Science 2024-05-07 Enric Boix-Adsera

Feature regression is a simple way to distill large neural network models to smaller ones. We show that with simple changes to the network architecture, regression can outperform more complex state-of-the-art approaches for knowledge…

Computer Vision and Pattern Recognition · Computer Science 2022-01-14 K L Navaneet , Soroush Abbasi Koohpayegani , Ajinkya Tejankar , Hamed Pirsiavash

Techniques such as ensembling and distillation promise model quality improvements when paired with almost any base model. However, due to increased test-time cost (for ensembles) and increased complexity of the training pipeline (for…

Machine Learning · Computer Science 2020-08-24 Rohan Anil , Gabriel Pereyra , Alexandre Passos , Robert Ormandi , George E. Dahl , Geoffrey E. Hinton

Knowledge distillation is classically a procedure where a neural network is trained on the output of another network along with the original targets in order to transfer knowledge between the architectures. The special case of…

Machine Learning · Computer Science 2021-10-18 Kenneth Borup , Lars N. Andersen

Dataset distillation enables the training of deep neural networks with comparable performance in significantly reduced time by compressing large datasets into small and representative ones. Although the introduction of generative models has…

Machine Learning · Computer Science 2025-05-27 Mingzhuo Li , Guang Li , Jiafeng Mao , Takahiro Ogawa , Miki Haseyama

Most machine learning algorithms, such as classification or regression, treat the individual data point as the object of interest. Here we consider extending machine learning algorithms to operate on groups of data points. We suggest…

Machine Learning · Computer Science 2021-01-15 Danica J. Sutherland , Liang Xiong , Barnabás Póczos , Jeff Schneider

Thermally coupled distillation is a new energy-saving method, but the traditional thermally coupled distillation simulation calculation process is complicated, and the optimization method based on the traditional simulation process is…

Chemical Physics · Physics 2021-02-09 ZhaoLan Zheng , Yu Qi
‹ Prev 1 2 3 10 Next ›