Related papers: Knowledge Amalgamation for Object Detection with T…

Contrastive Knowledge Amalgamation for Unsupervised Image Classification

Knowledge amalgamation (KA) aims to learn a compact student model to handle the joint objective from multiple teacher models that are are specialized for their own tasks respectively. Current methods focus on coarsely aligning teachers and…

Computer Vision and Pattern Recognition · Computer Science 2023-07-28 Shangde Gao , Yichao Fu , Ke Liu , Yuqiang Han

Class-Incremental Learning via Knowledge Amalgamation

Catastrophic forgetting has been a significant problem hindering the deployment of deep learning algorithms in the continual learning setting. Numerous methods have been proposed to address the catastrophic forgetting problem where an agent…

Machine Learning · Computer Science 2022-09-07 Marcus de Carvalho , Mahardhika Pratama , Jie Zhang , Yajuan San

Architectural Insights into Knowledge Distillation for Object Detection: A Comprehensive Review

Object detection has achieved remarkable accuracy through deep learning, yet these improvements often come with increased computational cost, limiting deployment on resource-constrained devices. Knowledge Distillation (KD) provides an…

Computer Vision and Pattern Recognition · Computer Science 2025-08-06 Mahdi Golizadeh , Nassibeh Golizadeh , Mohammad Ali Keyvanrad , Hossein Shirazi

Knowledge Amalgamation from Heterogeneous Networks by Common Feature Learning

An increasing number of well-trained deep networks have been released online by researchers and developers, enabling the community to reuse them in a plug-and-play way without accessing the training annotations. However, due to the large…

Machine Learning · Computer Science 2019-06-26 Sihui Luo , Xinchao Wang , Gongfan Fang , Yao Hu , Dapeng Tao , Mingli Song

Amalgamating Knowledge towards Comprehensive Classification

With the rapid development of deep learning, there have been an unprecedentedly large number of trained deep network models available online. Reusing such trained models can significantly reduce the cost of training the new models from…

Computer Vision and Pattern Recognition · Computer Science 2018-12-04 Chengchao Shen , Xinchao Wang , Jie Song , Li Sun , Mingli Song

Model Uncertainty-Aware Knowledge Amalgamation for Pre-Trained Language Models

As many fine-tuned pre-trained language models~(PLMs) with promising performance are generously released, investigating better ways to reuse these models is vital as it can greatly reduce the retraining computational cost and the potential…

Computation and Language · Computer Science 2021-12-15 Lei Li , Yankai Lin , Xuancheng Ren , Guangxiang Zhao , Peng Li , Jie Zhou , Xu Sun

Student Becoming the Master: Knowledge Amalgamation for Joint Scene Parsing, Depth Estimation, and More

In this paper, we investigate a novel deep-model reusing task. Our goal is to train a lightweight and versatile student model, without human-labelled annotations, that amalgamates the knowledge and masters the expertise of two pretrained…

Computer Vision and Pattern Recognition · Computer Science 2019-04-24 Jingwen Ye , Yixin Ji , Xinchao Wang , Kairi Ou , Dapeng Tao , Mingli Song

Distilling Image Classifiers in Object Detectors

Knowledge distillation constitutes a simple yet effective way to improve the performance of a compact student network by exploiting the knowledge of a more powerful teacher. Nevertheless, the knowledge distillation literature remains…

Computer Vision and Pattern Recognition · Computer Science 2022-02-11 Shuxuan Guo , Jose M. Alvarez , Mathieu Salzmann

Rethinking Centered Kernel Alignment in Knowledge Distillation

Knowledge distillation has emerged as a highly effective method for bridging the representation discrepancy between large-scale models and lightweight models. Prevalent approaches involve leveraging appropriate metrics to minimize the…

Computer Vision and Pattern Recognition · Computer Science 2024-05-01 Zikai Zhou , Yunhang Shen , Shitong Shao , Linrui Gong , Shaohui Lin

Distilling Object Detectors with Task Adaptive Regularization

Current state-of-the-art object detectors are at the expense of high computational costs and are hard to deploy to low-end devices. Knowledge distillation, which aims at training a smaller student network by transferring knowledge from a…

Computer Vision and Pattern Recognition · Computer Science 2020-06-24 Ruoyu Sun , Fuhui Tang , Xiaopeng Zhang , Hongkai Xiong , Qi Tian

Domain-invariant Progressive Knowledge Distillation for UAV-based Object Detection

Knowledge distillation (KD) is an effective method for compressing models in object detection tasks. Due to limited computational capability, UAV-based object detection (UAV-OD) widely adopt the KD technique to obtain lightweight detectors.…

Computer Vision and Pattern Recognition · Computer Science 2024-08-22 Liang Yao , Fan Liu , Chuanyi Zhang , Zhiquan Ou , Ting Wu

Incremental Object Detection via Meta-Learning

In a real-world setting, object instances from new classes can be continuously encountered by object detectors. When existing object detectors are applied to such scenarios, their performance on old classes deteriorates significantly. A few…

Computer Vision and Pattern Recognition · Computer Science 2021-12-16 K J Joseph , Jathushan Rajasegaran , Salman Khan , Fahad Shahbaz Khan , Vineeth N Balasubramanian

Learning Lightweight Object Detectors via Multi-Teacher Progressive Distillation

Resource-constrained perception systems such as edge computing and vision-for-robotics require vision models to be both accurate and lightweight in computation and memory usage. While knowledge distillation is a proven strategy to enhance…

Computer Vision and Pattern Recognition · Computer Science 2023-08-21 Shengcao Cao , Mengtian Li , James Hays , Deva Ramanan , Yi-Xiong Wang , Liang-Yan Gui

Self-Regulated Data-Free Knowledge Amalgamation for Text Classification

Recently, there has been a growing availability of pre-trained text models on various model repositories. These models greatly reduce the cost of training new models from scratch as they can be fine-tuned for specific tasks or trained on…

Computation and Language · Computer Science 2024-06-25 Prashanth Vijayaraghavan , Hongzhi Wang , Luyao Shi , Tyler Baldwin , David Beymer , Ehsan Degan

Federated Selective Aggregation for Knowledge Amalgamation

In this paper, we explore a new knowledge-amalgamation problem, termed Federated Selective Aggregation (FedSA). The goal of FedSA is to train a student model for a new task with the help of several decentralized teachers, whose pre-training…

Computer Vision and Pattern Recognition · Computer Science 2022-07-28 Donglin Xie , Ruonan Yu , Gongfan Fang , Jie Song , Zunlei Feng , Xinchao Wang , Li Sun , Mingli Song

Transformer with Peak Suppression and Knowledge Guidance for Fine-grained Image Recognition

Fine-grained image recognition is challenging because discriminative clues are usually fragmented, whether from a single image or multiple images. Despite their significant improvements, most existing methods still focus on the most…

Multimedia · Computer Science 2022-06-07 Xinda Liu , Lili Wang , Xiaoguang Han

GAN-Knowledge Distillation for one-stage Object Detection

Convolutional neural networks have a significant improvement in the accuracy of Object detection. As convolutional neural networks become deeper, the accuracy of detection is also obviously improved, and more floating-point calculations are…

Computer Vision and Pattern Recognition · Computer Science 2019-07-05 Wei Hong , Jin ke Yu Fan Zong

Cross Architecture Distillation for Face Recognition

Transformers have emerged as the superior choice for face recognition tasks, but their insufficient platform acceleration hinders their application on mobile devices. In contrast, Convolutional Neural Networks (CNNs) capitalize on…

Computer Vision and Pattern Recognition · Computer Science 2023-06-27 Weisong Zhao , Xiangyu Zhu , Zhixiang He , Xiao-Yu Zhang , Zhen Lei

MixedTeacher : Knowledge Distillation for fast inference textural anomaly detection

For a very long time, unsupervised learning for anomaly detection has been at the heart of image processing research and a stepping stone for high performance industrial automation process. With the emergence of CNN, several methods have…

Computer Vision and Pattern Recognition · Computer Science 2023-06-19 Simon Thomine , Hichem Snoussi , Mahmoud Soua

KAT: A Knowledge Augmented Transformer for Vision-and-Language

The primary focus of recent work with largescale transformers has been on optimizing the amount of information packed into the model's parameters. In this work, we ask a different question: Can multimodal transformers leverage explicit…

Computation and Language · Computer Science 2022-05-06 Liangke Gui , Borui Wang , Qiuyuan Huang , Alex Hauptmann , Yonatan Bisk , Jianfeng Gao