Related papers: Deep Probabilistic Supervision for Image Classific…

Regularizing Class-wise Predictions via Self-knowledge Distillation

Deep neural networks with millions of parameters may suffer from poor generalization due to overfitting. To mitigate the issue, we propose a new regularization method that penalizes the predictive distribution between similar samples. In…

Machine Learning · Computer Science 2020-04-08 Sukmin Yun, Jongjin Park, Kimin Lee, Jinwoo Shin

SDHSI-Net: Learning Better Representations for Hyperspectral Images via Self-Distillation

Hyperspectral image (HSI) classification presents unique challenges due to its high spectral dimensionality and limited labeled data. Traditional deep learning models often suffer from overfitting and high computational costs.…

Computer Vision and Pattern Recognition · Computer Science 2026-01-13 Prachet Dev Singh, Shyamsundar Paramasivam, Sneha Barman, Mainak Singha, Ankit Jha, Girish Mishra, Biplab Banerjee

DeepUSPS: Deep Robust Unsupervised Saliency Prediction With Self-Supervision

Deep neural network (DNN) based salient object detection in images based on high-quality labels is expensive. Alternative unsupervised approaches rely on careful selection of multiple handcrafted saliency methods to generate noisy…

Computer Vision and Pattern Recognition · Computer Science 2021-03-16 Duc Tam Nguyen, Maximilian Dax, Chaithanya Kumar Mummadi, Thi Phuong Nhung Ngo, Thi Hoai Phuong Nguyen, Zhongyu Lou, Thomas Brox

Self-Distilled Self-Supervised Representation Learning

State-of-the-art frameworks in self-supervised learning have recently shown that fully utilizing transformer-based models can lead to performance boost compared to conventional CNN models. Striving to maximize the mutual information of two…

Computer Vision and Pattern Recognition · Computer Science 2022-11-29 Jiho Jang, Seonhoon Kim, Kiyoon Yoo, Chaerin Kong, Jangho Kim, Nojun Kwak

Boost Self-Supervised Dataset Distillation via Parameterization, Predefined Augmentation, and Approximation

Although larger datasets are crucial for training large deep models, the rapid growth of dataset size has brought a significant challenge in terms of considerable training costs, which even results in prohibitive computational expenses.…

Computer Vision and Pattern Recognition · Computer Science 2025-08-06 Sheng-Feng Yu, Jia-Jiun Yao, Wei-Chen Chiu

Self-Distillation Improves DNA Sequence Inference

Self-supervised pretraining (SSP) has been recognized as a method to enhance prediction accuracy in various downstream tasks. However, its efficacy for DNA sequences remains somewhat constrained. This limitation stems primarily from the…

Machine Learning · Computer Science 2024-05-15 Tong Yu, Lei Cheng, Ruslan Khalitov, Erland Brandser Olsson, Zhirong Yang

Self-supervised self-supervision by combining deep learning and probabilistic logic

Labeling training examples at scale is a perennial challenge in machine learning. Self-supervision methods compensate for the lack of direct supervision by leveraging prior knowledge to automatically generate noisy labeled examples. Deep…

Machine Learning · Computer Science 2020-12-24 Hunter Lang, Hoifung Poon

Self-supervised Learning of Geometrically Stable Features Through Probabilistic Introspection

Self-supervision can dramatically cut back the amount of manually-labelled data required to train deep neural networks. While self-supervision has usually been considered for tasks such as image classification, in this paper we aim at…

Computer Vision and Pattern Recognition · Computer Science 2018-04-06 David Novotny, Samuel Albanie, Diane Larlus, Andrea Vedaldi

Self-Knowledge Distillation with Progressive Refinement of Targets

The generalization capability of deep neural networks has been substantially improved by applying a wide spectrum of regularization methods, e.g., restricting function space, injecting randomness during training, augmenting data, etc. In…

Machine Learning · Computer Science 2021-10-08 Kyungyul Kim, ByeongMoon Ji, Doyoung Yoon, Sangheum Hwang

Deep Clustering with Diffused Sampling and Hardness-aware Self-distillation

Deep clustering has gained significant attention due to its capability in learning clustering-friendly representations without labeled data. However, previous deep clustering methods tend to treat all samples equally, which neglect the…

Computer Vision and Pattern Recognition · Computer Science 2024-09-23 Hai-Xin Zhang, Dong Huang

Self-Knowledge Distillation in Natural Language Processing

Since deep learning became a key player in natural language processing (NLP), many deep learning models have been showing remarkable performances in a variety of NLP tasks, and in some cases, they are even outperforming humans. Such high…

Computation and Language · Computer Science 2019-08-07 Sangchul Hahn, Heeyoul Choi

Dropout Distillation for Efficiently Estimating Model Confidence

We propose an efficient way to output better calibrated uncertainty scores from neural networks. The Distilled Dropout Network (DDN) makes standard (non-Bayesian) neural networks more introspective by adding a new training loss which…

Computer Vision and Pattern Recognition · Computer Science 2018-09-28 Corina Gurau, Alex Bewley, Ingmar Posner

Toward Robust Semi-supervised Regression via Dual-stream Knowledge Distillation

Semi-supervised regression (SSR), which aims to predict continuous scores for samples while reducing the reliance on large-scale labeled data, has recently attracted considerable attention across various applications, including computer…

Machine Learning · Computer Science 2026-05-28 Ye Su, Hezhe Qiao, Wei Huang, Lin Chen

Dataset Distillation for Pre-Trained Self-Supervised Vision Models

The task of dataset distillation aims to find a small set of synthetic images such that training a model on them reproduces the performance of the same model trained on a much larger dataset of real samples. Existing distillation methods…

Computer Vision and Pattern Recognition · Computer Science 2025-11-21 George Cazenavette, Antonio Torralba, Vincent Sitzmann

Self-Supervised Learning for Large-Scale Unsupervised Image Clustering

Unsupervised learning has always been appealing to machine learning researchers and practitioners, allowing them to avoid an expensive and complicated process of labeling the data. However, unsupervised learning of complex data is…

Computer Vision and Pattern Recognition · Computer Science 2020-11-10 Evgenii Zheltonozhskii, Chaim Baskin, Alex M. Bronstein, Avi Mendelson

Interpret the Predictions of Deep Networks via Re-Label Distillation

Interpreting the predictions of a black-box deep network can facilitate the reliability of its deployment. In this work, we propose a re-label distillation approach to learn a direct map from the input to the prediction in a…

Computer Vision and Pattern Recognition · Computer Science 2024-09-23 Yingying Hua, Shiming Ge, Daichi Zhang

Self-Distribution Distillation: Efficient Uncertainty Estimation

Deep learning is increasingly being applied in safety-critical domains. For these scenarios it is important to know the level of uncertainty in a model's prediction to ensure appropriate decisions are made by the system. Deep ensembles are…

Machine Learning · Computer Science 2022-03-17 Yassir Fathullah, Mark J. F. Gales

Difficulty-guided Sampling: Bridging the Target Gap between Dataset Distillation and Downstream Tasks

In this paper, we propose difficulty-guided sampling (DGS) to bridge the target gap between the distillation objective and the downstream task, therefore improving the performance of dataset distillation. Deep neural networks achieve…

Computer Vision and Pattern Recognition · Computer Science 2026-01-16 Mingzhuo Li, Guang Li, Linfeng Ye, Jiafeng Mao, Takahiro Ogawa, Konstantinos N. Plataniotis, Miki Haseyama

Beyond Self-Supervision: A Simple Yet Effective Network Distillation Alternative to Improve Backbones

Recently, research efforts have been concentrated on revealing how pre-trained model makes a difference in neural network performance. Self-supervision and semi-supervised learning technologies have been extensively explored by the…

Computer Vision and Pattern Recognition · Computer Science 2021-03-11 Cheng Cui, Ruoyu Guo, Yuning Du, Dongliang He, Fu Li, Zewu Wu, Qiwen Liu, Shilei Wen, Jizhou Huang, Xiaoguang Hu, Dianhai Yu, Errui Ding, Yanjun Ma

ISD: Self-Supervised Learning by Iterative Similarity Distillation

Recently, contrastive learning has achieved great results in self-supervised learning, where the main idea is to push two augmentations of an image (positive pairs) closer compared to other random images (negative pairs). We argue that not…

Computer Vision and Pattern Recognition · Computer Science 2021-09-13 Ajinkya Tejankar, Soroush Abbasi Koohpayegani, Vipin Pillai, Paolo Favaro, Hamed Pirsiavash