Related papers: Measuring Discrimination to Boost Comparative Test…

Sampling for Deep Learning Model Diagnosis (Technical Report)

Deep learning (DL) models have achieved paradigm-changing performance in many fields with high dimensional data, such as images, audio, and text. However, the black-box nature of deep neural networks is a barrier not just to adoption in…

Machine Learning · Computer Science 2020-02-25 Parmita Mehta , Stephen Portillo , Magdalena Balazinska , Andrew Connolly

Reinforced Data Sampling for Model Diversification

With the rising number of machine learning competitions, the world has witnessed an exciting race for the best algorithms. However, the involved data selection process may fundamentally suffer from evidence ambiguity and concept drift…

Machine Learning · Computer Science 2020-06-15 Hoang D. Nguyen , Xuan-Son Vu , Quoc-Tuan Truong , Duc-Trong Le

On the Use of Deep Learning in Software Defect Prediction

Context: Automated software defect prediction (SDP) methods are increasingly applied, often with the use of machine learning (ML) techniques. Yet, the existing ML-based approaches require manually extracted features, which are cumbersome,…

Software Engineering · Computer Science 2022-10-06 Görkem Giray , Kwabena Ebo Bennin , Ömer Köksal , Önder Babur , Bedir Tekinerdogan

Deep Neural Network Benchmarks for Selective Classification

With the increasing deployment of machine learning models in many socially sensitive tasks, there is a growing demand for reliable and trustworthy predictions. One way to accomplish these requirements is to allow a model to abstain from…

Machine Learning · Computer Science 2024-09-19 Andrea Pugnana , Lorenzo Perini , Jesse Davis , Salvatore Ruggieri

Testing Deep Learning Models: A First Comparative Study of Multiple Testing Techniques

Deep Learning (DL) has revolutionized the capabilities of vision-based systems (VBS) in critical applications such as autonomous driving, robotic surgery, critical infrastructure surveillance, air and maritime traffic control, etc. By…

Software Engineering · Computer Science 2022-07-12 Mohit Kumar Ahuja , Arnaud Gotlieb , Helge Spieker

Selective sampling for accelerating training of deep neural networks

We present a selective sampling method designed to accelerate the training of deep neural networks. To this end, we introduce a novel measurement, the minimal margin score (MMS), which measures the minimal amount of displacement an input…

Machine Learning · Computer Science 2019-11-19 Berry Weinstein , Shai Fine , Yacov Hel-Or

Enhancing Semi-Supervised Learning via Representative and Diverse Sample Selection

Semi-Supervised Learning (SSL) has become a preferred paradigm in many deep learning tasks, which reduces the need for human labor. Previous studies primarily focus on effectively utilising the labelled and unlabeled data to improve…

Machine Learning · Computer Science 2024-10-29 Qian Shao , Jiangrui Kang , Qiyuan Chen , Zepeng Li , Hongxia Xu , Yiwen Cao , Jiajuan Liang , Jian Wu

Learnable Sampler Distillation for Discrete Diffusion Models

Discrete diffusion models (DDMs) have shown powerful generation ability for discrete data modalities like text and molecules. However, their practical application is hindered by inefficient sampling, requiring a large number of sampling…

Machine Learning · Computer Science 2025-09-25 Feiyang Fu , Tongxian Guo , Zhaoqiang Liu

Decision making via semi-supervised machine learning techniques

Semi-supervised learning (SSL) is a class of supervised learning tasks and techniques that also exploits the unlabeled data for training. SSL significantly reduces labeling related costs and is able to handle large data sets. The primary…

Machine Learning · Computer Science 2016-06-30 Eftychios Protopapadakis

Deep Learning-based Sentiment Classification: A Comparative Survey

Recently, Deep Learning (DL) approaches have been applied to solve the Sentiment Classification (SC) problem, which is a core task in reviews mining or Sentiment Analysis (SA). The performances of these approaches are affected by different…

Computation and Language · Computer Science 2024-01-01 Mohamed Kayed , Rebeca P. Díaz-Redondo , Alhassan Mabrouk

Exploring Test-time Scaling via Prediction Merging on Large-Scale Recommendation

Inspired by the success of language models (LM), scaling up deep learning recommendation systems (DLRS) has become a recent trend in the community. All previous methods tend to scale up the model parameters during training time. However,…

Information Retrieval · Computer Science 2025-12-09 Fuyuan Lyu , Zhentai Chen , Jingyan Jiang , Lingjie Li , Xing Tang , Xiuqiang He , Xue Liu

A Survey on Active Deep Learning: From Model-driven to Data-driven

Which samples should be labelled in a large data set is one of the most important problems for trainingof deep learning. So far, a variety of active sample selection strategies related to deep learning havebeen proposed in many literatures.…

Machine Learning · Computer Science 2022-02-09 Peng Liu , Lizhe Wang , Guojin He , Lei Zhao

A Survey of Sample-Efficient Deep Learning for Change Detection in Remote Sensing: Tasks, Strategies, and Challenges

In the last decade, the rapid development of deep learning (DL) has made it possible to perform automatic, accurate, and robust Change Detection (CD) on large volumes of Remote Sensing Images (RSIs). However, despite advances in CD methods,…

Computer Vision and Pattern Recognition · Computer Science 2025-02-06 Lei Ding , Danfeng Hong , Maofan Zhao , Hongruixuan Chen , Chenyu Li , Jie Deng , Naoto Yokoya , Lorenzo Bruzzone , Jocelyn Chanussot

An Evolutionary-Based Approach to Learning Multiple Decision Models from Underrepresented Data

The use of multiple Decision Models (DMs) enables to enhance the accuracy in decisions and at the same time allows users to evaluate the confidence in decision making. In this paper we explore the ability of multiple DMs to learn from a…

Artificial Intelligence · Computer Science 2008-05-27 Vitaly Schetinin , Dayou Li , Carsten Maple

Differential-informed Sample Selection Accelerates Multimodal Contrastive Learning

The remarkable success of contrastive-learning-based multimodal models has been greatly driven by training on ever-larger datasets with expensive compute consumption. Sample selection as an alternative efficient paradigm plays an important…

Computer Vision and Pattern Recognition · Computer Science 2025-07-18 Zihua Zhao , Feng Hong , Mengxi Chen , Pengyi Chen , Benyuan Liu , Jiangchao Yao , Ya Zhang , Yanfeng Wang

Diversified Batch Selection for Training Acceleration

The remarkable success of modern machine learning models on large datasets often demands extensive training time and resource consumption. To save cost, a prevalent research line, known as online batch selection, explores selecting…

Machine Learning · Computer Science 2024-06-10 Feng Hong , Yueming Lyu , Jiangchao Yao , Ya Zhang , Ivor W. Tsang , Yanfeng Wang

Deep Co-Space: Sample Mining Across Feature Transformation for Semi-Supervised Learning

Aiming at improving performance of visual classification in a cost-effective manner, this paper proposes an incremental semi-supervised learning paradigm called Deep Co-Space (DCS). Unlike many conventional semi-supervised learning methods…

Computer Vision and Pattern Recognition · Computer Science 2017-08-01 Ziliang Chen , Keze Wang , Xiao Wang , Pai Peng , Ebroul Izquierdo , Liang Lin

Enhancing Classification with Semi-Supervised Deep Learning Using Distance-Based Sample Weights

Recent advancements in semi-supervised deep learning have introduced effective strategies for leveraging both labeled and unlabeled data to improve classification performance. This work proposes a semi-supervised framework that utilizes a…

Machine Learning · Computer Science 2025-05-21 Aydin Abedinia , Shima Tabakhi , Vahid Seydi

Optimizing Data Usage via Differentiable Rewards

To acquire a new skill, humans learn better and faster if a tutor, based on their current knowledge level, informs them of how much attention they should pay to particular content or practice problems. Similarly, a machine learning model…

Machine Learning · Computer Science 2021-06-18 Xinyi Wang , Hieu Pham , Paul Michel , Antonios Anastasopoulos , Jaime Carbonell , Graham Neubig

DsDm: Model-Aware Dataset Selection with Datamodels

When selecting data for training large-scale models, standard practice is to filter for examples that match human notions of data quality. Such filtering yields qualitatively clean datapoints that intuitively should improve model behavior.…

Machine Learning · Computer Science 2024-01-24 Logan Engstrom , Axel Feldmann , Aleksander Madry