Related papers: Machine Learning from Explanations

Iterative Label Improvement: Robust Training by Confidence Based Filtering and Dataset Partitioning

State-of-the-art, high capacity deep neural networks not only require large amounts of labelled training data, they are also highly susceptible to label errors in this data, typically resulting in large efforts and costs and therefore…

Machine Learning · Computer Science 2020-07-20 Christian Haase-Schütz, Rainer Stal, Heinz Hertlein, Bernhard Sick

Model Debiasing by Learnable Data Augmentation

Deep Neural Networks are well known for efficiently fitting training data, yet experiencing poor generalization capabilities whenever some kind of bias dominates over the actual task labels, resulting in models learning "shortcuts". In…

Machine Learning · Computer Science 2024-08-12 Pietro Morerio, Ruggero Ragonesi, Vittorio Murino

Active clustering for labeling training data

Gathering training data is a key step of any supervised learning task, and it is both critical and expensive. Critical, because the quantity and quality of the training data has a high impact on the performance of the learned function.…

Data Structures and Algorithms · Computer Science 2021-10-28 Quentin Lutz, Élie de Panafieu, Alex Scott, Maya Stein

Regularization Through Reasoning: Systematic Improvements in Language Model Classification via Explanation-Enhanced Fine-Tuning

Fine-tuning LLMs for classification typically maps inputs directly to labels. We ask whether attaching brief explanations to each label during fine-tuning yields better models. We evaluate conversational response quality along three axes:…

Machine Learning · Computer Science 2026-03-03 Vivswan Shah, Randy Cogill, Hanwei Yue, Gopinath Chennupati, Rinat Khaziev

Teaching Meaningful Explanations

The adoption of machine learning in high-stakes applications such as healthcare and law has lagged in part because predictions are not accompanied by explanations comprehensible to the domain user, who often holds the ultimate…

Artificial Intelligence · Computer Science 2018-09-12 Noel C. F. Codella, Michael Hind, Karthikeyan Natesan Ramamurthy, Murray Campbell, Amit Dhurandhar, Kush R. Varshney, Dennis Wei, Aleksandra Mojsilovic

Should We Always Train Models on Fine-Grained Classes?

In classification problems, models must predict a class label based on the input data features. However, class labels are organized hierarchically in many datasets. While a classification task is often defined at a specific level of this…

Machine Learning · Computer Science 2025-09-08 Davide Pirovano, Federico Milanesio, Michele Caselle, Piero Fariselli, Matteo Osella

You Need Reasoning to Learn Reasoning: The Limitations of Label-Free RL in Weak Base Models

Recent advances in large language models have demonstrated the promise of unsupervised reinforcement learning (RL) methods for enhancing reasoning capabilities without external supervision. However, the generalizability of these label-free…

Machine Learning · Computer Science 2025-11-10 Shuvendu Roy, Hossein Hajimirsadeghi, Mengyao Zhai, Golnoosh Samei

Active learning for binary classification with variable selection

Modern computing and communication technologies can make data collection procedures very efficient. However, our ability to analyze large data sets and/or to extract information out from them is hard-pressed to keep up with our capacities…

Machine Learning · Statistics 2019-01-30 Zhanfeng Wang, Yumi Kwon, Yuan-chin Ivan Chang

A Data Management Approach for Dataset Selection Using Human Computation

As the number of applications that use machine learning algorithms increases, the need for labeled data useful for training such algorithms intensifies. Getting labels typically involves employing humans to do the annotation, which directly…

Machine Learning · Computer Science 2013-07-16 Alexandros Ntoulas, Omar Alonso, Vasilis Kandylas

Learning Fast Matching Models from Weak Annotations

This paper proposes a novel training scheme for fast matching models in Search Ads, which is motivated by the real challenges in model training. The first challenge stems from the pursuit of high throughput, which prohibits the deployment…

Information Retrieval · Computer Science 2019-04-23 Xue Li, Zhipeng Luo, Hao Sun, Jianjin Zhang, Weihao Han, Xianqi Chu, Liangjie Zhang, Qi Zhang

Training Machine Learning Models by Regularizing their Explanations

Neural networks are among the most accurate supervised learning methods in use today. However, their opacity makes them difficult to trust in critical applications, especially when conditions in training may differ from those in practice.…

Machine Learning · Computer Science 2018-10-03 Andrew Slavin Ross

Evaluating and Crafting Datasets Effective for Deep Learning With Data Maps

Rapid development in deep learning model construction has prompted an increased need for appropriate training data. The popularity of large datasets - sometimes known as "big data" - has diverted attention from assessing their quality.…

Machine Learning · Computer Science 2022-10-25 Jay Bishnu, Andrew Gondoputro

Sample Efficient Learning of Image-Based Diagnostic Classifiers Using Probabilistic Labels

Deep learning approaches often require huge datasets to achieve good generalization. This complicates its use in tasks like image-based medical diagnosis, where the small training datasets are usually insufficient to learn appropriate data…

Computer Vision and Pattern Recognition · Computer Science 2021-02-12 Roberto Vega, Pouneh Gorji, Zichen Zhang, Xuebin Qin, Abhilash Rakkunedeth Hareendranathan, Jeevesh Kapur, Jacob L. Jaremko, Russell Greiner

Obtaining Explainable Classification Models using Distributionally Robust Optimization

Model explainability is crucial for human users to be able to interpret how a proposed classifier assigns labels to data based on its feature values. We study generalized linear models constructed using sets of feature value rules, which…

Machine Learning · Statistics 2023-11-06 Sanjeeb Dash, Soumyadip Ghosh, Joao Goncalves, Mark S. Squillante

Comparing the Value of Labeled and Unlabeled Data in Method-of-Moments Latent Variable Estimation

Labeling data for modern machine learning is expensive and time-consuming. Latent variable models can be used to infer labels from weaker, easier-to-acquire sources operating on unlabeled data. Such models can also be trained using labeled…

Machine Learning · Computer Science 2021-03-05 Mayee F. Chen, Benjamin Cohen-Wang, Stephen Mussmann, Frederic Sala, Christopher Ré

Data Consistency for Weakly Supervised Learning

In many applications, training machine learning models involves using large amounts of human-annotated data. Obtaining precise labels for the data is expensive. Instead, training with weak supervision provides a low-cost alternative. We…

Machine Learning · Computer Science 2022-02-09 Chidubem Arachie, Bert Huang

Coupled Training with Privileged Information and Unlabeled Data

In many prediction problems, we have extra information during training (for example, measurements that are expensive or slow to collect) that will not be available when the model is deployed. A common strategy is to first train a model that…

Machine Learning · Statistics 2026-05-25 Jiahao Shi, Omar Hagrass, Jason M. Klusowski

Few Shot Rationale Generation using Self-Training with Dual Teachers

Self-rationalizing models that also generate a free-text explanation for their predicted labels are an important tool to build trustworthy AI applications. Since generating explanations for annotated labels is a laborious and costly pro…

Computation and Language · Computer Science 2023-06-07 Aditya Srikanth Veerubhotla, Lahari Poddar, Jun Yin, György Szarvas, Sharanya Eswaran

Adding Seemingly Uninformative Labels Helps in Low Data Regimes

Evidence suggests that networks trained on large datasets generalize well not solely because of the numerous training examples, but also class diversity which encourages learning of enriched features. This raises the question of whether…

Computer Vision and Pattern Recognition · Computer Science 2020-08-12 Christos Matsoukas, Albert Bou I Hernandez, Yue Liu, Karin Dembrower, Gisele Miranda, Emir Konuk, Johan Fredin Haslum, Athanasios Zouzos, Peter Lindholm, Fredrik Strand, Kevin Smith

Reflective-Net: Learning from Explanations

We examine whether data generated by explanation techniques, which promote a process of self-reflection, can improve classifier performance. Our work is based on the idea that humans have the ability to make quick, intuitive decisions as…

Machine Learning · Computer Science 2025-03-05 Johannes Schneider, Michalis Vlachos