Related papers: ILDAE: Instance-Level Difficulty Analysis of Evalu…

R2DE: a NLP approach to estimating IRT parameters of newly generated questions

The main objective of exams consists in performing an assessment of students' expertise on a specific subject. Such expertise, also referred to as skill or knowledge level, can then be leveraged in different ways (e.g., to assign a grade to…

Machine Learning · Computer Science 2020-01-22 Luca Benedetto , Andrea Cappelli , Roberto Turrin , Paolo Cremonesi

Towards Difficulty-Aware Analysis of Deep Neural Networks

Traditional instance-based model analysis focuses mainly on misclassified instances. However, this approach overlooks the varying difficulty associated with different instances. Ideally, a robust model should recognize and reflect the…

Human-Computer Interaction · Computer Science 2025-07-02 Linhao Meng , Stef van den Elzen , Anna Vilanova

Let the Model Decide its Curriculum for Multitask Learning

Curriculum learning strategies in prior multi-task learning approaches arrange datasets in a difficulty hierarchy either based on human perception or by exhaustively searching the optimal arrangement. However, human perception of difficulty…

Machine Learning · Computer Science 2022-05-30 Neeraj Varshney , Swaroop Mishra , Chitta Baral

Data Complexity: A New Perspective for Analyzing the Difficulty of Defect Prediction Tasks

Defect prediction is crucial for software quality assurance and has been extensively researched over recent decades. However, prior studies rarely focus on data complexity in defect prediction tasks, and even less on understanding the…

Software Engineering · Computer Science 2023-05-08 Xiaohui Wan , Zheng Zheng , Fangyun Qin , Xuhui Lu

Reducing the Effects of Detrimental Instances

Not all instances in a data set are equally beneficial for inducing a model of the data. Some instances (such as outliers or noise) can be detrimental. However, at least initially, the instances in a data set are generally considered…

Machine Learning · Statistics 2014-10-16 Michael R. Smith , Tony Martinez

A Simple Hash-Based Early Exiting Approach For Language Understanding and Generation

Early exiting allows instances to exit at different layers according to the estimation of difficulty. Previous works usually adopt heuristic metrics such as the entropy of internal outputs to measure instance difficulty, which suffers from…

Computation and Language · Computer Science 2022-03-04 Tianxiang Sun , Xiangyang Liu , Wei Zhu , Zhichao Geng , Lingling Wu , Yilong He , Yuan Ni , Guotong Xie , Xuanjing Huang , Xipeng Qiu

Instance-dependent Noisy-label Learning with Graphical Model Based Noise-rate Estimation

Deep learning faces a formidable challenge when handling noisy labels, as models tend to overfit samples affected by label noise. This challenge is further compounded by the presence of instance-dependent noise (IDN), a realistic form of…

Computer Vision and Pattern Recognition · Computer Science 2026-01-12 Arpit Garg , Cuong Nguyen , Rafael Felix , Thanh-Toan Do , Gustavo Carneiro

Multi-Instance Learning by Treating Instances As Non-I.I.D. Samples

Multi-instance learning attempts to learn from a training set consisting of labeled bags each containing many unlabeled instances. Previous studies typically treat the instances in the bags as independently and identically distributed.…

Machine Learning · Computer Science 2009-05-13 Zhi-Hua Zhou , Yu-Yin Sun , Yu-Feng Li

Instance-Dependent Near-Optimal Policy Identification in Linear MDPs via Online Experiment Design

While much progress has been made in understanding the minimax sample complexity of reinforcement learning (RL) -- the complexity of learning on the "worst-case" instance -- such measures of complexity often do not capture the true…

Machine Learning · Computer Science 2023-07-21 Andrew Wagenmaker , Kevin Jamieson

Beyond Worst-Case Analysis in Stochastic Approximation: Moment Estimation Improves Instance Complexity

We study oracle complexity of gradient based methods for stochastic approximation problems. Though in many settings optimal algorithms and tight lower bounds are known for such problems, these optimal algorithms do not achieve the best…

Optimization and Control · Mathematics 2022-06-20 Jingzhao Zhang , Hongzhou Lin , Subhro Das , Suvrit Sra , Ali Jadbabaie

Instance-dependent Label Distribution Estimation for Learning with Label Noise

Noise transition matrix (NTM) estimation is a promising approach for learning with label noise. It can infer clean posterior probabilities, known as Label Distribution (LD), based on noisy ones and reduce the impact of noisy labels.…

Computer Vision and Pattern Recognition · Computer Science 2024-11-05 Zehui Liao , Shishuai Hu , Yutong Xie , Yong Xia

Estimating Item Difficulty with Large Language Models as Experts

Accurate estimates of item difficulty are essential for valid assessment and effective adaptive learning. However, for newly created tasks, response data are typically unavailable. Pretesting and expert judgement can be costly and slow,…

Methodology · Statistics 2026-05-19 Diana Kolesnikova , Kirill Fedyanin , Abe D. Hofman , Matthieu J. S. Brinkhuis , Maria Bolsinova

An Empirical Comparison of Instance Attribution Methods for NLP

Widespread adoption of deep models has motivated a pressing need for approaches to interpret network outputs and to facilitate model debugging. Instance attribution methods constitute one means of accomplishing these goals by retrieving…

Computation and Language · Computer Science 2021-04-12 Pouya Pezeshkpour , Sarthak Jain , Byron C. Wallace , Sameer Singh

Predicting Annotation Difficulty to Improve Task Routing and Model Performance for Biomedical Information Extraction

Modern NLP systems require high-quality annotated data. In specialized domains, expert annotations may be prohibitively expensive. An alternative is to rely on crowdsourcing to reduce costs at the risk of introducing noise. In this paper we…

Computation and Language · Computer Science 2019-05-21 Yinfei Yang , Oshin Agarwal , Chris Tar , Byron C. Wallace , Ani Nenkova

Deep Learning Through the Lens of Example Difficulty

Existing work on understanding deep learning often employs measures that compress all data-dependent information into a few numbers. In this work, we adopt a perspective based on the role of individual examples. We introduce a measure of…

Machine Learning · Computer Science 2021-06-21 Robert J. N. Baldock , Hartmut Maennel , Behnam Neyshabur

Easy2Hard-Bench: Standardized Difficulty Labels for Profiling LLM Performance and Generalization

While generalization over tasks from easy to hard is crucial to profile language models (LLMs), the datasets with fine-grained difficulty annotations for each problem across a broad range of complexity are still blank. Aiming to address…

Machine Learning · Computer Science 2025-06-10 Mucong Ding , Chenghao Deng , Jocelyn Choo , Zichu Wu , Aakriti Agrawal , Avi Schwarzschild , Tianyi Zhou , Tom Goldstein , John Langford , Anima Anandkumar , Furong Huang

LEAD: Iterative Data Selection for Efficient LLM Instruction Tuning

Instruction tuning has emerged as a critical paradigm for improving the capabilities and alignment of large language models (LLMs). However, existing iterative model-aware data selection methods incur significant computational overhead, as…

Machine Learning · Computer Science 2025-05-13 Xiaotian Lin , Yanlin Qi , Yizhang Zhu , Themis Palpanas , Chengliang Chai , Nan Tang , Yuyu Luo

Take Out Your Calculators: Estimating the Real Difficulty of Question Items with LLM Student Simulations

Standardized math assessments require expensive human pilot studies to establish the difficulty of test items. We investigate the predictive value of open-source large language models (LLMs) for evaluating the difficulty of multiple-choice…

Computation and Language · Computer Science 2026-04-22 Christabel Acquaye , Yi Ting Huang , Marine Carpuat , Rachel Rudinger

Understanding In-Context Learning via Supportive Pretraining Data

In-context learning (ICL) improves language models' performance on a variety of NLP tasks by simply demonstrating a handful of examples at inference time. It is not well understood why ICL ability emerges, as the model has never been…

Computation and Language · Computer Science 2023-06-28 Xiaochuang Han , Daniel Simig , Todor Mihaylov , Yulia Tsvetkov , Asli Celikyilmaz , Tianlu Wang

"In-Context Learning" or: How I learned to stop worrying and love "Applied Information Retrieval"

With the increasing ability of large language models (LLMs), in-context learning (ICL) has evolved as a new paradigm for natural language processing (NLP), where instead of fine-tuning the parameters of an LLM specific to a downstream task…

Information Retrieval · Computer Science 2024-05-03 Andrew Parry , Debasis Ganguly , Manish Chandra