English
Related papers

Related papers: ILDAE: Instance-Level Difficulty Analysis of Evalu…

200 papers

The main objective of exams consists in performing an assessment of students' expertise on a specific subject. Such expertise, also referred to as skill or knowledge level, can then be leveraged in different ways (e.g., to assign a grade to…

Machine Learning · Computer Science 2020-01-22 Luca Benedetto , Andrea Cappelli , Roberto Turrin , Paolo Cremonesi

Traditional instance-based model analysis focuses mainly on misclassified instances. However, this approach overlooks the varying difficulty associated with different instances. Ideally, a robust model should recognize and reflect the…

Human-Computer Interaction · Computer Science 2025-07-02 Linhao Meng , Stef van den Elzen , Anna Vilanova

Curriculum learning strategies in prior multi-task learning approaches arrange datasets in a difficulty hierarchy either based on human perception or by exhaustively searching the optimal arrangement. However, human perception of difficulty…

Machine Learning · Computer Science 2022-05-30 Neeraj Varshney , Swaroop Mishra , Chitta Baral

Defect prediction is crucial for software quality assurance and has been extensively researched over recent decades. However, prior studies rarely focus on data complexity in defect prediction tasks, and even less on understanding the…

Software Engineering · Computer Science 2023-05-08 Xiaohui Wan , Zheng Zheng , Fangyun Qin , Xuhui Lu

Not all instances in a data set are equally beneficial for inducing a model of the data. Some instances (such as outliers or noise) can be detrimental. However, at least initially, the instances in a data set are generally considered…

Machine Learning · Statistics 2014-10-16 Michael R. Smith , Tony Martinez

Early exiting allows instances to exit at different layers according to the estimation of difficulty. Previous works usually adopt heuristic metrics such as the entropy of internal outputs to measure instance difficulty, which suffers from…

Computation and Language · Computer Science 2022-03-04 Tianxiang Sun , Xiangyang Liu , Wei Zhu , Zhichao Geng , Lingling Wu , Yilong He , Yuan Ni , Guotong Xie , Xuanjing Huang , Xipeng Qiu

Deep learning faces a formidable challenge when handling noisy labels, as models tend to overfit samples affected by label noise. This challenge is further compounded by the presence of instance-dependent noise (IDN), a realistic form of…

Computer Vision and Pattern Recognition · Computer Science 2026-01-12 Arpit Garg , Cuong Nguyen , Rafael Felix , Thanh-Toan Do , Gustavo Carneiro

Multi-instance learning attempts to learn from a training set consisting of labeled bags each containing many unlabeled instances. Previous studies typically treat the instances in the bags as independently and identically distributed.…

Machine Learning · Computer Science 2009-05-13 Zhi-Hua Zhou , Yu-Yin Sun , Yu-Feng Li

While much progress has been made in understanding the minimax sample complexity of reinforcement learning (RL) -- the complexity of learning on the "worst-case" instance -- such measures of complexity often do not capture the true…

Machine Learning · Computer Science 2023-07-21 Andrew Wagenmaker , Kevin Jamieson

We study oracle complexity of gradient based methods for stochastic approximation problems. Though in many settings optimal algorithms and tight lower bounds are known for such problems, these optimal algorithms do not achieve the best…

Optimization and Control · Mathematics 2022-06-20 Jingzhao Zhang , Hongzhou Lin , Subhro Das , Suvrit Sra , Ali Jadbabaie

Noise transition matrix (NTM) estimation is a promising approach for learning with label noise. It can infer clean posterior probabilities, known as Label Distribution (LD), based on noisy ones and reduce the impact of noisy labels.…

Computer Vision and Pattern Recognition · Computer Science 2024-11-05 Zehui Liao , Shishuai Hu , Yutong Xie , Yong Xia

Accurate estimates of item difficulty are essential for valid assessment and effective adaptive learning. However, for newly created tasks, response data are typically unavailable. Pretesting and expert judgement can be costly and slow,…

Widespread adoption of deep models has motivated a pressing need for approaches to interpret network outputs and to facilitate model debugging. Instance attribution methods constitute one means of accomplishing these goals by retrieving…

Computation and Language · Computer Science 2021-04-12 Pouya Pezeshkpour , Sarthak Jain , Byron C. Wallace , Sameer Singh

Modern NLP systems require high-quality annotated data. In specialized domains, expert annotations may be prohibitively expensive. An alternative is to rely on crowdsourcing to reduce costs at the risk of introducing noise. In this paper we…

Computation and Language · Computer Science 2019-05-21 Yinfei Yang , Oshin Agarwal , Chris Tar , Byron C. Wallace , Ani Nenkova

Existing work on understanding deep learning often employs measures that compress all data-dependent information into a few numbers. In this work, we adopt a perspective based on the role of individual examples. We introduce a measure of…

Machine Learning · Computer Science 2021-06-21 Robert J. N. Baldock , Hartmut Maennel , Behnam Neyshabur

While generalization over tasks from easy to hard is crucial to profile language models (LLMs), the datasets with fine-grained difficulty annotations for each problem across a broad range of complexity are still blank. Aiming to address…

Instruction tuning has emerged as a critical paradigm for improving the capabilities and alignment of large language models (LLMs). However, existing iterative model-aware data selection methods incur significant computational overhead, as…

Machine Learning · Computer Science 2025-05-13 Xiaotian Lin , Yanlin Qi , Yizhang Zhu , Themis Palpanas , Chengliang Chai , Nan Tang , Yuyu Luo

Standardized math assessments require expensive human pilot studies to establish the difficulty of test items. We investigate the predictive value of open-source large language models (LLMs) for evaluating the difficulty of multiple-choice…

Computation and Language · Computer Science 2026-04-22 Christabel Acquaye , Yi Ting Huang , Marine Carpuat , Rachel Rudinger

In-context learning (ICL) improves language models' performance on a variety of NLP tasks by simply demonstrating a handful of examples at inference time. It is not well understood why ICL ability emerges, as the model has never been…

Computation and Language · Computer Science 2023-06-28 Xiaochuang Han , Daniel Simig , Todor Mihaylov , Yulia Tsvetkov , Asli Celikyilmaz , Tianlu Wang

With the increasing ability of large language models (LLMs), in-context learning (ICL) has evolved as a new paradigm for natural language processing (NLP), where instead of fine-tuning the parameters of an LLM specific to a downstream task…

Information Retrieval · Computer Science 2024-05-03 Andrew Parry , Debasis Ganguly , Manish Chandra
‹ Prev 1 2 3 10 Next ›