Related papers: Comprehensive Algorithm Portfolio Evaluation using…

An Item Response Theory-based R Module for Algorithm Portfolio Analysis

Experimental evaluation is crucial in AI research, especially for assessing algorithms across diverse tasks. Many studies often evaluate a limited set of algorithms, failing to fully understand their strengths and weaknesses within a…

Machine Learning · Computer Science 2025-09-04 Brodie Oldfield, Sevvandi Kandanaarachchi, Ziqi Xu, Mario Andrés Muñoz

On Evaluation of Vision Datasets and Models using Human Competency Frameworks

Evaluating models and datasets in computer vision remains a challenging task, with most leaderboards relying solely on accuracy. While accuracy is a popular metric for model evaluation, it provides only a coarse assessment by considering a…

Computer Vision and Pattern Recognition · Computer Science 2024-09-09 Rahul Ramachandran, Tejal Kulkarni, Charchit Sharma, Deepak Vijaykeerthy, Vineeth N Balasubramanian

Item Response Theory based Ensemble in Machine Learning

In this article, we propose a novel probabilistic framework to improve the accuracy of a weighted majority voting algorithm. In order to assign higher weights to the classifiers which can correctly classify hard-to-classify instances, we…

Machine Learning · Statistics 2019-11-13 Ziheng Chen, Hongshik Ahn

$\beta^3$-IRT: A New Item Response Model and its Applications

Item Response Theory (IRT) aims to assess latent abilities of respondents based on the correctness of their answers in aptitude test items with different difficulty levels. In this paper, we propose the $\beta^3$-IRT model, which models…

Machine Learning · Statistics 2019-06-04 Yu Chen, Telmo Silva Filho, Ricardo B. C. Prudêncio, Tom Diethe, Peter Flach

Fairness Evaluation with Item Response Theory

Item Response Theory (IRT) has been widely used in educational psychometrics to assess student ability, as well as the difficulty and discrimination of test questions. In this context, discrimination specifically refers to how effectively a…

Computers and Society · Computer Science 2024-11-06 Ziqi Xu, Sevvandi Kandanaarachchi, Cheng Soon Ong, Eirini Ntoutsi

Beyond Random Sampling: Instance Quality-Based Data Partitioning via Item Response Theory

Robust validation of Machine Learning (ML) models is essential, but traditional data partitioning approaches often ignore the intrinsic quality of each instance. This study proposes the use of Item Response Theory (IRT) parameters to…

Machine Learning · Computer Science 2025-08-15 Lucas Cardoso, Vitor Santos, José Ribeiro Filho, Ricardo Prudêncio, Regiane Kawasaki, Ronnie Alves

Enhancing Item Response Theory for Cognitive Diagnosis

Cognitive diagnosis is a fundamental and crucial task in many educational applications, e.g., computer adaptive test and cognitive assignments. Item Response Theory (IRT) is a classical cognitive diagnosis method which can provide…

Artificial Intelligence · Computer Science 2019-12-03 Song Cheng, Qi Liu

Scalable Learning of Item Response Theory Models

Item Response Theory (IRT) models aim to assess latent abilities of $n$ examinees along with latent difficulty characteristics of $m$ test items from categorical data that indicates the quality of their corresponding answers. Classical…

Machine Learning · Computer Science 2024-08-16 Susanne Frick, Amer Krivošija, Alexander Munteanu

Modeling Item Response Theory with Stochastic Variational Inference

Item Response Theory (IRT) is a ubiquitous model for understanding human behaviors and attitudes based on their responses to questions. Large modern datasets offer opportunities to capture more nuances in human behavior, potentially…

Machine Learning · Computer Science 2022-07-29 Mike Wu, Richard L. Davis, Benjamin W. Domingue, Chris Piech, Noah Goodman

Variational Item Response Theory: Fast, Accurate, and Expressive

Item Response Theory (IRT) is a ubiquitous model for understanding humans based on their responses to questions, used in fields as diverse as education, medicine and psychology. Large modern datasets offer opportunities to capture more…

Machine Learning · Computer Science 2020-03-17 Mike Wu, Richard L. Davis, Benjamin W. Domingue, Chris Piech, Noah Goodman

Item Response Theory -- A Statistical Framework for Educational and Psychological Measurement

Item response theory (IRT) has become one of the most popular statistical models for psychometrics, a field of study concerned with the theory and techniques of psychological measurement. The IRT models are latent factor models tailored to…

Methodology · Statistics 2021-08-20 Yunxiao Chen, Xiaoou Li, Jingchen Liu, Zhiliang Ying

Extending Item Response Theory to Online Homework

Item Response Theory becomes an increasingly important tool when analyzing "Big Data" gathered from online educational venues. However, the mechanism was originally developed in traditional exam settings, and several of its assumptions…

Physics Education · Physics 2014-05-29 Gerd Kortemeyer

Analyzing Force Concept Inventory with Item Response Theory

Item Response Theory (IRT) is a popular assessment method used in education measurement, which builds on an assumption of a probability framework connecting students' innate ability and their actual performances on test items. The model…

Physics Education · Physics 2015-05-19 Jing Wang, Lei Bao

Building an Evaluation Scale using Item Response Theory

Evaluation of NLP methods requires testing against a previously vetted gold-standard test set and reporting standard metrics (accuracy/precision/recall/F1). The current assumption is that all items in a given test set are equal with regards…

Computation and Language · Computer Science 2016-09-26 John P. Lalor, Hao Wu, Hong Yu

AutoIRT: Calibrating Item Response Theory Models with Automated Machine Learning

Item response theory (IRT) is a class of interpretable factor models that are widely used in computerized adaptive tests (CATs), such as language proficiency tests. Traditionally, these are fit using parametric mixed effects models on the…

Machine Learning · Computer Science 2024-09-16 James Sharpnack, Phoebe Mulcaire, Klinton Bicknell, Geoff LaFlair, Kevin Yancey

A New Item Response Theory Model for Open-Ended Online Homework with Multiple Allowed Attempts

Item Response Theory (IRT) was originally developed in traditional exam settings, and it has been shown that the model does not readily transfer to formative assessment in the form of online homework. We investigate if this is mostly due to…

Physics Education · Physics 2015-03-24 Emre Gönülateş, Gerd Kortemeyer

Standing on the shoulders of giants

Although fundamental to the advancement of Machine Learning, the classic evaluation metrics extracted from the confusion matrix, such as precision and F1, are limited. Such metrics only offer a quantitative view of the models' performance,…

Machine Learning · Computer Science 2024-09-09 Lucas Felipe Ferraro Cardoso, José de Sousa Ribeiro Filho, Vitor Cirilo Araujo Santos, Regiane Silva Kawasaki Frances, Ronnie Cley de Oliveira Alves

Amortised Design Optimization for Item Response Theory

Item Response Theory (IRT) is a well known method for assessing responses from humans in education and psychology. In education, IRT is used to infer student abilities and characteristics of test items from student responses. Interactions…

Artificial Intelligence · Computer Science 2023-07-20 Antti Keurulainen, Isak Westerlund, Oskar Keurulainen, Andrew Howes

Probabilistically-autoencoded horseshoe-disentangled multidomain item-response theory models

Item response theory (IRT) is a non-linear generative probabilistic paradigm for using exams to identify, quantify, and compare latent traits of individuals, relative to their peers, within a population of interest. In pre-existing…

Machine Learning · Computer Science 2019-12-06 Joshua C. Chang, Shashaank Vattikuti, Carson C. Chow

Implicit assessment of language learning during practice as accurate as explicit testing

Assessment of proficiency of the learner is an essential part of Intelligent Tutoring Systems (ITS). We use Item Response Theory (IRT) in computer-aided language learning for assessment of student ability in two contexts: in test sessions,…

Artificial Intelligence · Computer Science 2024-09-25 Jue Hou, Anisia Katinskaia, Anh-Duc Vu, Roman Yangarber