Related papers: Deep Computerized Adaptive Testing

Survey of Computerized Adaptive Testing: A Machine Learning Perspective

Computerized Adaptive Testing (CAT) offers an efficient and personalized method for assessing examinee proficiency by dynamically adjusting test questions based on individual performance. Compared to traditional, non-personalized testing…

Machine Learning · Computer Science 2026-03-17 Yan Zhuang , Qi Liu , Haoyang Bi , Zhenya Huang , Weizhe Huang , Jiatong Li , Junhao Yu , Zirui Liu , Zirui Hu , Yuting Hong , Zachary A. Pardos , Haiping Ma , Mengxiao Zhu , Shijin Wang , Enhong Chen

Addressing Selection Bias in Computerized Adaptive Testing: A User-Wise Aggregate Influence Function Approach

Computerized Adaptive Testing (CAT) is a widely used, efficient test mode that adapts to the examinee's proficiency level in the test domain. CAT requires pre-trained item profiles, for CAT iteratively assesses the student real-time based…

Machine Learning · Computer Science 2025-03-12 Soonwoo Kwon , Sojung Kim , Seunghyun Lee , Jin-Young Kim , Suyeong An , Kyuseok Kim

Bayesian model-averaging stochastic item selection for adaptive testing

Computer Adaptive Testing (CAT) aims to accurately estimate an individual's ability using only a subset of an Item Response Theory (IRT) instrument. Many applications also require diverse item exposure across testing sessions, preventing…

Methodology · Statistics 2026-04-01 Tina Su , Edison Choe , Joshua C. Chang

Probabilistic Models for Computerized Adaptive Testing

In this paper we follow our previous research in the area of Computerized Adaptive Testing (CAT). We present three different methods for CAT. One of them, the item response theory, is a well established method, while the other two, Bayesian…

Computers and Society · Computer Science 2017-03-30 Martin Plajner

BOBCAT: Bilevel Optimization-Based Computerized Adaptive Testing

Computerized adaptive testing (CAT) refers to a form of tests that are personalized to every student/test taker. CAT methods adaptively select the next most informative question/item for each student given their responses to previous…

Machine Learning · Computer Science 2021-08-18 Aritra Ghosh , Andrew Lan

Computerized adaptive testing: implementation issues

One of the fastest evolving field among teaching and learning research is students' performance evaluation. Computer based testing systems are increasingly adopted by universities. However, the implementation and maintenance of such a…

Computers and Society · Computer Science 2016-08-14 Margit Antal , Levente Erős , Attila Imre

BanditCAT and AutoIRT: Machine Learning Approaches to Computerized Adaptive Testing and Item Calibration

In this paper, we present a complete framework for quickly calibrating and administering a robust large-scale computerized adaptive test (CAT) with a small number of responses. Calibration - learning item parameters in a test - is done…

Machine Learning · Statistics 2024-10-29 James Sharpnack , Kevin Hao , Phoebe Mulcaire , Klinton Bicknell , Geoff LaFlair , Kevin Yancey , Alina A. von Davier

Sequential Design for Computerized Adaptive Testing that Allows for Response Revision

In computerized adaptive testing (CAT), items (questions) are selected in real time based on the already observed responses, so that the ability of the examinee can be estimated as accurately as possible. This is typically formulated as a…

Statistics Theory · Mathematics 2015-01-08 Shiyu Wang , Georgios Fellouris , Hua-Hua Chang

Probabilistic Models for Computerized Adaptive Testing: Experiments

This paper follows previous research we have already performed in the area of Bayesian networks models for CAT. We present models using Item Response Theory (IRT - standard CAT method), Bayesian networks, and neural networks. We conducted…

Artificial Intelligence · Computer Science 2016-02-02 Martin Plajner , Jiří Vomlel

Quality meets Diversity: A Model-Agnostic Framework for Computerized Adaptive Testing

Computerized Adaptive Testing (CAT) is emerging as a promising testing application in many scenarios, such as education, game and recruitment, which targets at diagnosing the knowledge mastery levels of examinees on required concepts. It…

Artificial Intelligence · Computer Science 2021-01-18 Haoyang Bi , Haiping Ma , Zhenya Huang , Yu Yin , Qi Liu , Enhong Chen , Yu Su , Shijin Wang

Bayesian Network Models for Adaptive Testing

Computerized adaptive testing (CAT) is an interesting and promising approach to testing human abilities. In our research we use Bayesian networks to create a model of tested humans. We collected data from paper tests performed with grammar…

Artificial Intelligence · Computer Science 2017-03-28 Martin Plajner , Jiří Vomlel

AutoIRT: Calibrating Item Response Theory Models with Automated Machine Learning

Item response theory (IRT) is a class of interpretable factor models that are widely used in computerized adaptive tests (CATs), such as language proficiency tests. Traditionally, these are fit using parametric mixed effects models on the…

Machine Learning · Computer Science 2024-09-16 James Sharpnack , Phoebe Mulcaire , Klinton Bicknell , Geoff LaFlair , Kevin Yancey

Selective Mixup for Debiasing Question Selection in Computerized Adaptive Testing

Computerized Adaptive Testing (CAT) is a widely used technology for evaluating learners' proficiency in online education platforms. By leveraging prior estimates of proficiency to select questions and updating the estimates iteratively…

Information Retrieval · Computer Science 2025-12-24 Mi Tian , Kun Zhang , Fei Liu , Jinglong Li , Yuxin Liao , Chenxi Bai , Zhengtao Tan , Le Wu , Richang Hong

Scalable Learning of Item Response Theory Models

Item Response Theory (IRT) models aim to assess latent abilities of $n$ examinees along with latent difficulty characteristics of $m$ test items from categorical data that indicates the quality of their corresponding answers. Classical…

Machine Learning · Computer Science 2024-08-16 Susanne Frick , Amer Krivošija , Alexander Munteanu

CodeGENCAT: Generative Computerized Adaptive Testing for Open-ended Coding Problems

Existing Computerized Adaptive Testing (CAT) frameworks typically select questions based on the predicted likelihood that the student will answer correctly. This design ignores information contained in students' open-ended responses,…

Computation and Language · Computer Science 2026-05-28 Wanyong Feng , Alexander Scarlatos , Ruochen Sun , Andrew Lan

Confident Rankings with Fewer Items: Adaptive LLM Evaluation with Continuous Scores

Computerized Adaptive Testing (CAT) has proven effective for efficient LLM evaluation on multiple-choice benchmarks, but modern LLM evaluation increasingly relies on generation tasks where outputs are scored continuously rather than marked…

Computation and Language · Computer Science 2026-01-21 Esma Balkır , Alice Pernthaller , Marco Basaldella , José Hernández-Orallo , Nigel Collier

Balancing Test Accuracy and Security in Computerized Adaptive Testing

Computerized adaptive testing (CAT) is a form of personalized testing that accurately measures students' knowledge levels while reducing test length. Bilevel optimization-based CAT (BOBCAT) is a recent framework that learns a data-driven…

Computers and Society · Computer Science 2023-05-31 Wanyong Feng , Aritra Ghosh , Stephen Sireci , Andrew S. Lan

Adaptive Testing for LLM Evaluation: A Psychometric Alternative to Static Benchmarks

Evaluating large language models (LLMs) typically requires thousands of benchmark items, making the process expensive, slow, and increasingly impractical at scale. Existing evaluation protocols rely on average accuracy over fixed item sets,…

Computation and Language · Computer Science 2026-02-03 Peiyu Li , Xiuxiu Tang , Si Chen , Ying Cheng , Ronald Metoyer , Ting Hua , Nitesh V. Chawla

Enhancing Item Response Theory for Cognitive Diagnosis

Cognitive diagnosis is a fundamental and crucial task in many educational applications, e.g., computer adaptive test and cognitive assignments. Item Response Theory (IRT) is a classical cognitive diagnosis method which can provide…

Artificial Intelligence · Computer Science 2019-12-03 Song Cheng , Qi Liu

A Latent Variable Model with Change-Points and Its Application to Time Pressure Effects in Educational Assessment

Educational assessments are valuable tools for measuring student knowledge and skills, but their validity can be compromised when test takers exhibit changes in response behavior due to factors such as time pressure. To address this issue,…

Methodology · Statistics 2025-05-06 Gabriel Wallin , Yunxiao Chen , Yi-Hsuan Lee , Xiaoou Li