Related papers: Marking: Visual Grading with Highlighting Errors a…

Automatic Short Math Answer Grading via In-context Meta-learning

Automatic short answer grading is an important research direction in the exploration of how to use artificial intelligence (AI)-based tools to improve education. Current state-of-the-art approaches use neural language models to create…

Computation and Language · Computer Science 2022-07-12 Mengxue Zhang , Sami Baral , Neil Heffernan , Andrew Lan

Mark My Works Autograder for Programming Courses

Large programming courses struggle to provide timely, detailed feedback on student code. We developed Mark My Works, a local autograding system that combines traditional unit testing with LLM-generated explanations. The system uses…

Software Engineering · Computer Science 2026-01-16 Yiding Qiu , Seyed Mahdi Azimi , Artem Lensky

Generative Grading: Near Human-level Accuracy for Automated Feedback on Richly Structured Problems

Access to high-quality education at scale is limited by the difficulty of providing student feedback on open-ended assignments in structured domains like computer programming, graphics, and short response questions. This problem has proven…

Machine Learning · Computer Science 2021-03-25 Ali Malik , Mike Wu , Vrinda Vasavada , Jinpeng Song , Madison Coots , John Mitchell , Noah Goodman , Chris Piech

Ratas framework: A comprehensive genai-based approach to rubric-based marking of real-world textual exams

Automated answer grading is a critical challenge in educational technology, with the potential to streamline assessment processes, ensure grading consistency, and provide timely feedback to students. However, existing approaches are often…

Computation and Language · Computer Science 2025-06-02 Masoud Safilian , Amin Beheshti , Stephen Elbourn

BAGS: An automatic homework grading system using the pictures taken by smart phones

Homework grading is critical to evaluate teaching quality and effect. However, it is usually time-consuming to grade the homework manually. In automatic homework grading scenario, many optical mark reader (OMR)-based solutions which require…

Computer Vision and Pattern Recognition · Computer Science 2019-06-11 Xiaoshuo Li , Tiezhu Yue , Xuanping Huang , Zhe Yang , Gang Xu

Automatic Short Answer Grading and Feedback Using Text Mining Methods

Automatic grading is not a new approach but the need to adapt the latest technology to automatic grading has become very important. As the technology has rapidly became more powerful on scoring exams and essays, especially from the 1990s…

Computation and Language · Computer Science 2020-04-20 Neslihan Suzen , Alexander Gorban , Jeremy Levesley , Evgeny Mirkes

Beyond Static Scoring: Enhancing Assessment Validity via AI-Generated Interactive Verification

Large Language Models (LLMs) challenge the validity of traditional open-ended assessments by blurring the lines of authorship. While recent research has focused on the accuracy of automated scoring (AES), these static approaches fail to…

Computers and Society · Computer Science 2025-12-16 Tom Lee , Sihoon Lee , Seonghun Kim

Neural network approach to classifying alarming student responses to online assessment

Automated scoring engines are increasingly being used to score the free-form text responses that students give to questions. Such engines are not designed to appropriately deal with responses that a human reader would find alarming such as…

Information Retrieval · Computer Science 2018-09-25 Christopher M. Ormerod , Amy E. Harris

Focusing on Students, not Machines: Grounded Question Generation and Automated Answer Grading

Digital technologies are increasingly used in education to reduce the workload of teachers and students. However, creating open-ended study or examination questions and grading their answers is still a tedious task. This thesis presents the…

Computation and Language · Computer Science 2025-06-17 Gérôme Meyer , Philip Breuer

User-Centric Evidence Ranking for Attribution and Fact Verification

Attribution and fact verification are critical challenges in natural language processing for assessing information reliability. While automated systems and Large Language Models (LLMs) aim to retrieve and select concise evidence to support…

Computation and Language · Computer Science 2026-01-30 Guy Alt , Eran Hirsch , Serwar Basch , Ido Dagan , Oren Glickman

Edit Based Grading of SQL Queries

Grading student SQL queries manually is a tedious and error-prone process. Earlier work on testing correctness of student SQL queries, such as the XData system, can be used to test correctness of a student query. However, in case a student…

Databases · Computer Science 2019-12-20 Bikash Chandra , Ananyo Banerjee , Udbhas Hazra , Mathew Joseph , S. Sudarshan

Unravelling Interlanguage Facts via Explainable Machine Learning

Native language identification (NLI) is the task of training (via supervised machine learning) a classifier that guesses the native language of the author of a text. This task has been extensively researched in the last decade, and the…

Computation and Language · Computer Science 2022-08-03 Barbara Berti , Andrea Esuli , Fabrizio Sebastiani

When VLMs 'Fix' Students: Identifying and Penalizing Over-Correction in the Evaluation of Multi-line Handwritten Math OCR

Accurate transcription of handwritten mathematics is crucial for educational AI systems, yet current benchmarks fail to evaluate this capability properly. Most prior studies focus on single-line expressions and rely on lexical metrics such…

Computers and Society · Computer Science 2026-05-27 Jin Seong , Wencke Liermann , Minho Kim , Jong-hun Shin , Soojong Lim

A Framework for Evaluation of Machine Reading Comprehension Gold Standards

Machine Reading Comprehension (MRC) is the task of answering a question over a paragraph of text. While neural MRC systems gain popularity and achieve noticeable performance, issues are being raised with the methodology used to establish…

Computation and Language · Computer Science 2020-03-11 Viktor Schlegel , Marco Valentino , André Freitas , Goran Nenadic , Riza Batista-Navarro

Knowledge Markers: An AI-Agnostic Concept for the Design of Programming Courses

Generative AI enables students to produce plausible code quickly. Producing working code is therefore no longer a reliable indicator of understanding. This is particularly problematic in non-computer-science programmes, where time…

Computers and Society · Computer Science 2026-04-09 Christina Maria Mayr

Transforming Student Evaluation with Adaptive Intelligence and Performance Analytics

The development in Artificial Intelligence (AI) offers transformative potential for redefining student assessment methodologies. This paper aims to establish the idea of the advancement of Artificial Intelligence (AI) and its prospect in…

Computers and Society · Computer Science 2025-03-10 Pushpalatha K S , Abhishek Mangalur , Ketan Hegde , Chetan Badachi , Mohammad Aamir

Towards Human-Like Grading: A Unified LLM-Enhanced Framework for Subjective Question Evaluation

Automatic grading of subjective questions remains a significant challenge in examination assessment due to the diversity in question formats and the open-ended nature of student responses. Existing works primarily focus on a specific type…

Computation and Language · Computer Science 2025-10-10 Fanwei Zhua , Jiaxuan He , Xiaoxiao Chen , Zulong Chen , Quan Lu , Chenrui Mei

A Neural-Symbolic Approach Towards Identifying Grammatically Correct Sentences

Textual content around us is growing on a daily basis. Numerous articles are being written as we speak on online newspapers, blogs, or social media. Similarly, recent advances in the AI field, like language models or traditional classic AI…

Computation and Language · Computer Science 2023-07-18 Nicos Isaak

Grad2Task: Improved Few-shot Text Classification Using Gradients for Task Representation

Large pretrained language models (LMs) like BERT have improved performance in many disparate natural language processing (NLP) tasks. However, fine tuning such models requires a large number of training examples for each target task.…

Computation and Language · Computer Science 2022-01-28 Jixuan Wang , Kuan-Chieh Wang , Frank Rudzicz , Michael Brudno

Ranking Clarification Questions via Natural Language Inference

Given a natural language query, teaching machines to ask clarifying questions is of immense utility in practical natural language processing systems. Such interactions could help in filling information gaps for better machine comprehension…

Machine Learning · Computer Science 2020-08-19 Vaibhav Kumar , Vikas Raunak , Jamie Callan