Related papers: KASER: Knowledge-Aligned Student Error Simulator f…

BIDER: Bridging Knowledge Inconsistency for Efficient Retrieval-Augmented LLMs via Key Supporting Evidence

Retrieval-augmented large language models (LLMs) have demonstrated efficacy in knowledge-intensive tasks such as open-domain QA, addressing inherent challenges in knowledge update and factual inadequacy. However, inconsistencies between…

Computation and Language · Computer Science 2024-05-31 Jiajie Jin , Yutao Zhu , Yujia Zhou , Zhicheng Dou

Test Case-Informed Knowledge Tracing for Open-ended Coding Tasks

Open-ended coding tasks, which ask students to construct programs according to certain specifications, are common in computer science education. Student modeling can be challenging since their open-ended nature means that student code can…

Computers and Society · Computer Science 2024-12-24 Zhangqi Duan , Nigel Fernandez , Alexander Hicks , Andrew Lan

LECTOR: LLM-Enhanced Concept-based Test-Oriented Repetition for Adaptive Spaced Learning

Spaced repetition systems are fundamental to efficient learning and memory retention, but existing algorithms often struggle with semantic interference and personalized adaptation. We present LECTOR (\textbf{L}LM-\textbf{E}nhanced…

Computation and Language · Computer Science 2025-08-06 Jiahao Zhao

Does This Look Familiar to You? Knowledge Analysis via Model Internal Representations

Recent advances in large language models (LLMs) have been driven by pretraining, supervised fine tuning (SFT), and alignment tuning. Among these, SFT plays a crucial role in transforming a model 's general knowledge into structured…

Computation and Language · Computer Science 2025-09-10 Sihyun Park

OpenSIR: Open-Ended Self-Improving Reasoner

Recent advances in large language model (LLM) reasoning through reinforcement learning rely on annotated datasets for verifiable rewards, which may limit models' ability to surpass human-level performance. While self-play offers a promising…

Computation and Language · Computer Science 2026-01-01 Wai-Chung Kwan , Joshua Ong Jun Leang , Pavlos Vougiouklis , Jeff Z. Pan , Marco Valentino , Pasquale Minervini

Learning to Make MISTAKEs: Modeling Incorrect Student Thinking And Key Errors

Research on reasoning in language models (LMs) predominantly focuses on improving the correctness of their outputs. But some important applications require modeling reasoning patterns that are incorrect. For example, automated systems that…

Machine Learning · Computer Science 2025-10-14 Alexis Ross , Jacob Andreas

SCALER:Synthetic Scalable Adaptive Learning Environment for Reasoning

Reinforcement learning (RL) offers a principled way to enhance the reasoning capabilities of large language models, yet its effectiveness hinges on training signals that remain informative as models evolve. In practice, RL progress often…

Artificial Intelligence · Computer Science 2026-05-05 Caijun Xu , Changyi Xiao , Zhongyuan Peng , Xinrun Wang , Yixin Cao

Using LLMs for Knowledge Component-level Correctness Labeling in Open-ended Coding Problems

Fine-grained skill representations, commonly referred to as knowledge components (KCs), are fundamental to many approaches in student modeling and learning analytics. However, KC-level correctness labels are rarely available in real-world…

Computation and Language · Computer Science 2026-03-31 Zhangqi Duan , Arnav Kankaria , Dhruv Kartik , Andrew Lan

CodeScaler: Scaling Code LLM Training and Test-Time Inference via Reward Models

Reinforcement Learning from Verifiable Rewards (RLVR) has driven recent progress in code large language models by leveraging execution-based feedback from unit tests, but its scalability is fundamentally constrained by the availability and…

Machine Learning · Computer Science 2026-05-19 Xiao Zhu , Xinyu Zhou , Boyu Zhu , Hanxu Hu , Mingzhe Du , Haotian Zhang , Huiming Wang , Zhijiang Guo

KAUCUS: Knowledge Augmented User Simulators for Training Language Model Assistants

An effective multi-turn instruction-following assistant can be developed by creating a simulator that can generate useful interaction data. Apart from relying on its intrinsic weights, an ideal user simulator should also be able to…

Human-Computer Interaction · Computer Science 2024-01-31 Kaustubh D. Dhole

From Amateur to Master: Infusing Knowledge into LLMs via Automated Curriculum Learning

Large Language Models (LLMs) excel at general tasks but underperform in specialized domains like economics and psychology, which require deep, principled understanding. To address this, we introduce ACER (Automated Curriculum-Enhanced…

Computation and Language · Computer Science 2025-10-31 Nishit Neema , Srinjoy Mukherjee , Sapan Shah , Gokul Ramakrishnan , Ganesh Venkatesh

Embracing Imperfection: Simulating Students with Diverse Cognitive Levels Using LLM-based Agents

Large language models (LLMs) are revolutionizing education, with LLM-based agents playing a key role in simulating student behavior. A major challenge in student simulation is modeling the diverse learning patterns of students at various…

Machine Learning · Computer Science 2025-08-12 Tao Wu , Jingyuan Chen , Wang Lin , Mengze Li , Yumeng Zhu , Ang Li , Kun Kuang , Fei Wu

LASeR: Learning to Adaptively Select Reward Models with Multi-Armed Bandits

Reward Models (RMs) are crucial to aligning large language models (LLMs), but the degree to which an RM specialized to one task (e.g. writing) generalizes to new tasks (e.g. math) is often not known a priori, often making using only one…

Computation and Language · Computer Science 2025-10-23 Duy Nguyen , Archiki Prasad , Elias Stengel-Eskin , Mohit Bansal

Co-Learning: Code Learning for Multi-Agent Reinforcement Collaborative Framework with Conversational Natural Language Interfaces

Online question-and-answer (Q\&A) systems based on the Large Language Model (LLM) have progressively diverged from recreational to professional use. This paper proposed a Multi-Agent framework with environmentally reinforcement learning…

Software Engineering · Computer Science 2024-09-05 Jiapeng Yu , Yuqian Wu , Yajing Zhan , Wenhao Guo , Zhou Xu , Raymond Lee

DistillER: Knowledge Distillation in Entity Resolution with Large Language Models

Recent advances in Entity Resolution (ER) have leveraged Large Language Models (LLMs), achieving strong performance but at the cost of substantial computational resources or high financial overhead. Existing LLM-based ER approaches operate…

Databases · Computer Science 2026-02-06 Alexandros Zeakis , George Papadakis , Dimitrios Skoutas , Manolis Koubarakis

Enhancing Continual Learning for Software Vulnerability Prediction: Addressing Catastrophic Forgetting via Hybrid-Confidence-Aware Selective Replay for Temporal LLM Fine-Tuning

Recent work applies Large Language Models (LLMs) to source-code vulnerability detection, but most evaluations still rely on random train-test splits that ignore time and overestimate real-world performance. In practice, detectors are…

Cryptography and Security · Computer Science 2026-03-02 Xuhui Dou , Hayretdin Bahsi , Alejandro Guerra-Manzanares

Curr-RLCER:Curriculum Reinforcement Learning For Coherence Explainable Recommendation

Explainable recommendation systems (RSs) are designed to explicitly uncover the rationale of each recommendation, thereby enhancing the transparency and credibility of RSs. Previous methods often jointly predicted ratings and generated…

Information Retrieval · Computer Science 2026-04-08 Xiangchen Pan , Wei Wei

MACER: A Modular Framework for Accelerated Compilation Error Repair

Automated compilation error repair, the problem of suggesting fixes to buggy programs that fail to compile, has generated significant interest in recent years. Apart from being a tool of general convenience, automated code repair has…

Software Engineering · Computer Science 2020-05-29 Darshak Chhatbar , Umair Z. Ahmed , Purushottam Kar

Scalable Language Model with Generalized Continual Learning

Continual learning has gained increasing importance as it facilitates the acquisition and refinement of scalable knowledge and skills in language models. However, existing methods typically encounter strict limitations and challenges in…

Computation and Language · Computer Science 2024-04-12 Bohao Peng , Zhuotao Tian , Shu Liu , Mingchang Yang , Jiaya Jia

Code as Reward: Empowering Reinforcement Learning with VLMs

Pre-trained Vision-Language Models (VLMs) are able to understand visual concepts, describe and decompose complex tasks into sub-tasks, and provide feedback on task completion. In this paper, we aim to leverage these capabilities to support…

Machine Learning · Computer Science 2024-02-08 David Venuto , Sami Nur Islam , Martin Klissarov , Doina Precup , Sherry Yang , Ankit Anand