Related papers: Improving Socratic Question Generation using Data …

Automatic Generation of Socratic Subquestions for Teaching Math Word Problems

Socratic questioning is an educational method that allows students to discover answers to complex problems by asking them a series of thoughtful questions. Generation of didactically sound questions is challenging, requiring understanding…

Computation and Language · Computer Science 2022-11-24 Kumar Shridhar, Jakub Macina, Mennatallah El-Assady, Tanmay Sinha, Manu Kapur, Mrinmaya Sachan

Boosting Large Language Models with Socratic Method for Conversational Mathematics Teaching

With the introduction of large language models (LLMs), automatic math reasoning has seen tremendous success. However, current methods primarily focus on providing solutions or using techniques like Chain-of-Thought to enhance…

Computation and Language · Computer Science 2024-07-25 Yuyang Ding, Hanglei Hu, Jie Zhou, Qin Chen, Bo Jiang, Liang He

Socratic Students: Teaching Language Models to Learn by Asking Questions

Large language models (LLMs) are usually used to answer questions, but many high-stakes applications (e.g., tutoring, clinical support) require the complementary skill of asking questions: detecting missing information, requesting…

Artificial Intelligence · Computer Science 2026-01-07 Rajeev Bhatt Ambati, Tianyi Niu, Aashu Singh, Shlok Mishra, Snigdha Chaturvedi, Shashank Srivastava

Learning to Ask Informative Questions: Enhancing LLMs with Preference Optimization and Expected Information Gain

Questions are essential tools for acquiring the necessary information to complete information-seeking tasks. However, large language models (LLMs), especially open-source models, often perform poorly in generating informative questions, as…

Computation and Language · Computer Science 2024-10-18 Davide Mazzaccara, Alberto Testoni, Raffaella Bernardi

Self-Boosting Large Language Models with Synthetic Preference Data

Through alignment with human preferences, Large Language Models (LLMs) have advanced significantly in generating honest, harmless, and helpful responses. However, collecting high-quality preference data is a resource-intensive and…

Computation and Language · Computer Science 2024-10-10 Qingxiu Dong, Li Dong, Xingxing Zhang, Zhifang Sui, Furu Wei

Enhancing Critical Thinking in Education by means of a Socratic Chatbot

While large language models (LLMs) are increasingly playing a pivotal role in education by providing instantaneous, adaptive responses, their potential to promote critical thinking remains understudied. In this paper, we fill such a gap and…

Human-Computer Interaction · Computer Science 2024-09-10 Lucile Favero, Juan Antonio Pérez-Ortiz, Tanja Käser, Nuria Oliver

Can Language Models Employ the Socratic Method? Experiments with Code Debugging

When employing the Socratic method of teaching, instructors guide students toward solving a problem on their own rather than providing the solution directly. While this strategy can substantially improve learning outcomes, it is usually…

Computation and Language · Computer Science 2023-10-06 Erfan Al-Hossami, Razvan Bunescu, Justin Smith, Ryan Teehan

Findings of MEGA: Maths Explanation with LLMs using the Socratic Method for Active Learning

This paper presents an intervention study on the effects of the combined methods of (1) the Socratic method, (2) Chain of Thought (CoT) reasoning, (3) simplified gamification and (4) formative feedback on university students' Maths learning…

Computation and Language · Computer Science 2025-07-17 Tosin Adewumi, Foteini Simistira Liwicki, Marcus Liwicki, Viktor Gardelli, Lama Alkhaled, Hamam Mokayed

Improving the Validity of Automatically Generated Feedback via Reinforcement Learning

Automatically generating feedback via large language models (LLMs) in intelligent tutoring systems and online learning platforms has the potential to improve the learning outcomes of many students. However, both feedback generation and…

Computation and Language · Computer Science 2024-12-13 Alexander Scarlatos, Digory Smith, Simon Woodhead, Andrew Lan

AI-TA: Towards an Intelligent Question-Answer Teaching Assistant using Open-Source LLMs

Responding to the thousands of student questions on online QA platforms each semester has a considerable human cost, particularly in computing courses with rapidly growing enrollments. To address the challenges of scalable and intelligent…

Machine Learning · Computer Science 2023-12-20 Yann Hicke, Anmol Agarwal, Qianou Ma, Paul Denny

Socratic-Zero: Bootstrapping Reasoning via Data-Free Agent Co-evolution

Recent breakthroughs in large language models (LLMs) on reasoning tasks rely heavily on massive, high-quality datasets, typically human-annotated and thus difficult to scale. While data synthesis or distillation offers a promising…

Computation and Language · Computer Science 2025-09-30 Shaobo Wang, Zhengbo Jiao, Zifan Zhang, Yilang Peng, Xu Ze, Boyu Yang, Wei Wang, Hu Wei, Linfeng Zhang

Self-Training with Direct Preference Optimization Improves Chain-of-Thought Reasoning

Effective training of language models (LMs) for mathematical reasoning tasks demands high-quality supervised fine-tuning data. Besides obtaining annotations from human experts, a common alternative is sampling from larger and more powerful…

Computation and Language · Computer Science 2024-07-26 Tianduo Wang, Shichen Li, Wei Lu

Boosting LLM Reasoning via Spontaneous Self-Correction

While large language models (LLMs) have demonstrated remarkable success on a broad range of tasks, math reasoning remains a challenging one. One of the approaches for improving math reasoning is self-correction, which designs self-improving…

Artificial Intelligence · Computer Science 2025-06-10 Xutong Zhao, Tengyu Xu, Xuewei Wang, Zhengxing Chen, Di Jin, Liang Tan, Yen-Ting, Zishun Yu, Zhuokai Zhao, Yun He, Sinong Wang, Han Fang, Sarath Chandar, Chen Zhu

Iterative Reasoning Preference Optimization

Iterative preference optimization methods have recently been shown to perform well for general instruction tuning tasks, but typically make little improvement on reasoning tasks (Yuan et al., 2024; Chen et al., 2024). In this work we…

Computation and Language · Computer Science 2024-06-27 Richard Yuanzhe Pang, Weizhe Yuan, Kyunghyun Cho, He He, Sainbayar Sukhbaatar, Jason Weston

Self-Improving Code Generation via Semantic Entropy and Behavioral Consensus

Improving the code generation capabilities of large language models (LLMs) typically relies on supervised fine-tuning or preference optimization, both of which require costly external resources such as powerful teacher models or reliable…

Software Engineering · Computer Science 2026-04-01 Huan Zhang, Wei Cheng, Wei Hu

Balancing Cost and Effectiveness of Synthetic Data Generation Strategies for LLMs

As large language models (LLMs) are applied to more use cases, creating high quality, task-specific datasets for fine-tuning becomes a bottleneck for model improvement. Using high quality human data has been the most common approach to…

Computation and Language · Computer Science 2024-10-31 Yung-Chieh Chan, George Pu, Apaar Shanker, Parth Suresh, Penn Jenks, John Heyer, Sam Denton

SIaM: Self-Improving Code-Assisted Mathematical Reasoning of Large Language Models

There is a growing trend of teaching large language models (LLMs) to solve mathematical problems through coding. Existing studies primarily focus on prompting powerful, closed-source models to generate seed training data followed by…

Computation and Language · Computer Science 2024-08-29 Dian Yu, Baolin Peng, Ye Tian, Linfeng Song, Haitao Mi, Dong Yu

Wisdom of the Crowd, Without the Crowd: A Socratic LLM for Asynchronous Deliberation on Perspectivist Data

Data annotation underpins the success of modern AI, but the aggregation of crowd-collected datasets can harm the preservation of diverse perspectives in data. Difficult and ambiguous tasks cannot easily be collapsed into unitary labels.…

Human-Computer Interaction · Computer Science 2025-08-14 Malik Khadar, Daniel Runningen, Julia Tang, Stevie Chancellor, Harmanpreet Kaur

Exploring Conversational Design Choices in LLMs for Pedagogical Purposes: Socratic and Narrative Approaches for Improving Instructor's Teaching Practice

Large language models (LLMs) typically generate direct answers, yet they are increasingly used as learning tools. Studying instructors' usage is critical, given their role in teaching and guiding AI adoption in education. We designed and…

Human-Computer Interaction · Computer Science 2025-09-16 Si Chen, Isabel R. Molnar, Peiyu Li, Adam Acunin, Ting Hua, Alex Ambrose, Nitesh V. Chawla, Ronald Metoyer

SemPA: Improving Sentence Embeddings of Large Language Models through Semantic Preference Alignment

Traditional sentence embedding methods employ token-level contrastive learning on non-generative pre-trained models. Recently, there have emerged embedding methods based on generative large language models (LLMs). These methods either rely…

Computation and Language · Computer Science 2026-01-09 Ziyang Chen, Zhenxuan Huang, Yile Wang, Weiqin Wang, Lu Yin, Hui Huang