Related papers: Model-Based Simulation for Optimising Smart Reply

Surrogate Scoring Rules

Strictly proper scoring rules (SPSR) are incentive compatible for eliciting information about random variables from strategic agents when the principal can reward agents after the realization of the random variables. They also quantify the…

Computer Science and Game Theory · Computer Science 2020-06-09 Yang Liu , Juntao Wang , Yiling Chen

AutoSimulate: (Quickly) Learning Synthetic Data Generation

Simulation is increasingly being used for generating large labelled datasets in many machine learning problems. Recent methods have focused on adjusting simulator parameters with the goal of maximising accuracy on a validation task, usually…

Computer Vision and Pattern Recognition · Computer Science 2020-08-20 Harkirat Singh Behl , Atılım Güneş Baydin , Ran Gal , Philip H. S. Torr , Vibhav Vineet

LLM-based Smart Reply (LSR): Enhancing Collaborative Performance with ChatGPT-mediated Smart Reply System

Interactive user interfaces have increasingly explored AI's role in enhancing communication efficiency and productivity in collaborative tasks. The emergence of Large Language Models (LLMs) such as ChatGPT has revolutionized conversational…

Human-Computer Interaction · Computer Science 2024-03-06 Ashish Bastola , Hao Wang , Judsen Hembree , Pooja Yadav , Zihao Gong , Emma Dixon , Abolfazl Razi , Nathan McNeese

Application of Statistical Relational Learning to Hybrid Recommendation Systems

Recommendation systems usually involve exploiting the relations among known features and content that describe items (content-based filtering) or the overlap of similar users who interacted with or rated the target item (collaborative…

Artificial Intelligence · Computer Science 2016-07-06 Shuo Yang , Mohammed Korayem , Khalifeh AlJadda , Trey Grainger , Sriraam Natarajan

Search-Based Interaction For Conversation Recommendation via Generative Reward Model Based Simulated User

Conversational recommendation systems (CRSs) use multi-turn interaction to capture user preferences and provide personalized recommendations. A fundamental challenge in CRSs lies in effectively understanding user preferences from…

Information Retrieval · Computer Science 2025-04-30 Xiaolei Wang , Chunxuan Xia , Junyi Li , Fanzhe Meng , Lei Huang , Jinpeng Wang , Wayne Xin Zhao , Ji-Rong Wen

Sampling Through the Lens of Sequential Decision Making

Sampling is ubiquitous in machine learning methodologies. Due to the growth of large datasets and model complexity, we want to learn and adapt the sampling process while training a representation. Towards achieving this grand goal, a…

Machine Learning · Computer Science 2022-12-14 Jason Xiaotian Dou , Alvin Qingkai Pan , Runxue Bao , Haiyi Harry Mao , Lei Luo , Zhi-Hong Mao

Learning to Make MISTAKEs: Modeling Incorrect Student Thinking And Key Errors

Research on reasoning in language models (LMs) predominantly focuses on improving the correctness of their outputs. But some important applications require modeling reasoning patterns that are incorrect. For example, automated systems that…

Machine Learning · Computer Science 2025-10-14 Alexis Ross , Jacob Andreas

Sim2Rec: A Simulator-based Decision-making Approach to Optimize Real-World Long-term User Engagement in Sequential Recommender Systems

Long-term user engagement (LTE) optimization in sequential recommender systems (SRS) is shown to be suited by reinforcement learning (RL) which finds a policy to maximize long-term rewards. Meanwhile, RL has its shortcomings, particularly…

Information Retrieval · Computer Science 2023-05-09 Xiong-Hui Chen , Bowei He , Yang Yu , Qingyang Li , Zhiwei Qin , Wenjie Shang , Jieping Ye , Chen Ma

Uncertainty-Aware Answer Selection for Improved Reasoning in Multi-LLM Systems

Large Language Models (LLMs) have demonstrated exceptional capabilities, yet selecting the most reliable response from multiple LLMs remains a challenge, particularly in resource-constrained settings. Existing approaches often depend on…

Computation and Language · Computer Science 2025-10-06 Aakriti Agrawal , Rohith Aralikatti , Anirudh Satheesh , Souradip Chakraborty , Amrit Singh Bedi , Furong Huang

SMART: Simulated Students Aligned with Item Response Theory for Question Difficulty Prediction

Item (question) difficulties play a crucial role in educational assessments, enabling accurate and efficient assessment of student abilities and personalization to maximize learning outcomes. Traditionally, estimating item difficulties can…

Computation and Language · Computer Science 2025-09-19 Alexander Scarlatos , Nigel Fernandez , Christopher Ormerod , Susan Lottridge , Andrew Lan

Reward Reasoning Model

Reward models play a critical role in guiding large language models toward outputs that align with human expectations. However, an open challenge remains in effectively utilizing test-time compute to enhance reward model performance. In…

Computation and Language · Computer Science 2025-05-21 Jiaxin Guo , Zewen Chi , Li Dong , Qingxiu Dong , Xun Wu , Shaohan Huang , Furu Wei

Response-Aware User Memory Selection for LLM Personalization

A common approach to personalization in large language models (LLMs) is to incorporate a subset of the user memory into the prompt at inference time to guide the model's generation. Existing methods select these subsets primarily using…

Artificial Intelligence · Computer Science 2026-04-17 Jillian Fisher , Jennifer Neville , Chan Young Park

Socratic Students: Teaching Language Models to Learn by Asking Questions

Large language Models (LLMs) are usually used to answer questions, but many high-stakes applications (e.g., tutoring, clinical support) require the complementary skill of asking questions: detecting missing information, requesting…

Artificial Intelligence · Computer Science 2026-01-07 Rajeev Bhatt Ambati , Tianyi Niu , Aashu Singh , Shlok Mishra , Snigdha Chaturvedi , Shashank Srivastava

Beyond Scores: A Modular RAG-Based System for Automatic Short Answer Scoring with Feedback

Automatic short answer scoring (ASAS) helps reduce the grading burden on educators but often lacks detailed, explainable feedback. Existing methods in ASAS with feedback (ASAS-F) rely on fine-tuning language models with limited datasets,…

Computation and Language · Computer Science 2024-10-11 Menna Fateen , Bo Wang , Tsunenori Mine

Explainable Recommendation with Simulated Human Feedback

Recent advancements in explainable recommendation have greatly bolstered user experience by elucidating the decision-making rationale. However, the existing methods actually fail to provide effective feedback signals for potentially better…

Information Retrieval · Computer Science 2025-08-08 Jiakai Tang , Jingsen Zhang , Zihang Tian , Xueyang Feng , Lei Wang , Xu Chen

Data-Efficient Reinforcement Learning with Self-Predictive Representations

While deep reinforcement learning excels at solving tasks where large amounts of data can be collected through virtually unlimited interaction with the environment, learning from limited interaction remains a key challenge. We posit that an…

Machine Learning · Computer Science 2021-05-21 Max Schwarzer , Ankesh Anand , Rishab Goel , R Devon Hjelm , Aaron Courville , Philip Bachman

When robots learn reward functions using high capacity models that take raw state directly as input, they need to both learn a representation for what matters in the task -- the task ``features" -- as well as how to combine these features…

Robotics · Computer Science 2023-03-20 Andreea Bobu , Yi Liu , Rohin Shah , Daniel S. Brown , Anca D. Dragan

MS-Ranker: Accumulating Evidence from Potentially Correct Candidates for Answer Selection

As conventional answer selection (AS) methods generally match the question with each candidate answer independently, they suffer from the lack of matching information between the question and the candidate. To address this problem, we…

Computation and Language · Computer Science 2020-10-13 Yingxue Zhang , Fandong Meng , Peng Li , Ping Jian , Jie Zhou

Simulation-Aided Policy Tuning for Black-Box Robot Learning

How can robots learn and adapt to new tasks and situations with little data? Systematic exploration and simulation are crucial tools for efficient robot learning. We present a novel black-box policy search algorithm focused on…

Robotics · Computer Science 2025-02-11 Shiming He , Alexander von Rohr , Dominik Baumann , Ji Xiang , Sebastian Trimpe

Probabilistic Super-Resolution for High-Fidelity Physical System Simulations with Uncertainty Quantification

Super-resolution (SR) is a promising tool for generating high-fidelity simulations of physical systems from low-resolution data, enabling fast and accurate predictions in engineering applications. However, existing deep-learning based SR…

Machine Learning · Computer Science 2025-02-17 Pengyu Zhang , Connor Duffin , Alex Glyn-Davies , Arnaud Vadeboncoeur , Mark Girolami