Related papers: Minimizing Errors or Surprises?

Quantifying the Prediction Uncertainty of Machine Learning Models for Individual Data

Machine learning models have exhibited exceptional results in various domains. The most prevalent approach for learning is the empirical risk minimizer (ERM), which adapts the model's weights to reduce the loss on a training set and…

Machine Learning · Computer Science 2024-12-11 Koby Bibas

Model-based RL as a Minimalist Approach to Horizon-Free and Second-Order Bounds

Learning a transition model via Maximum Likelihood Estimation (MLE) followed by planning inside the learned model is perhaps the most standard and simplest Model-based Reinforcement Learning (RL) framework. In this work, we show that such a…

Machine Learning · Computer Science 2024-10-30 Zhiyong Wang, Dongruo Zhou, John C. S. Lui, Wen Sun

Feasible Learning

We introduce Feasible Learning (FL), a sample-centric learning paradigm where models are trained by solving a feasibility problem that bounds the loss for each training sample. In contrast to the ubiquitous Empirical Risk Minimization (ERM)…

Machine Learning · Computer Science 2025-01-28 Juan Ramirez, Ignacio Hounie, Juan Elenter, Jose Gallego-Posada, Meraj Hashemizadeh, Alejandro Ribeiro, Simon Lacoste-Julien

Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees

Model-based reinforcement learning (RL) is considered to be a promising approach to reduce the sample complexity that hinders model-free RL. However, the theoretical understanding of such methods has been rather limited. This paper…

Machine Learning · Computer Science 2021-02-16 Yuping Luo, Huazhe Xu, Yuanzhi Li, Yuandong Tian, Trevor Darrell, Tengyu Ma

Balancing New Against Old Information: The Role of Surprise in Learning

Surprise describes a range of phenomena from unexpected events to behavioral responses. We propose a measure of surprise and use it for surprise-driven learning. Our surprise measure takes into account data likelihood as well as the degree…

Machine Learning · Statistics 2017-03-03 Mohammadjavad Faraji, Kerstin Preuschoff, Wulfram Gerstner

A Sober Look at Spectral Learning

Spectral learning recently generated lots of excitement in machine learning, largely because it is the first known method to produce consistent estimates (under suitable conditions) for several latent variable models. In contrast, maximum…

Machine Learning · Computer Science 2014-06-19 Han Zhao, Pascal Poupart

MACRO: A Meta-Algorithm for Conditional Risk Minimization

We study conditional risk minimization (CRM), i.e. the problem of learning a hypothesis of minimal risk for prediction at the next step of sequentially arriving dependent data. Despite it being a fundamental problem, successful learning in…

Machine Learning · Statistics 2018-11-06 Alexander Zimin, Christoph Lampert

Taylor Learning

Empirical risk minimization stands behind most optimization in supervised machine learning. Under this scheme, labeled data is used to approximate an expected cost (risk), and a learning algorithm updates model-defining parameters in search…

Machine Learning · Statistics 2023-05-25 James Schmidt

Learn More with Less: Uncertainty Consistency Guided Query Selection for RLVR

Large Language Models (LLMs) have recently improved mathematical reasoning through Reinforcement Learning with Verifiable Reward (RLVR). However, existing RLVR algorithms require large query budgets, making annotation costly. We investigate…

Artificial Intelligence · Computer Science 2026-02-02 Hao Yi, Yulan Hu, Xin Li, Sheng Ouyang, Lizhong Ding, Yong Liu

Semiparametric Mixture Regression with Unspecified Error Distributions

In fitting a mixture of linear regression models, normal assumption is traditionally used to model the error and then regression parameters are estimated by the maximum likelihood estimators (MLE). This procedure is not valid if the normal…

Methodology · Statistics 2018-11-06 Yanyuan Ma, Shaoli Wang, Lin Xu, Weixin Yao

Restricted maximum likelihood estimation in generalized linear mixed models

Restricted maximum likelihood (REML) estimation is a widely accepted and frequently used method for fitting linear mixed models, with its principal advantage being that it produces less biased estimates of the variance components. However,…

Methodology · Statistics 2025-05-15 Luca Maestrini, Francis K. C. Hui, Alan H. Welsh

Discriminator Augmented Model-Based Reinforcement Learning

By planning through a learned dynamics model, model-based reinforcement learning (MBRL) offers the prospect of good performance with little environment interaction. However, it is common in practice for the learned model to be inaccurate,…

Machine Learning · Computer Science 2021-03-31 Behzad Haghgoo, Allan Zhou, Archit Sharma, Chelsea Finn

The Perils of Optimizing Learned Reward Functions: Low Training Error Does Not Guarantee Low Regret

In reinforcement learning, specifying reward functions that capture the intended task can be very challenging. Reward learning aims to address this issue by learning the reward function. However, a learned reward model may have a low error…

Machine Learning · Computer Science 2025-07-09 Lukas Fluri, Leon Lang, Alessandro Abate, Patrick Forré, David Krueger, Joar Skalse

On the benefits of maximum likelihood estimation for Regression and Forecasting

We advocate for a practical Maximum Likelihood Estimation (MLE) approach towards designing loss functions for regression and forecasting, as an alternative to the typical approach of direct empirical risk minimization on a specific target…

Machine Learning · Statistics 2021-10-12 Pranjal Awasthi, Abhimanyu Das, Rajat Sen, Ananda Theertha Suresh

Rule Based Rewards for Language Model Safety

Reinforcement learning based fine-tuning of large language models (LLMs) on human preferences has been shown to enhance both their capabilities and safety behavior. However, in cases related to safety, without precise instructions to human…

Artificial Intelligence · Computer Science 2024-11-05 Tong Mu, Alec Helyar, Johannes Heidecke, Joshua Achiam, Andrea Vallone, Ian Kivlichan, Molly Lin, Alex Beutel, John Schulman, Lilian Weng

From Model Choice to Model Belief: Establishing a New Measure for LLM-Based Research

Large language models (LLMs) are increasingly used to simulate human behavior, but common practices to use LLM-generated data are inefficient. Treating an LLM's output ("model choice") as a single data point underutilizes the information…

Artificial Intelligence · Computer Science 2025-12-30 Hongshen Sun, Juanjuan Zhang

Rectification Difficulty and Optimal Sample Allocation in LLM-Augmented Surveys

Large Language Models can generate synthetic survey responses at low cost, but their accuracy varies unpredictably across questions. We study the design problem of allocating a fixed budget of human respondents across estimation tasks when…

Artificial Intelligence · Computer Science 2026-04-21 Zikun Ye, Hema Yoganarasimhan

A Survey on Model-based Reinforcement Learning

Reinforcement learning (RL) solves sequential decision-making problems via a trial-and-error process interacting with the environment. While RL achieves outstanding success in playing complex video games that allow huge trial-and-error,…

Machine Learning · Computer Science 2022-06-22 Fan-Ming Luo, Tian Xu, Hang Lai, Xiong-Hui Chen, Weinan Zhang, Yang Yu

Explainable Empirical Risk Minimization

The successful application of machine learning (ML) methods becomes increasingly dependent on their interpretability or explainability. Designing explainable ML systems is instrumental to ensuring transparency of automated decision-making…

Machine Learning · Computer Science 2022-07-04 L. Zhang, G. Karakasidis, A. Odnoblyudova, L. Dogruel, A. Jung

On the Feasibility of Learning, Rather than Assuming, Human Biases for Reward Inference

Our goal is for agents to optimize the right reward function, despite how difficult it is for us to specify what that is. Inverse Reinforcement Learning (IRL) enables us to infer reward functions from demonstrations, but it usually assumes…

Machine Learning · Computer Science 2019-06-25 Rohin Shah, Noah Gundotra, Pieter Abbeel, Anca D. Dragan