Related papers: Do Machine Learning Models Learn Statistical Rules…

Pedagogical Rule Extraction to Learn Interpretable Models - an Empirical Study

Machine-learning models are ubiquitous. In some domains, for instance, in medicine, the models' predictions must be interpretable. Decision trees, classification rules, and subgroup discovery are three broad categories of supervised…

Machine Learning · Computer Science 2022-04-29 Vadim Arzamasov, Benjamin Jochum, Klemens Böhm

Statistically Testing Training Data for Unwanted Error Patterns using Rule-Oriented Regression

Artificial intelligence models trained from data can only be as good as the underlying data is. Biases in training data propagating through to the output of a machine learning model are a well-documented and well-understood phenomenon, but…

Machine Learning · Computer Science 2025-04-02 Stefan Rass, Martin Dallinger

Robust Machine Learning by Transforming and Augmenting Imperfect Training Data

Machine Learning (ML) is an expressive framework for turning data into computer programs. Across many problem domains -- both in industry and policy settings -- the types of computer programs needed for accurate prediction or optimal…

Machine Learning · Computer Science 2023-12-21 Elliot Creager

Certain and Approximately Certain Models for Statistical Learning

Real-world data is often incomplete and contains missing values. To train accurate models over real-world datasets, users need to spend a substantial amount of time and resources imputing and finding proper values for missing data items. In…

Machine Learning · Statistics 2024-03-05 Cheng Zhen, Nischal Aryal, Arash Termehchy, Alireza Aghasi, Amandeep Singh Chabada

Learning from Rules Generalizing Labeled Exemplars

In many applications labeled data is not readily available, and needs to be collected via painstaking human supervision. We propose a rule-exemplar method for collecting human supervision to combine the efficiency of rules with the quality…

Machine Learning · Computer Science 2020-05-18 Abhijeet Awasthi, Sabyasachi Ghosh, Rasna Goyal, Sunita Sarawagi

Distilling Rule-based Knowledge into Large Language Models

Large language models (LLMs) have shown incredible performance in completing various real-world tasks. The current paradigm of knowledge learning for LLMs is mainly based on learning from examples, in which LLMs learn the internal rule…

Computation and Language · Computer Science 2024-12-17 Wenkai Yang, Yankai Lin, Jie Zhou, Ji-Rong Wen

Human vs. supervised machine learning: Who learns patterns faster?

The capabilities of supervised machine learning (SML), especially compared to human abilities, are being discussed in scientific research and in the usage of SML. This study provides an answer to how learning performance differs between…

Artificial Intelligence · Computer Science 2020-12-08 Niklas Kühl, Marc Goutier, Lucas Baier, Clemens Wolff, Dominik Martin

Flexible inference for animal learning rules using neural networks

Understanding how animals learn is a central challenge in neuroscience, with growing relevance to the development of animal- or human-aligned artificial intelligence. However, existing approaches tend to assume fixed parametric forms for…

Machine Learning · Computer Science 2026-02-06 Yuhan Helena Liu, Victor Geadah, Jonathan Pillow

Towards Integrating Personal Knowledge into Test-Time Predictions

Machine learning (ML) models can make decisions based on large amounts of data, but they can be missing personal knowledge available to human users about whom predictions are made. For example, a model trained to predict psychiatric…

Machine Learning · Computer Science 2024-06-14 Isaac Lage, Sonali Parbhoo, Finale Doshi-Velez

A Survey on Model-based Reinforcement Learning

Reinforcement learning (RL) solves sequential decision-making problems via a trial-and-error process interacting with the environment. While RL achieves outstanding success in playing complex video games that allow huge trial-and-error,…

Machine Learning · Computer Science 2022-06-22 Fan-Ming Luo, Tian Xu, Hang Lai, Xiong-Hui Chen, Weinan Zhang, Yang Yu

Rule induction for global explanation of trained models

Understanding the behavior of a trained network and finding explanations for its outputs is important for improving the network's performance and generalization ability, and for ensuring trust in automated systems. Several approaches have…

Computation and Language · Computer Science 2018-08-30 Madhumita Sushil, Simon Šuster, Walter Daelemans

A Bayesian Perspective of Statistical Machine Learning for Big Data

Statistical Machine Learning (SML) refers to a body of algorithms and methods by which computers are allowed to discover important features of input data sets which are often very large in size. The very task of feature discovery from data…

Machine Learning · Computer Science 2018-11-14 Rajiv Sambasivan, Sourish Das, Sujit K Sahu

Can LLMs Reason with Rules? Logic Scaffolding for Stress-Testing and Improving LLMs

Large language models (LLMs) have achieved impressive human-like performance across various reasoning tasks. However, their mastery of underlying inferential rules still falls short of human capabilities. To investigate this, we propose a…

Computation and Language · Computer Science 2024-06-24 Siyuan Wang, Zhongyu Wei, Yejin Choi, Xiang Ren

Self-supervised Representation Learning From Random Data Projectors

Self-supervised representation learning (SSRL) has advanced considerably by exploiting the transformation invariance assumption under artificially designed data augmentations. While augmentation-based SSRL algorithms push the boundaries of…

Machine Learning · Computer Science 2024-03-22 Yi Sui, Tongzi Wu, Jesse C. Cresswell, Ga Wu, George Stein, Xiao Shi Huang, Xiaochen Zhang, Maksims Volkovs

Sample-efficient Learning of Concepts with Theoretical Guarantees: from Data to Concepts without Interventions

Machine learning is a vital part of many real-world systems, but several concerns remain about the lack of interpretability, explainability and robustness of black-box AI systems. Concept Bottleneck Models (CBM) address some of these…

Machine Learning · Statistics 2025-10-24 Hidde Fokkema, Tim van Erven, Sara Magliacane

A Solvable Model of Neural Scaling Laws

Large language models with a huge number of parameters, when trained on near internet-sized number of tokens, have been empirically shown to obey neural scaling laws: specifically, their performance behaves predictably as a power law in…

Machine Learning · Computer Science 2022-11-01 Alexander Maloney, Daniel A. Roberts, James Sully

A Hitchhiker's Guide to Scaling Law Estimation

Scaling laws predict the loss of a target machine learning model by extrapolating from easier-to-train models with fewer parameters or smaller training sets. This provides an efficient way for practitioners and researchers alike to compare…

Machine Learning · Computer Science 2025-06-04 Leshem Choshen, Yang Zhang, Jacob Andreas

Interpreting Social Respect: A Normative Lens for ML Models

Machine learning is often viewed as an inherently value-neutral process: statistical tendencies in the training inputs are "simply" used to generalize to new examples. However, when models impact social systems such as interactions between…

Computers and Society · Computer Science 2019-08-21 Ben Hutchinson, KJ Pittl, Margaret Mitchell

Training Machine Learning Models by Regularizing their Explanations

Neural networks are among the most accurate supervised learning methods in use today. However, their opacity makes them difficult to trust in critical applications, especially when conditions in training may differ from those in practice.…

Machine Learning · Computer Science 2018-10-03 Andrew Slavin Ross

Machines Learn to Infer Stellar Parameters Just by Looking at a Large Number of Spectra

Machine learning has been widely applied to clearly defined problems of astronomy and astrophysics. However, deep learning and its conceptual differences to classical machine learning have been largely overlooked in these fields. The broad…

Instrumentation and Methods for Astrophysics · Physics 2024-10-15 Nima Sedaghat, Martino Romaniello, Jonathan E. Carrick, François-Xavier Pineau