Related papers: Numerical Sequence Prediction using Bayesian Conce…

Human-like Few-Shot Learning via Bayesian Reasoning over Natural Language

A core tension in models of concept learning is that the model must carefully balance the tractability of inference against the expressivity of the hypothesis class. Humans, however, can efficiently learn a broad range of concepts. We…

Computation and Language · Computer Science 2023-10-02 Kevin Ellis

Concept Generalization in Humans and Large Language Models: Insights from the Number Game

We compare human and large language model (LLM) generalization in the number game, a concept inference task. Using a Bayesian model as an analytical framework, we examined the inductive biases and inference strategies of humans and LLMs.…

Artificial Intelligence · Computer Science 2025-12-24 Arghavan Bazigaran, Hansem Sohn

Bayesian Neural Networks with Domain Knowledge Priors

Bayesian neural networks (BNNs) have recently gained popularity due to their ability to quantify model uncertainty. However, specifying a prior for BNNs that captures relevant domain knowledge is often extremely challenging. In this work,…

Machine Learning · Computer Science 2024-02-22 Dylan Sam, Rattana Pukdee, Daniel P. Jeong, Yewon Byun, J. Zico Kolter

Machine Learning and the Future of Bayesian Computation

Bayesian models are a powerful tool for studying complex data, allowing the analyst to encode rich hierarchical dependencies and leverage prior information. Most importantly, they facilitate a complete characterization of uncertainty…

Machine Learning · Statistics 2023-04-25 Steven Winter, Trevor Campbell, Lizhen Lin, Sanvesh Srivastava, David B. Dunson

Bayesian Neural Networks

In recent times, neural networks have become a powerful tool for the analysis of complex and abstract data models. However, their introduction intrinsically increases our uncertainty about which features of the analysis are model-related…

Machine Learning · Statistics 2020-11-09 Tom Charnock, Laurence Perreault-Levasseur, François Lanusse

Bayesian Prior Learning via Neural Networks for Next-item Recommendation

Next-item prediction is a popular problem in the recommender systems domain. As the name suggests, the task is to recommend subsequent items that a user would be interested in given contextual information and historical interaction data.…

Information Retrieval · Computer Science 2022-05-12 Manoj Reddy Dareddy, Zijun Xue, Nicholas Lin, Junghoo Cho

Knowing when we do not know: Bayesian continual learning for sensing-based analysis tasks

Despite much research targeted at enabling conventional machine learning models to continually learn tasks and data distributions sequentially without forgetting the knowledge acquired, little effort has been devoted to account for more…

Machine Learning · Computer Science 2021-06-11 Sandra Servia-Rodriguez, Cecilia Mascolo, Young D. Kwon

A Tutorial on Learning With Bayesian Networks

A Bayesian network is a graphical model that encodes probabilistic relationships among variables of interest. When used in conjunction with statistical techniques, the graphical model has several advantages for data analysis. One, because…

Machine Learning · Computer Science 2022-01-11 David Heckerman

Modeling rapid language learning by distilling Bayesian priors into artificial neural networks

Humans can learn languages from remarkably little experience. Developing computational models that explain this ability has been a major challenge in cognitive science. Bayesian models that build in strong inductive biases - factors that…

Computation and Language · Computer Science 2023-05-25 R. Thomas McCoy, Thomas L. Griffiths

Neuronal Sequence Models for Bayesian Online Inference

Sequential neuronal activity underlies a wide range of processes in the brain. Neuroscientific evidence for neuronal sequences has been reported in domains as diverse as perception, motor control, speech, spatial navigation and memory.…

Adaptation and Self-Organizing Systems · Physics 2020-04-03 Sascha Frölich, Dimitrije Marković, Stefan J. Kiebel

Bayesian Neural Network Ensembles

Ensembles of neural networks (NNs) have long been used to estimate predictive uncertainty; a small number of NNs are trained from different initialisations and sometimes on differing versions of the dataset. The variance of the ensemble's…

Machine Learning · Computer Science 2018-11-30 Tim Pearce, Mohamed Zaki, Andy Neely

Can LSTM Learn to Capture Agreement? The Case of Basque

Sequential neural network models are powerful tools in a variety of Natural Language Processing (NLP) tasks. The sequential nature of these models raises the questions: to what extent can these models implicitly learn hierarchical…

Computation and Language · Computer Science 2018-11-27 Shauli Ravfogel, Francis M. Tyers, Yoav Goldberg

Probabilistic Learning and Generation in Deep Sequence Models

Despite exceptional predictive performance of Deep sequence models (DSMs), the main concern of their deployment centers around the lack of uncertainty awareness. In contrast, probabilistic models quantify the uncertainty associated with…

Machine Learning · Computer Science 2026-03-03 Wenlong Chen

Learning to Represent and Predict Sets with Deep Neural Networks

In this thesis, we develop various techniques for working with sets in machine learning. Each input or output is not an image or a sequence, but a set: an unordered collection of multiple objects, each object described by a feature vector.…

Machine Learning · Computer Science 2021-03-09 Yan Zhang

Bayesian Learning of Neural Network Architectures

In this paper we propose a Bayesian method for estimating architectural parameters of neural networks, namely layer size and network depth. We do this by learning concrete distributions over these parameters. Our results show that regular…

Machine Learning · Statistics 2019-01-29 Georgi Dikov, Patrick van der Smagt, Justin Bayer

Neural Implementation of Probabilistic Models of Cognition

Bayesian models of cognition hypothesize that human brains make sense of data by representing probability distributions and applying Bayes' rule to find the best explanation for available data. Understanding the neural mechanisms underlying…

Neural and Evolutionary Computing · Computer Science 2021-07-02 Milad Kharratzadeh, Thomas R. Shultz

Distilling Symbolic Priors for Concept Learning into Neural Networks

Humans can learn new concepts from a small number of examples by drawing on their inductive biases. These inductive biases have previously been captured by using Bayesian models defined over symbolic hypothesis spaces. Is it possible to…

Machine Learning · Computer Science 2024-02-13 Ioana Marinescu, R. Thomas McCoy, Thomas L. Griffiths

Learning to Continually Learn with the Bayesian Principle

In the present era of deep learning, continual learning research is mainly focused on mitigating forgetting when training a neural network with stochastic gradient descent on a non-stationary stream of data. On the other hand, in the more…

Machine Learning · Computer Science 2024-05-30 Soochan Lee, Hyeonseong Jeon, Jaehyeon Son, Gunhee Kim

Bayesian Neural Network Language Modeling for Speech Recognition

State-of-the-art neural network language models (NNLMs) represented by long short term memory recurrent neural networks (LSTM-RNNs) and Transformers are becoming highly complex. They are prone to overfitting and poor generalization when…

Computation and Language · Computer Science 2022-08-30 Boyang Xue, Shoukang Hu, Junhao Xu, Mengzhe Geng, Xunying Liu, Helen Meng

Learning and Generalization in RNNs

Simple recurrent neural networks (RNNs) and their more advanced cousins LSTMs etc. have been very successful in sequence modeling. Their theoretical understanding, however, is lacking and has not kept pace with the progress for feedforward…

Machine Learning · Computer Science 2021-06-02 Abhishek Panigrahi, Navin Goyal