Related papers: Discrete-Continuous Mixtures in Probabilistic Prog…

Probabilistic Language Tries: A Unified Framework for Compression, Decision Policies, and Execution Reuse

We introduce probabilistic language tries (PLTs), a unified representation that makes explicit the prefix structure implicitly defined by any generative model over sequences. By assigning to each outgoing edge the conditional probability of…

Machine Learning · Computer Science 2026-04-09 Gregory Magarshak

Probabilistic Subspace Manifolds for Contextual Inference in Large Language Models

Representing token embeddings as probability distributions over learned manifolds allows for more flexible contextual inference, reducing representational rigidity while enhancing semantic granularity. Comparative evaluations demonstrate…

Computation and Language · Computer Science 2025-04-25 Christopher Nightingale , Dominic Lavington , Jonathan Thistlethwaite , Sebastian Penhaligon , Thomas Belinski , David Boldo

Monolingual Probabilistic Programming Using Generalized Coroutines

Probabilistic programming languages and modeling toolkits are two modular ways to build and reuse stochastic models and inference procedures. Combining strengths of both, we express models and inference as generalized coroutines in the same…

Programming Languages · Computer Science 2012-05-14 Oleg Kiselyov , Chung-chieh Shan

MaskPro: Linear-Space Probabilistic Learning for Strict (N:M)-Sparsity on LLMs

The rapid scaling of large language models~(LLMs) has made inference efficiency a primary bottleneck in the practical deployment. To address this, semi-structured sparsity offers a promising solution by strategically retaining $N$ elements…

Machine Learning · Computer Science 2026-05-14 Yan Sun , Qixin Zhang , Zhiyuan Yu , Xikun Zhang , Li Shen , Dacheng Tao

Deep networks learn to parse uniform-depth context-free languages from local statistics

Understanding how the structure of language can be learned from sentences alone is a central question in both cognitive science and machine learning. Studies of the internal representations of Large Language Models (LLMs) support their…

Machine Learning · Statistics 2026-02-10 Jack T. Parley , Francesco Cagnetta , Matthieu Wyart

LLM Generated Distribution-Based Prediction of US Electoral Results, Part I

This paper introduces distribution-based prediction, a novel approach to using Large Language Models (LLMs) as predictive tools by interpreting output token probabilities as distributions representing the models' learned representation of…

Artificial Intelligence · Computer Science 2024-11-07 Caleb Bradshaw , Caelen Miller , Sean Warnick

Neural Networks Processing Mean Values of Random Variables

We introduce a class of neural networks derived from probabilistic models in the form of Bayesian belief networks. By imposing additional assumptions about the nature of the probabilistic models represented in the belief networks, we derive…

Disordered Systems and Neural Networks · Physics 2007-05-23 M. J. Barber , J. W. Clark , C. H. Anderson

Integrating Probabilistic Rules into Neural Networks: A Stochastic EM Learning Algorithm

The EM-algorithm is a general procedure to get maximum likelihood estimates if part of the observations on the variables of a network are missing. In this paper a stochastic version of the algorithm is adapted to probabilistic neural…

Artificial Intelligence · Computer Science 2013-03-26 Gerhard Paass

Word Embedding Algorithms as Generalized Low Rank Models and their Canonical Form

Word embedding algorithms produce very reliable feature representations of words that are used by neural network models across a constantly growing multitude of NLP tasks. As such, it is imperative for NLP practitioners to understand how…

Computation and Language · Computer Science 2019-11-11 Kian Kenyon-Dean

Combining Constraint Programming Reasoning with Large Language Model Predictions

Constraint Programming (CP) and Machine Learning (ML) face challenges in text generation due to CP's struggle with implementing "meaning'' and ML's difficulty with structural constraints. This paper proposes a solution by combining both…

Computation and Language · Computer Science 2024-09-26 Florian Régin , Elisabetta De Maria , Alexandre Bonlarron

Incoherent Probability Judgments in Large Language Models

Autoregressive Large Language Models (LLMs) trained for next-word prediction have demonstrated remarkable proficiency at producing coherent text. But are they equally adept at forming coherent probability judgments? We use probabilistic…

Computation and Language · Computer Science 2025-05-07 Jian-Qiao Zhu , Thomas L. Griffiths

Learning What Matters: Probabilistic Task Selection via Mutual Information for Model Finetuning

The performance of finetuned large language models (LLMs) hinges critically on the composition of the training mixture. However, selecting an optimal blend of task datasets remains a largely manual, heuristic driven process, with…

Machine Learning · Computer Science 2025-08-08 Prateek Chanda , Saral Sureka , Parth Pratim Chatterjee , Krishnateja Killamsetty , Nikhil Shivakumar Nayak , Ganesh Ramakrishnan

Ensemble Learning for Large Language Models in Text and Code Generation: A Survey

Generative Pretrained Transformers (GPTs) are foundational Large Language Models (LLMs) for text generation. However, individual LLMs often produce inconsistent outputs and exhibit biases, limiting their representation of diverse language…

Computation and Language · Computer Science 2025-08-06 Mari Ashiga , Wei Jie , Fan Wu , Vardan Voskanyan , Fateme Dinmohammadi , Paul Brookes , Jingzhi Gong , Zheng Wang

Deep Multivariate Models with Parametric Conditionals

We consider deep multivariate models for heterogeneous collections of random variables. In the context of computer vision, such collections may e.g. consist of images, segmentations, image attributes, and latent variables. When developing…

Machine Learning · Computer Science 2026-02-03 Dmitrij Schlesinger , Boris Flach , Alexander Shekhovtsov

Large language models in climate and sustainability policy: limits and opportunities

As multiple crises threaten the sustainability of our societies and pose at risk the planetary boundaries, complex challenges require timely, updated, and usable information. Natural-language processing (NLP) tools enhance and expand data…

Computers and Society · Computer Science 2025-02-05 Francesca Larosa , Sergio Hoyas , H. Alberto Conejero , Javier Garcia-Martinez , Francesco Fuso Nerini , Ricardo Vinuesa

Generative Datalog with Continuous Distributions

Arguing for the need to combine declarative and probabilistic programming, B\'ar\'any et al. (TODS 2017) recently introduced a probabilistic extension of Datalog as a "purely declarative probabilistic programming language." We revisit this…

Databases · Computer Science 2022-02-17 Martin Grohe , Benjamin Lucien Kaminski , Joost-Pieter Katoen , Peter Lindner

Reading Between the Tokens: Improving Preference Predictions through Mechanistic Forecasting

Large language models are increasingly used to predict human preferences in both scientific and business endeavors, yet current approaches rely exclusively on analyzing model outputs without considering the underlying mechanisms. Using…

Computers and Society · Computer Science 2026-02-04 Sarah Ball , Simeon Allmendinger , Niklas Kühl , Frauke Kreuter

Optimising Density Computations in Probabilistic Programs via Automatic Loop Vectorisation

Probabilistic programming languages (PPLs) are a popular tool for high-level modelling across many fields. They provide a range of algorithms for probabilistic inference, which analyse models by learning their parameters from a dataset or…

Programming Languages · Computer Science 2025-11-17 Sangho Lim , Hyoungjin Lim , Wonyeol Lee , Xavier Rival , Hongseok Yang

Subsampling MCMC for Bayesian Variable Selection and Model Averaging in BGNLM

Bayesian Generalized Nonlinear Models (BGNLM) offer a flexible nonlinear alternative to GLM while still providing better interpretability than machine learning techniques such as neural networks. In BGNLM, the methods of Bayesian Variable…

Computation · Statistics 2023-12-29 Jon Lachmann , Aliaksandr Hubin

On the Reasoning Capacity of AI Models and How to Quantify It

Recent advances in Large Language Models (LLMs) have intensified the debate surrounding the fundamental nature of their reasoning capabilities. While achieving high performance on benchmarks such as GPQA and MMLU, these models exhibit…

Artificial Intelligence · Computer Science 2025-01-24 Santosh Kumar Radha , Oktay Goktas