Related papers: On Prediction Using Variable Order Markov Models

Bayesian Context Trees: Modelling and exact inference for discrete time series

We develop a new Bayesian modelling framework for the class of higher-order, variable-memory Markov chains, and introduce an associated collection of methodological tools for exact inference with discrete time series. We show that a version…

Methodology · Statistics 2022-02-08 Ioannis Kontoyiannis, Lambros Mertzanis, Athina Panotopoulou, Ioannis Papageorgiou, Maria Skoularidou

Adaptive Context Tree Weighting

We describe an adaptive context tree weighting (ACTW) algorithm, as an extension to the standard context tree weighting (CTW) algorithm. Unlike the standard CTW algorithm, which weights all observations equally regardless of the depth, ACTW…

Information Theory · Computer Science 2012-01-11 Alexander O'Neill, Marcus Hutter, Wen Shao, Peter Sunehag

An Information-Theoretic Approach to Understanding Transformers' In-Context Learning of Variable-Order Markov Chains

We study transformers' in-context learning of variable-length Markov chains (VOMCs), focusing on the finite-sample accuracy as the number of in-context examples increases. Compared to fixed-order Markov chains (FOMCs), learning VOMCs is…

Machine Learning · Computer Science 2026-04-01 Ruida Zhou, Chao Tian, Suhas Diggavi

On Learning Prediction-Focused Mixtures

Probabilistic models help us encode latent structures that both model the data and are ideally also useful for specific downstream tasks. Among these, mixture models and their time-series counterparts, hidden Markov models, identify…

Machine Learning · Computer Science 2021-10-29 Abhishek Sharma, Catherine Zeng, Sanjana Narayanan, Sonali Parbhoo, Finale Doshi-Velez

Classification algorithms using adaptive partitioning

Algorithms for binary classification based on adaptive tree partitioning are formulated and analyzed for both their risk performance and their friendliness to numerical implementation. The algorithms can be viewed as generating a set…

Statistics Theory · Mathematics 2014-11-05 Peter Binev, Albert Cohen, Wolfgang Dahmen, Ronald DeVore

Tree-based exploratory identification of predictive biomarkers in observational data

The idea of "stratified medicine" is an important driver of methodological research on the identification of predictive biomarkers. Most methods proposed so far for this purpose have been developed for the use on randomized data only.…

Methodology · Statistics 2022-12-19 Julia Krzykalla, Axel Benner, Annette Kopp-Schneider

Probabilistic Models for High-Order Projective Dependency Parsing

This paper presents generalized probabilistic models for high-order projective dependency parsing and an algorithmic framework for learning these statistical models involving dependency trees. Partition functions and marginals for…

Computation and Language · Computer Science 2015-02-17 Xuezhe Ma, Hai Zhao

Personalized Tree-Based Progressive Regression Model for Watch-Time Prediction in Short Video Recommendation

In online video platforms, accurate watch time prediction has become a fundamental and challenging problem in video recommendation. Previous research has revealed that the accuracy of watch time prediction highly depends on both the…

Information Retrieval · Computer Science 2025-08-26 Xiaokai Chen, Xiao Lin, Changcheng Li, Peng Jiang

Sequential Universal Modeling for Non-Binary Sequences with Constrained Distributions

Sequential probability assignment and universal compression go hand in hand. We propose sequential probability assignment for non-binary (and large alphabet) sequences with empirical distributions whose parameters are known to be bounded…

Information Theory · Computer Science 2021-02-09 Michael Drmota, Gil Shamir, Wojciech Szpankowski

Learning a Machine for the Decision in a Partially Observable Markov Universe

In this paper, we are interested in optimal decisions in a partially observable Markov universe. Our viewpoint departs from the dynamic programming viewpoint: we are directly approximating an optimal strategic tree depending on the…

General Mathematics · Mathematics 2007-05-23 Frederic Dambreville

Convolutional Neural Network Compression Based on Low-Rank Decomposition

Deep neural networks typically impose significant computational loads and memory consumption. Moreover, the large parameters pose constraints on deploying the model on edge devices such as embedded systems. Tensor decomposition offers a…

Computer Vision and Pattern Recognition · Computer Science 2024-08-30 Yaping He, Linhao Jiang, Di Wu

Protein Classification using Machine Learning and Statistical Techniques: A Comparative Analysis

In the recent era, prediction of enzyme class from an unknown protein is one of the challenging tasks in bioinformatics. Day by day the number of proteins increases; as a result, the prediction of enzyme class gives a new opportunity to…

Machine Learning · Computer Science 2019-01-21 Chhote Lal Prasad Gupta, Anand Bihari, Sudhakar Tripathi

A Model-Driven Lossless Compression Algorithm Resistant to Mismatch

Due to the fundamental connection between next-symbol prediction and compression, modern predictive models, such as large language models (LLMs), can be combined with entropy coding to achieve compression rates that surpass those of…

Information Theory · Computer Science 2026-01-27 Cordelia Hu, Jennifer Tang

Context Tree Prior Distributions based on Node Weighting with exact Bayes Factors

Variable-length Markov chains (VLMCs) are a flexible class of higher-order Markov models that admit a natural representation as context trees. Existing Bayesian methods for specifying prior distributions on tree structures rely on branching…

Methodology · Statistics 2026-05-11 Thiago Paulichen, Victor Freguglia

Weakly Convergent Nonparametric Forecasting of Stationary Time Series

The conditional distribution of the next outcome given the infinite past of a stationary process can be inferred from finite but growing segments of the past. Several schemes are known for constructing pointwise consistent estimates, but…

Statistics Theory · Mathematics 2016-11-17 G. Morvai, S. Yakowitz, P. Algoet

A Family of LZ78-based Universal Sequential Probability Assignments

We propose and study a family of universal sequential probability assignments on individual sequences, based on the incremental parsing procedure of the Lempel-Ziv (LZ78) compression algorithm. We show that the normalized log loss under any…

Information Theory · Computer Science 2025-12-15 Naomi Sagan, Tsachy Weissman

The posterior-Viterbi: a new decoding algorithm for hidden Markov models

Background: Hidden Markov models (HMM) are powerful machine learning tools successfully applied to problems of computational Molecular Biology. In a predictive task, the HMM is endowed with a decoding algorithm in order to assign the most…

Biomolecules · Quantitative Biology 2007-05-23 Piero Fariselli, Pier Luigi Martelli, Rita Casadio

Compositional Probabilistic Model Checking with String Diagrams of MDPs

We present a compositional model checking algorithm for Markov decision processes, in which they are composed in the categorical graphical language of string diagrams. The algorithm computes optimal expected rewards. Our theoretical…

Logic in Computer Science · Computer Science 2023-07-19 Kazuki Watanabe, Clovis Eberhart, Kazuyuki Asada, Ichiro Hasuo

Probabilistic Planning with Prioritized Preferences over Temporal Logic Objectives

This paper studies temporal planning in probabilistic environments, modeled as labeled Markov decision processes (MDPs), with user preferences over multiple temporal goals. Existing works reflect such preferences as a prioritized list of…

Formal Languages and Automata Theory · Computer Science 2023-04-25 Lening Li, Hazhar Rahmani, Jie Fu

A Spectral Algorithm for Latent Junction Trees

Latent variable models are an elegant framework for capturing rich probabilistic dependencies in many applications. However, current approaches typically parametrize these models using conditional probability tables, and learning relies…

Machine Learning · Computer Science 2012-10-19 Ankur P. Parikh, Le Song, Mariya Ishteva, Gabi Teodoru, Eric P. Xing