Related papers: PAC learning PDFA from data streams

Learning state machines via efficient hashing of future traces

State machines are popular models to model and visualize discrete systems such as software systems, and to represent regular grammars. Most algorithms that passively learn state machines from data assume all the data to be available from…

Formal Languages and Automata Theory · Computer Science 2022-07-05 Robert Baumgartner , Sicco Verwer

Enhancing Automata Learning with Statistical Machine Learning: A Network Security Case Study

Intrusion detection systems are crucial for network security. Verification of these systems is complicated by various factors, including the heterogeneity of network platforms and the continuously changing landscape of cyber threats. In…

Cryptography and Security · Computer Science 2024-07-16 Negin Ayoughi , Shiva Nejati , Mehrdad Sabetzadeh , Patricio Saavedra

Online-to-PAC generalization bounds under graph-mixing dependencies

Traditional generalization results in statistical learning require a training data set made of independently drawn examples. Most of the recent efforts to relax this independence assumption have considered either purely temporal (mixing)…

Machine Learning · Statistics 2024-10-14 Baptiste Abélès , Eugenio Clerico , Gergely Neu

Generalization bounds for mixing processes via delayed online-to-PAC conversions

We study the generalization error of statistical learning algorithms in a non-i.i.d. setting, where the training data is sampled from a stationary mixing process. We develop an analytic framework for this scenario based on a reduction to…

Machine Learning · Computer Science 2025-02-20 Baptiste Abeles , Eugenio Clerico , Gergely Neu

A Markovian Model for Learning-to-Optimize

We present a probabilistic model for stochastic iterative algorithms with the use case of optimization algorithms in mind. Based on this model, we present PAC-Bayesian generalization bounds for functions that are defined on the trajectory…

Machine Learning · Computer Science 2024-08-22 Michael Sucker , Peter Ochs

Online-to-PAC Conversions: Generalization Bounds via Regret Analysis

We present a new framework for deriving bounds on the generalization bound of statistical learning algorithms from the perspective of online learning. Specifically, we construct an online learning game called the "generalization game",…

Machine Learning · Statistics 2024-10-18 Gábor Lugosi , Gergely Neu

Human in the Loop: Interactive Passive Automata Learning via Evidence-Driven State-Merging Algorithms

We present an interactive version of an evidence-driven state-merging (EDSM) algorithm for learning variants of finite state automata. Learning these automata often amounts to recovering or reverse engineering the model generating the data…

Machine Learning · Statistics 2017-08-01 Christian A. Hammerschmidt , Radu State , Sicco Verwer

Predictive PAC Learning and Process Decompositions

We informally call a stochastic process learnable if it admits a generalization error approaching zero in probability for any concept class with finite VC-dimension (IID processes are the simplest example). A mixture of learnable processes…

Machine Learning · Statistics 2015-07-27 Cosma Rohilla Shalizi , Aryeh Kontorovich

Sample Efficient Algorithms for Learning Quantum Channels in PAC Model and the Approximate State Discrimination Problem

We generalize the PAC (probably approximately correct) learning model to the quantum world by generalizing the concepts from classical functions to quantum processes, defining the problem of \emph{PAC learning quantum process}, and study…

Quantum Physics · Physics 2021-05-20 Kai-Min Chung , Han-Hsuan Lin

The Closer You Look, The More You Learn: A Grey-box Approach to Protocol State Machine Learning

In this paper, we propose a new approach to infer state machine models from protocol implementations. Our method, STATEINSPECTOR, learns protocol states by using novel program analyses to combine observations of run-time memory and I/O. It…

Cryptography and Security · Computer Science 2021-06-08 Chris McMahon Stone , Sam L. Thomas , Mathy Vanhoef , James Henderson , Nicolas Bailluet , Tom Chothia

Monotonic Learning in the PAC Framework: A New Perspective

Monotone learning describes learning processes in which expected performance consistently improves as the amount of training data increases. However, recent studies challenge this conventional wisdom, revealing significant gaps in the…

Machine Learning · Computer Science 2025-05-22 Ming Li , Chenyi Zhang , Qin Li

Rethinking Graph-Based Document Classification: Learning Data-Driven Structures Beyond Heuristic Approaches

In document classification, graph-based models effectively capture document structure, overcoming sequence length limitations and enhancing contextual understanding. However, most existing graph document representations rely on heuristics,…

Computation and Language · Computer Science 2025-08-05 Margarita Bugueño , Gerard de Melo

Building Dynamic Knowledge Graphs from Text using Machine Reading Comprehension

We propose a neural machine-reading model that constructs dynamic knowledge graphs from procedural text. It builds these graphs recurrently for each step of the described procedure, and uses them to track the evolving states of participant…

Computation and Language · Computer Science 2018-10-16 Rajarshi Das , Tsendsuren Munkhdalai , Xingdi Yuan , Adam Trischler , Andrew McCallum

Generalization Bounds: Perspectives from Information Theory and PAC-Bayes

A fundamental question in theoretical machine learning is generalization. Over the past decades, the PAC-Bayesian approach has been established as a flexible framework to address the generalization capabilities of machine learning…

Machine Learning · Computer Science 2024-03-28 Fredrik Hellström , Giuseppe Durisi , Benjamin Guedj , Maxim Raginsky

PAC Learning-Based Verification and Model Synthesis

We introduce a novel technique for verification and model synthesis of sequential programs. Our technique is based on learning a regular model of the set of feasible paths in a program, and testing whether this model contains an incorrect…

Software Engineering · Computer Science 2015-11-04 Yu-Fang Chen , Chiao Hsieh , Ondřej Lengál , Tsung-Ju Lii , Ming-Hsien Tsai , Bow-Yaw Wang , Farn Wang

Stream-based State-Machine Replication

Developing state-machine replication protocols for practical use is a complex and labor-intensive process because of the myriad of essential tasks (e.g., deployment, communication, recovery) that need to be taken into account in an…

Distributed, Parallel, and Cluster Computing · Computer Science 2021-06-25 Laura Lawniczak , Tobias Distler

Approximate Learning in Complex Dynamic Bayesian Networks

In this paper we extend the work of Smith and Papamichail (1999) and present fast approximate Bayesian algorithms for learning in complex scenarios where at any time frame, the relationships between explanatory state space variables can be…

Machine Learning · Computer Science 2013-01-30 Raffaella Settimi , Jim Q. Smith , A. S. Gargoum

Learning and Generalization with Mixture Data

In many, if not most, machine learning applications the training data is naturally heterogeneous (e.g. federated learning, adversarial attacks and domain adaptation in neural net training). Data heterogeneity is identified as one of the…

Machine Learning · Statistics 2025-04-30 Harsh Vardhan , Avishek Ghosh , Arya Mazumdar

Sequential Learning of Principal Curves: Summarizing Data Streams on the Fly

When confronted with massive data streams, summarizing data with dimension reduction methods such as PCA raises theoretical and algorithmic pitfalls. Principal curves act as a nonlinear generalization of PCA and the present paper proposes a…

Machine Learning · Statistics 2021-12-16 Benjamin Guedj , Le Li

Unifying PAC and Regret: Uniform PAC Bounds for Episodic Reinforcement Learning

Statistical performance bounds for reinforcement learning (RL) algorithms can be critical for high-stakes applications like healthcare. This paper introduces a new framework for theoretically measuring the performance of such algorithms…

Machine Learning · Computer Science 2018-01-03 Christoph Dann , Tor Lattimore , Emma Brunskill