Related papers: Stringological sequence prediction I: efficient al…

Sequence Alignment Algorithm for Statistical Similarity Assessment

This paper presents a new approach to statistical similarity assessment based on sequence alignment. The algorithm performs mutual matching of two random sequences by successively searching for common elements and by applying sequence…

Signal Processing · Electrical Eng. & Systems 2021-06-09 Jakub Nikonowicz , Łukasz Matuszewski , Paweł Kubczak

Online Learning via Sequential Complexities

We consider the problem of sequential prediction and provide tools to study the minimax value of the associated game. Classical statistical learning theory provides several useful complexity measures to study learning with i.i.d. data. Our…

Machine Learning · Computer Science 2014-08-13 Alexander Rakhlin , Karthik Sridharan , Ambuj Tewari

Using String Invariants for Prediction Searching for Optimal Parameters

We have developed a novel prediction method based on string invariants. The method does not require learning but a small set of parameters must be set to achieve optimal performance. We have implemented an evolutionary algorithm for the…

Statistical Finance · Quantitative Finance 2016-06-29 Marek Bundzel , Tomas Kasanicky , Richard Pincak

Sampling First Order Logical Particles

Approximate inference in dynamic systems is the problem of estimating the state of the system given a sequence of actions and partial observations. High precision estimation is fundamental in many applications like diagnosis, natural…

Artificial Intelligence · Computer Science 2012-06-18 Hannaneh Hajishirzi , Eyal Amir

Time complexity for deterministic string machines

Algorithms which learn environments represented by automata in the past have had complexity scaling with the number of states in the automaton, which can be exponentially large even for automata recognizing regular expressions with a small…

Formal Languages and Automata Theory · Computer Science 2024-05-13 Ali Cataltepe , Vanessa Kosoy

Efficient Algorithm for the Linear Complexity of Sequences and Some Related Consequences

The linear complexity of a sequence $s$ is one of the measures of its predictability. It represents the smallest degree of a linear recursion which the sequence satisfies. There are several algorithms to find the linear complexity of a…

Cryptography and Security · Computer Science 2019-12-30 Yeow Meng Chee , Johan Chrisnata , Tuvi Etzion , Han Mao Kiah

A Comparative Study on String Matching Algorithm of Biological Sequences

String matching algorithm plays the vital role in the Computational Biology. The functional and structural relationship of the biological sequence is determined by similarities on that sequence. For that, the researcher is supposed to aware…

Data Structures and Algorithms · Computer Science 2014-01-30 Pandiselvam. P , Marimuthu. T , Lawrance. R

Chunk Reduction for Multi-Parameter Persistent Homology

The extension of persistent homology to multi-parameter setups is an algorithmic challenge. Since most computation tasks scale badly with the size of the input complex, an important pre-processing step consists of simplifying the input…

Algebraic Topology · Mathematics 2019-03-19 Ulderico Fugacci , Michael Kerber

Efficient Approximation Algorithms for String Kernel Based Sequence Classification

Sequence classification algorithms, such as SVM, require a definition of distance (similarity) measure between two sequences. A commonly used notion of similarity is the number of matches between $k$-mers ($k$-length subsequences) in the…

Data Structures and Algorithms · Computer Science 2017-12-13 Muhammad Farhan , Juvaria Tariq , Arif Zaman , Mudassir Shabbir , Imdad Ullah Khan

Interpretable Sequence Classification via Discrete Optimization

Sequence classification is the task of predicting a class label given a sequence of observations. In many applications such as healthcare monitoring or intrusion detection, early classification is crucial to prompt intervention. In this…

Machine Learning · Computer Science 2020-10-07 Maayan Shvo , Andrew C. Li , Rodrigo Toro Icarte , Sheila A. McIlraith

Number Sequence Prediction Problems for Evaluating Computational Powers of Neural Networks

Inspired by number series tests to measure human intelligence, we suggest number sequence prediction tasks to assess neural network models' computational powers for solving algorithmic problems. We define the complexity and difficulty of a…

Neural and Evolutionary Computing · Computer Science 2018-11-13 Hyoungwook Nam , Segwang Kim , Kyomin Jung

Sampling from Stochastic Finite Automata with Applications to CTC Decoding

Stochastic finite automata arise naturally in many language and speech processing tasks. They include stochastic acceptors, which represent certain probability distributions over random strings. We consider the problem of efficient…

Computation and Language · Computer Science 2019-09-24 Martin Jansche , Alexander Gutkin

Fast Sequence Segmentation using Log-Linear Models

Sequence segmentation is a well-studied problem, where given a sequence of elements, an integer K, and some measure of homogeneity, the task is to split the sequence into K contiguous segments that are maximally homogeneous. A classic…

Data Structures and Algorithms · Computer Science 2019-02-12 Nikolaj Tatti

On Finding a First-Order Sentence Consistent with a Sample of Strings

We investigate the following problem: given a sample of classified strings, find a first-order sentence of minimal quantifier rank that is consistent with the sample. We represent strings as successor string structures, that is, finite…

Logic in Computer Science · Computer Science 2018-09-11 Thiago Alves Rocha , Ana Teresa Martins , Francicleber Martins Ferreira

Sentence-wise Smooth Regularization for Sequence to Sequence Learning

Maximum-likelihood estimation (MLE) is widely used in sequence to sequence tasks for model training. It uniformly treats the generation/prediction of each target token as multi-class classification, and yields non-smooth prediction…

Computation and Language · Computer Science 2018-12-13 Chengyue Gong , Xu Tan , Di He , Tao Qin

Learning the Experts for Online Sequence Prediction

Online sequence prediction is the problem of predicting the next element of a sequence given previous elements. This problem has been extensively studied in the context of individual sequence prediction, where no prior assumptions are made…

Machine Learning · Computer Science 2012-06-22 Elad Eban , Aharon Birnbaum , Shai Shalev-Shwartz , Amir Globerson

Symbolic Regression by Exhaustive Search: Reducing the Search Space Using Syntactical Constraints and Efficient Semantic Structure Deduplication

Symbolic regression is a powerful system identification technique in industrial scenarios where no prior knowledge on model structure is available. Such scenarios often require specific model properties such as interpretability, robustness,…

Machine Learning · Computer Science 2021-09-29 Lukas Kammerer , Gabriel Kronberger , Bogdan Burlacu , Stephan M. Winkler , Michael Kommenda , Michael Affenzeller

Linear-Time Modeling of Linguistic Structure: An Order-Theoretic Perspective

Tasks that model the relation between pairs of tokens in a string are a vital part of understanding natural language. Such tasks, in general, require exhaustive pair-wise comparisons of tokens, thus having a quadratic runtime complexity in…

Computation and Language · Computer Science 2023-12-13 Tianyu Liu , Afra Amini , Mrinmaya Sachan , Ryan Cotterell

Similarity of symbolic sequences

A new numerical characterization of symbolic sequences is proposed. The partition of sequence based on Ke and Tong algorithm is a starting point. Algorithm decomposes original sequence into set of distinct subsequences - a patterns. The set…

Quantitative Methods · Quantitative Biology 2011-09-08 B. Kozarzewski

Incremental construction of minimal acyclic finite-state automata

In this paper, we describe a new method for constructing minimal, deterministic, acyclic finite-state automata from a set of strings. Traditional methods consist of two phases: the first to construct a trie, the second one to minimize it.…

Computation and Language · Computer Science 2007-05-23 Jan Daciuk , Stoyan Mihov , Bruce Watson , Richard Watson