English
Related papers

Related papers: Stringological sequence prediction I: efficient al…

200 papers

This paper presents a new approach to statistical similarity assessment based on sequence alignment. The algorithm performs mutual matching of two random sequences by successively searching for common elements and by applying sequence…

Signal Processing · Electrical Eng. & Systems 2021-06-09 Jakub Nikonowicz , Łukasz Matuszewski , Paweł Kubczak

We consider the problem of sequential prediction and provide tools to study the minimax value of the associated game. Classical statistical learning theory provides several useful complexity measures to study learning with i.i.d. data. Our…

Machine Learning · Computer Science 2014-08-13 Alexander Rakhlin , Karthik Sridharan , Ambuj Tewari

We have developed a novel prediction method based on string invariants. The method does not require learning but a small set of parameters must be set to achieve optimal performance. We have implemented an evolutionary algorithm for the…

Statistical Finance · Quantitative Finance 2016-06-29 Marek Bundzel , Tomas Kasanicky , Richard Pincak

Approximate inference in dynamic systems is the problem of estimating the state of the system given a sequence of actions and partial observations. High precision estimation is fundamental in many applications like diagnosis, natural…

Artificial Intelligence · Computer Science 2012-06-18 Hannaneh Hajishirzi , Eyal Amir

Algorithms which learn environments represented by automata in the past have had complexity scaling with the number of states in the automaton, which can be exponentially large even for automata recognizing regular expressions with a small…

Formal Languages and Automata Theory · Computer Science 2024-05-13 Ali Cataltepe , Vanessa Kosoy

The linear complexity of a sequence $s$ is one of the measures of its predictability. It represents the smallest degree of a linear recursion which the sequence satisfies. There are several algorithms to find the linear complexity of a…

Cryptography and Security · Computer Science 2019-12-30 Yeow Meng Chee , Johan Chrisnata , Tuvi Etzion , Han Mao Kiah

String matching algorithm plays the vital role in the Computational Biology. The functional and structural relationship of the biological sequence is determined by similarities on that sequence. For that, the researcher is supposed to aware…

Data Structures and Algorithms · Computer Science 2014-01-30 Pandiselvam. P , Marimuthu. T , Lawrance. R

The extension of persistent homology to multi-parameter setups is an algorithmic challenge. Since most computation tasks scale badly with the size of the input complex, an important pre-processing step consists of simplifying the input…

Algebraic Topology · Mathematics 2019-03-19 Ulderico Fugacci , Michael Kerber

Sequence classification algorithms, such as SVM, require a definition of distance (similarity) measure between two sequences. A commonly used notion of similarity is the number of matches between $k$-mers ($k$-length subsequences) in the…

Data Structures and Algorithms · Computer Science 2017-12-13 Muhammad Farhan , Juvaria Tariq , Arif Zaman , Mudassir Shabbir , Imdad Ullah Khan

Sequence classification is the task of predicting a class label given a sequence of observations. In many applications such as healthcare monitoring or intrusion detection, early classification is crucial to prompt intervention. In this…

Machine Learning · Computer Science 2020-10-07 Maayan Shvo , Andrew C. Li , Rodrigo Toro Icarte , Sheila A. McIlraith

Inspired by number series tests to measure human intelligence, we suggest number sequence prediction tasks to assess neural network models' computational powers for solving algorithmic problems. We define the complexity and difficulty of a…

Neural and Evolutionary Computing · Computer Science 2018-11-13 Hyoungwook Nam , Segwang Kim , Kyomin Jung

Stochastic finite automata arise naturally in many language and speech processing tasks. They include stochastic acceptors, which represent certain probability distributions over random strings. We consider the problem of efficient…

Computation and Language · Computer Science 2019-09-24 Martin Jansche , Alexander Gutkin

Sequence segmentation is a well-studied problem, where given a sequence of elements, an integer K, and some measure of homogeneity, the task is to split the sequence into K contiguous segments that are maximally homogeneous. A classic…

Data Structures and Algorithms · Computer Science 2019-02-12 Nikolaj Tatti

We investigate the following problem: given a sample of classified strings, find a first-order sentence of minimal quantifier rank that is consistent with the sample. We represent strings as successor string structures, that is, finite…

Logic in Computer Science · Computer Science 2018-09-11 Thiago Alves Rocha , Ana Teresa Martins , Francicleber Martins Ferreira

Maximum-likelihood estimation (MLE) is widely used in sequence to sequence tasks for model training. It uniformly treats the generation/prediction of each target token as multi-class classification, and yields non-smooth prediction…

Computation and Language · Computer Science 2018-12-13 Chengyue Gong , Xu Tan , Di He , Tao Qin

Online sequence prediction is the problem of predicting the next element of a sequence given previous elements. This problem has been extensively studied in the context of individual sequence prediction, where no prior assumptions are made…

Machine Learning · Computer Science 2012-06-22 Elad Eban , Aharon Birnbaum , Shai Shalev-Shwartz , Amir Globerson

Symbolic regression is a powerful system identification technique in industrial scenarios where no prior knowledge on model structure is available. Such scenarios often require specific model properties such as interpretability, robustness,…

Tasks that model the relation between pairs of tokens in a string are a vital part of understanding natural language. Such tasks, in general, require exhaustive pair-wise comparisons of tokens, thus having a quadratic runtime complexity in…

Computation and Language · Computer Science 2023-12-13 Tianyu Liu , Afra Amini , Mrinmaya Sachan , Ryan Cotterell

A new numerical characterization of symbolic sequences is proposed. The partition of sequence based on Ke and Tong algorithm is a starting point. Algorithm decomposes original sequence into set of distinct subsequences - a patterns. The set…

Quantitative Methods · Quantitative Biology 2011-09-08 B. Kozarzewski

In this paper, we describe a new method for constructing minimal, deterministic, acyclic finite-state automata from a set of strings. Traditional methods consist of two phases: the first to construct a trie, the second one to minimize it.…

Computation and Language · Computer Science 2007-05-23 Jan Daciuk , Stoyan Mihov , Bruce Watson , Richard Watson
‹ Prev 1 2 3 10 Next ›