Related papers: String comparison by transposition networks

Semi-local string comparison: algorithmic techniques and applications

A classical measure of string comparison is given by the longest common subsequence (LCS) problem on a pair of strings. We consider its generalisation, called the semi-local LCS problem, which arises naturally in many string-related…

Data Structures and Algorithms · Computer Science 2015-03-13 Alexander Tiskin

A Comparative Study on String Matching Algorithm of Biological Sequences

String matching algorithm plays the vital role in the Computational Biology. The functional and structural relationship of the biological sequence is determined by similarities on that sequence. For that, the researcher is supposed to aware…

Data Structures and Algorithms · Computer Science 2014-01-30 Pandiselvam. P , Marimuthu. T , Lawrance. R

Calibrating Analytical Models for Semilocal Strings

In this work we calibrate two different analytic models of semilocal strings by constraining the values of their free parameters. In order to do so, we use data obtained from the largest and most accurate field theory simulations of…

High Energy Physics - Phenomenology · Physics 2015-05-20 A. Lopez-Eiguren

Transduce: learning transduction grammars for string transformation

The synthesis of string transformation programs from input-output examples utilizes various techniques, all based on an inductive bias that comprises a restricted set of basic operators to be combined. A new algorithm, Transduce, is…

Machine Learning · Computer Science 2024-01-19 Francis Frydman , Philippe Mangion

Evolution of Semilocal String Networks: I. Large-scale Properties

We report on a detailed numerical study of the evolution of semilocal string networks, based on the largest and most accurate field theory simulations of these objects to date. We focus on the large-scale network properties, confirming…

High Energy Physics - Phenomenology · Physics 2014-03-24 A. Achúcarro , A. Avgoustidis , A. M. M. Leite , A. Lopez-Eiguren , C. J. A. P. Martins , A. S. Nunes , J. Urrestilla

Combining a Context Aware Neural Network with a Denoising Autoencoder for Measuring String Similarities

Measuring similarities between strings is central for many established and fast growing research areas including information retrieval, biology, and natural language processing. The traditional approach for string similarity measurements is…

Information Retrieval · Computer Science 2018-08-20 Mehdi Ben Lazreg , Morten Goodwin

Linear-Time Modeling of Linguistic Structure: An Order-Theoretic Perspective

Tasks that model the relation between pairs of tokens in a string are a vital part of understanding natural language. Such tasks, in general, require exhaustive pair-wise comparisons of tokens, thus having a quadratic runtime complexity in…

Computation and Language · Computer Science 2023-12-13 Tianyu Liu , Afra Amini , Mrinmaya Sachan , Ryan Cotterell

Machine Translation Model based on Non-parallel Corpus and Semi-supervised Transductive Learning

Although the parallel corpus has an irreplaceable role in machine translation, its scale and coverage is still beyond the actual needs. Non-parallel corpus resources on the web have an inestimable potential value in machine translation and…

Computation and Language · Computer Science 2014-05-23 Lijiang Chen

A Novel Algorithm for String Matching with Mismatches

We present an online algorithm to deal with pattern matching in strings. The problem we investigate is commonly known as string matching with mismatches in which the objective is to report the number of characters that match when a pattern…

Data Structures and Algorithms · Computer Science 2016-03-11 Vinodprasad P

Transductive Learning with String Kernels for Cross-Domain Text Classification

For many text classification tasks, there is a major problem posed by the lack of labeled data in a target domain. Although classifiers for a target domain can be trained on labeled text data from a related source domain, the accuracy of…

Computation and Language · Computer Science 2018-11-06 Radu Tudor Ionescu , Andrei M. Butnaru

Comparing Two Counting Methods for Estimating the Probabilities of Strings

There are two methods for counting the number of occurrences of a string in another large string. One is to count the number of places where the string is found. The other is to determine how many pieces of string can be extracted without…

Data Structures and Algorithms · Computer Science 2022-11-09 Ayaka Takamoto , Mitsuo Yoshida , Kyoji Umemura

Network Alignment

Complex networks are frequently employed to model physical or virtual complex systems. When certain entities exist across multiple systems simultaneously, unveiling their corresponding relationships across the networks becomes crucial. This…

Physics and Society · Physics 2025-04-16 Rui Tang , Ziyun Yong , Shuyu Jiang , Xingshu Chen , Yaofang Liu , Yi-Cheng Zhang , Gui-Quan Sun , Wei Wang

Unaligned Sequence Similarity Search Using Deep Learning

Gene annotation has traditionally required direct comparison of DNA sequences between an unknown gene and a database of known ones using string comparison methods. However, these methods do not provide useful information when a gene does…

Machine Learning · Computer Science 2019-09-17 James K. Senter , Taylor M. Royalty , Andrew D. Steen , Amir Sadovnik

The Exact String Matching Problem: a Comprehensive Experimental Evaluation

This paper addresses the online exact string matching problem which consists in finding all occurrences of a given pattern p in a text t. It is an extensively studied problem in computer science, mainly due to its direct applications to…

Data Structures and Algorithms · Computer Science 2010-12-14 Simone Faro , Thierry Lecroq

Approximate String Matching: Theory and Applications (La Recherche Approch\'ee de Motifs : Th\'eorie et Applications)

The approximate string matching is a fundamental and recurrent problem that arises in most computer science fields. This problem can be defined as follows: Let $D=\{x_1,x_2,\ldots x_d\}$ be a set of $d$ words defined on an alphabet…

Data Structures and Algorithms · Computer Science 2017-01-31 Ibrahim Chegrane

Alignments as Compositional Structures

Alignments, i.e., position-wise comparisons of two or more strings or ordered lists are of utmost practical importance in computational biology and a host of other fields, including historical linguistics and emerging areas of research in…

Combinatorics · Mathematics 2018-10-19 Sarah J. Berkemer , Christian Höner zu Siederdissen , Peter F. Stadler

Semi-Supervised Learning on Graphs Based on Local Label Distributions

Most approaches that tackle the problem of node classification consider nodes to be similar, if they have shared neighbors or are close to each other in the graph. Recent methods for attributed graphs additionally take attributes of…

Machine Learning · Computer Science 2018-05-23 Evgeniy Faerman , Felix Borutta , Julian Busch , Matthias Schubert

Linear Transformations for Cross-lingual Semantic Textual Similarity

Cross-lingual semantic textual similarity systems estimate the degree of the meaning similarity between two sentences, each in a different language. State-of-the-art algorithms usually employ machine translation and combine vast amount of…

Computation and Language · Computer Science 2018-07-12 Tomáš Brychcín

Message Passing Algorithms for Sparse Network Alignment

Network alignment generalizes and unifies several approaches for forming a matching or alignment between the vertices of two graphs. We study a mathematical programming framework for network alignment problem and a sparse variation of it…

Optimization and Control · Mathematics 2011-11-03 Mohsen Bayati , David F. Gleich , Amin Saberi , Ying Wang

Accurate Online Posterior Alignments for Principled Lexically-Constrained Decoding

Online alignment in machine translation refers to the task of aligning a target word to a source word when the target sequence has only been partially decoded. Good online alignments facilitate important applications such as lexically…

Computation and Language · Computer Science 2022-04-05 Soumya Chatterjee , Sunita Sarawagi , Preethi Jyothi