Related papers: A Generic Framework for Efficient and Effective Su…

Efficient Non-Learning Similar Subtrajectory Search

Similar subtrajectory search is a finer-grained operator that can better capture the similarities between one query trajectory and a portion of a data trajectory than the traditional similar trajectory search, which requires the two checked…

Databases · Computer Science 2023-08-09 Jiabao Jin , Peng Cheng , Lei Chen , Xuemin Lin , Wenjie Zhang

Generic Subsequence Matching Framework: Modularity, Flexibility, Efficiency

Subsequence matching has appeared to be an ideal approach for solving many problems related to the fields of data mining and similarity retrieval. It has been shown that almost any data class (audio, image, biometrics, signals) is or can be…

Multimedia · Computer Science 2012-06-13 David Novak , Petr Volny , Pavel Zezula

Exact Trajectory Similarity Search With N-tree: An Efficient Metric Index for kNN and Range Queries

Similarity search is the problem of finding in a collection of objects those that are similar to a given query object. It is a fundamental problem in modern applications and the objects considered may be as diverse as locations in space,…

Databases · Computer Science 2024-08-15 Ralf Hartmut Güting , Suvam Kumar Das , Fabio Valdés , Suprio Ray

Faster Approximate Pattern Matching: A Unified Approach

Approximate pattern matching is a natural and well-studied problem on strings: Given a text $T$, a pattern $P$, and a threshold $k$, find (the starting positions of) all substrings of $T$ that are at distance at most $k$ from $P$. We…

Data Structures and Algorithms · Computer Science 2020-11-17 Panagiotis Charalampopoulos , Tomasz Kociumaka , Philip Wellnitz

Inferring the Most Similar Variable-length Subsequences between Multidimensional Time Series

Finding the most similar subsequences between two multidimensional time series has many applications: e.g. capturing dependency in stock market or discovering coordinated movement of baboons. Considering one pattern occurring in one time…

Machine Learning · Computer Science 2025-05-19 Thanadej Rattanakornphan , Piyanon Charoenpoonpanich , Chainarong Amornbunchornvej

Robust Time-Series Retrieval Using Probabilistic Adaptive Segmental Alignment

Traditional pairwise sequence alignment is based on matching individual samples from two sequences, under time monotonicity constraints. However, in many application settings matching subsequences (segments) instead of individual samples…

Databases · Computer Science 2016-09-28 Shahriar Shariat , Vladimir Pavlovic

Free congruence: an exploration of expanded similarity measures for time series data

Time series similarity measures are highly relevant in a wide range of emerging applications including training machine learning models, classification, and predictive modeling. Standard similarity measures for time series most often…

Machine Learning · Computer Science 2021-01-22 Lucas Cassiel Jacaruso

Search Efficiency in Indexing Structures for Similarity Searching

Similarity searching finds application in a wide variety of domains including multilingual databases, computational biology, pattern recognition and text retrieval. Similarity is measured in terms of a distance function, edit distance, in…

Databases · Computer Science 2007-05-23 Girish Motwani , Sandhya G. Nair

A Personalized Dense Retrieval Framework for Unified Information Access

Developing a universal model that can efficiently and effectively respond to a wide range of information access requests -- from retrieval to recommendation to question answering -- has been a long-lasting goal in the information retrieval…

Information Retrieval · Computer Science 2023-04-27 Hansi Zeng , Surya Kallumadi , Zaid Alibadi , Rodrigo Nogueira , Hamed Zamani

Sequence Covering Similarity for Symbolic Sequence Comparison

This paper introduces the sequence covering similarity, that we formally define for evaluating the similarity between a symbolic sequence (string) and a set of symbolic sequences (strings). From this covering similarity we derive a…

Data Structures and Algorithms · Computer Science 2018-03-12 Pierre-François Marteau

Edit distance similarity search, also called approximate pattern matching, is a fundamental problem with widespread database applications. The goal of the problem is to preprocess $n$ strings of length $d$, to quickly answer queries $q$ of…

Data Structures and Algorithms · Computer Science 2020-07-10 Samuel McCauley

Retrieval Based Time Series Forecasting

Time series data appears in a variety of applications such as smart transportation and environmental monitoring. One of the fundamental problems for time series analysis is time series forecasting. Despite the success of recent deep time…

Artificial Intelligence · Computer Science 2022-09-28 Baoyu Jing , Si Zhang , Yada Zhu , Bin Peng , Kaiyu Guan , Andrew Margenot , Hanghang Tong

The SuperM-Tree: Indexing metric spaces with sized objects

A common approach to implementing similarity search applications is the usage of distance functions, where small distances indicate high similarity. In the case of metric distance functions, metric index structures can be used to accelerate…

Data Structures and Algorithms · Computer Science 2019-02-05 Jörg P. Bachmann

HeteSim: A General Framework for Relevance Measure in Heterogeneous Networks

Similarity search is an important function in many applications, which usually focuses on measuring the similarity between objects with the same type. However, in many scenarios, we need to measure the relatedness between objects with…

Information Retrieval · Computer Science 2013-10-01 Chuan Shi , Xiangnan Kong , Yue Huang , Philip S. Yu , Bin Wu

Return of the Lernaean Hydra: Experimental Evaluation of Data Series Approximate Similarity Search

Data series are a special type of multidimensional data present in numerous domains, where similarity search is a key operation that has been extensively studied in the data series literature. In parallel, the multidimensional community has…

Databases · Computer Science 2020-06-23 Karima Echihabi , Kostas Zoumpatianos , Themis Palpanas , Houda Benbrahim

Formal Languages and Algorithms for Similarity based Retrieval from Sequence Databases

The paper considers various formalisms based on Automata, Temporal Logic and Regular Expressions for specifying queries over sequences. Unlike traditional binary semantics, the paper presents a similarity based semantics for thse…

Logic in Computer Science · Computer Science 2007-05-23 A. Prasad Sistla

The Extended Edit Distance Metric

Similarity search is an important problem in information retrieval. This similarity is based on a distance. Symbolic representation of time series has attracted many researchers recently, since it reduces the dimensionality of these high…

Information Retrieval · Computer Science 2010-06-18 Muhammad Marwan Muhammad Fuad , Pierre-François Marteau

The rise of internet has resulted in an explosion of data consisting of millions of articles, images, songs, and videos. Most of this data is high dimensional and sparse. The need to perform an efficient search for similar objects in such…

Data Structures and Algorithms · Computer Science 2016-12-20 Raghav Kulkarni , Rameshwar Pratap

Good parts first - a new algorithm for approximate search in lexica and string databases

We present a new efficient method for approximate search in electronic lexica. Given an input string (the pattern) and a similarity threshold, the algorithm retrieves all entries of the lexicon that are sufficiently similar to the pattern.…

Computation and Language · Computer Science 2015-12-04 Stefan Gerdjikov , Stoyan Mihov , Petar Mitankin , Klaus U. Schulz

Learning Correlation Space for Time Series

We propose an approximation algorithm for efficient correlation search in time series data. In our method, we use Fourier transform and neural network to embed time series into a low-dimensional Euclidean space. The given space is learned…

Machine Learning · Computer Science 2018-05-16 Han Qiu , Hoang Thanh Lam , Francesco Fusco , Mathieu Sinn