Related papers: A Subsequence Interleaving Model for Sequential Pa…

200 papers

An ideal outcome of pattern mining is a small set of informative patterns, containing no redundancy or noise, that identifies the key structure of the data at hand. Standard frequent pattern miners do not achieve this goal, as due to the…

Data Structures and Algorithms · Computer Science 2019-02-11 Nikolaj Tatti, Jilles Vreeken

Discovering the key structure of a database is one of the main goals of data mining. In pattern set mining we do so by discovering a small set of patterns that together describe the data well. The richer the class of patterns we consider,…

Artificial Intelligence · Computer Science 2017-08-11 Apratim Bhattacharyya, Jilles Vreeken

This is about the Minimum Description Length (MDL) principle applied to pattern mining. The length of this description is kept to the minimum. Mining patterns is a core task in data analysis and, beyond issues of efficient enumeration, the…

Databases · Computer Science 2022-07-29 Esther Galbrun

Constrained sequential pattern mining aims at identifying frequent patterns on a sequential database of items while observing constraints defined over the item attributes. We introduce novel techniques for constraint-based sequential…

Machine Learning · Computer Science 2019-01-01 Amin Hosseininasab, Willem-Jan van Hoeve, Andre A. Cire

The quantity of event logs available is increasing rapidly, be they produced by industrial processes, computing systems, or life tracking, for instance. It is thus important to design effective ways to uncover the information they contain.…

Databases · Computer Science 2018-07-06 Esther Galbrun, Peggy Cellier, Nikolaj Tatti, Alexandre Termier, Bruno Crémilleux

The power of sparse signal coding with learned dictionaries has been demonstrated in a variety of applications and fields, from signal processing to statistical inference and machine learning. However, the statistical properties of these…

Information Theory · Computer Science 2010-10-25 Ignacio Ramírez, Guillermo Sapiro

Nowadays, with the rapid development of the Internet, the era of big data has come. The Internet generates huge amounts of data every day. However, extracting meaningful information from massive data is like looking for a needle in a…

Artificial Intelligence · Computer Science 2022-12-21 Xinhong Chen, Wensheng Gan, Shicheng Wan, Tianlong Gu

This is an up-to-date introduction to and overview of the Minimum Description Length (MDL) Principle, a theory of inductive inference that can be applied to general problems in statistics, machine learning and pattern recognition. While MDL…

Methodology · Statistics 2019-12-19 Peter Grünwald, Teemu Roos

Constraint-based pattern discovery is at the core of numerous data mining tasks. Patterns are extracted with respect to a given set of constraints (frequency, closedness, size, etc). In the context of sequential pattern mining, a large…

Artificial Intelligence · Computer Science 2013-11-28 Jean-Philippe Métivier, Samir Loudni, Thierry Charnois

In this uncertain world, data uncertainty is inherent in many applications and its importance is growing drastically due to the rapid development of modern technologies. Nowadays, researchers have paid more attention to mining patterns in…

Sequential pattern mining techniques extract patterns corresponding to frequent subsequences from a sequence database. A practical limitation of these techniques is that they overload the user with too many patterns. Local Process Model…

Data Structures and Algorithms · Computer Science 2018-09-24 Niek Tax, Marlon Dumas

The Minimum Description Length (MDL) principle states that the optimal model for a given data set is that which compresses it best. Due to practical limitations the model can be restricted to a class such as linear regression models, which…

Machine Learning · Statistics 2015-03-13 Florin Popescu, Daniel Renz

The power of sparse signal modeling with learned over-complete dictionaries has been demonstrated in a variety of applications and fields, from signal processing to statistical inference and machine learning. However, the statistical…

Information Theory · Computer Science 2017-04-26 Ignacio Ramírez, Guillermo Sapiro

Most pattern mining methods output a very large number of frequent patterns and isolating a small but relevant subset is a challenging problem of current interest in frequent pattern mining. In this paper we consider discovery of a small…

Databases · Computer Science 2014-10-14 A. Ibrahim, Shivakumar Sastry, P. S. Sastry

Sequential pattern mining (SPM) has excellent prospects and application spaces and has been widely used in different fields. The non-overlapping SPM, as one of the data mining techniques, has been used to discover patterns that have…

Databases · Computer Science 2023-04-25 Zefeng Chen, Wensheng Gan, Gengsen Huang, Yan Li, Zhenlian Qi

We analyze differences between two information-theoretically motivated approaches to statistical inference and model selection: the Minimum Description Length (MDL) principle, and the Minimum Message Length (MML) principle. Based on this…

Machine Learning · Computer Science 2013-02-01 Peter D Grunwald, Petri Kontkanen, Petri Myllymaki, Tomi Silander, Henry Tirri

In pattern mining, sequential rules provide a formal framework to capture the temporal relationships and inferential dependencies between items. However, the discovery process is computationally intensive. To obtain mining results…

Databases · Computer Science 2026-02-20 Wensheng Gan, Gengsen Huang, Junyu Ren, Philip S. Yu

In this paper we address the problem of discovering a small set of frequent serial episodes from sequential data so as to adequately characterize or summarize the data. We discuss an algorithm based on the Minimum Description Length (MDL)…

Machine Learning · Computer Science 2019-04-02 Soumyajit Mitra, P S Sastry

High-dimensional datasets often contain multiple meaningful clusterings in different subspaces. For example, objects can be clustered either by color, weight, or size, revealing different interpretations of the given dataset. A variety of…

Machine Learning · Computer Science 2025-04-08 Collin Leiber, Dominik Mautz, Claudia Plant, Christian Böhm

Data mining is the task of discovering interesting patterns from large amounts of data. There are many data mining tasks, such as classification, clustering, association rule mining, and sequential pattern mining. Sequential pattern mining…

Databases · Computer Science 2010-02-08 Mahdi Esmaeili, Fazekas Gabor