Related papers: Summarization Techniques for Pattern Collections i…

Graph Summarization Methods and Applications: A Survey

While advances in computing resources have made processing enormous amounts of data possible, human ability to identify patterns in such data has not scaled accordingly. Efficient computational methods for condensing and simplifying data…

Information Retrieval · Computer Science 2020-04-03 Yike Liu , Tara Safavi , Abhilash Dighe , Danai Koutra

A Model for Managing Collections of Patterns

Data mining algorithms are now able to efficiently deal with huge amount of data. Various kinds of patterns may be discovered and may have some great impact on the general development of knowledge. In many domains, end users may want to…

Artificial Intelligence · Computer Science 2009-02-09 Baptiste Jeudy , Christine Largeron , François Jacquenet

Graph Summarization

The continuous and rapid growth of highly interconnected datasets, which are both voluminous and complex, calls for the development of adequate processing and analytical techniques. One method for condensing and simplifying such datasets is…

Databases · Computer Science 2020-05-13 Angela Bonifati , Stefania Dumbrava , Haridimos Kondylakis

Pattern-Based Classification: A Unifying Perspective

The use of patterns in predictive models is a topic that has received a lot of attention in recent years. Pattern mining can help to obtain models for structured domains, such as graphs and sequences, and has been proposed as a means to…

Artificial Intelligence · Computer Science 2011-11-29 Björn Bringmann , Siegfried Nijssen , Albrecht Zimmermann

Finding Sequential Patterns from Large Sequence Data

Data mining is the task of discovering interesting patterns from large amounts of data. There are many data mining tasks, such as classification, clustering, association rule mining, and sequential pattern mining. Sequential pattern mining…

Databases · Computer Science 2010-02-08 Mahdi Esmaeili , Fazekas Gabor

Efficiently Summarising Event Sequences with Rich Interleaving Patterns

Discovering the key structure of a database is one of the main goals of data mining. In pattern set mining we do so by discovering a small set of patterns that together describe the data well. The richer the class of patterns we consider,…

Artificial Intelligence · Computer Science 2017-08-11 Apratim Bhattacharyya , Jilles Vreeken

Abstract Representations and Frequent Pattern Discovery

We discuss the frequent pattern mining problem in a general setting. From an analysis of abstract representations, summarization and frequent pattern mining, we arrive at a generalization of the problem. Then, we show how the problem can be…

Artificial Intelligence · Computer Science 2012-02-13 Eray Ozkural

Multi-source Data Mining for e-Learning

Data mining is the task of discovering interesting, unexpected or valuable structures in large datasets and transforming them into an understandable structure for further use . Different approaches in the domain of data mining have been…

Databases · Computer Science 2020-09-21 Julie Bu Daher , Armelle Brun , Anne Boyer

The Long and the Short of It: Summarising Event Sequences with Serial Episodes

An ideal outcome of pattern mining is a small set of informative patterns, containing no redundancy or noise, that identifies the key structure of the data at hand. Standard frequent pattern miners do not achieve this goal, as due to the…

Data Structures and Algorithms · Computer Science 2019-02-11 Nikolaj Tatti , Jilles Vreeken

Data Partitioning View of Mining Big Data

There are two main approximations of mining big data in memory. One is to partition a big dataset to several subsets, so as to mine each subset in memory. By this way, global patterns can be obtained by synthesizing all local patterns…

Databases · Computer Science 2016-11-30 Shichao Zhang

A Comprehensive Survey on Graph Reduction: Sparsification, Coarsening, and Condensation

Many real-world datasets can be naturally represented as graphs, spanning a wide range of domains. However, the increasing complexity and size of graph datasets present significant challenges for analysis and computation. In response, graph…

Social and Information Networks · Computer Science 2024-07-02 Mohammad Hashemi , Shengbo Gong , Juntong Ni , Wenqi Fan , B. Aditya Prakash , Wei Jin

Data Stream Clustering: Challenges and Issues

Very large databases are required to store massive amounts of data that are continuously inserted and queried. Analyzing huge data sets and extracting valuable pattern in many applications are interesting for researchers. We can identify…

Databases · Computer Science 2010-06-29 Madjid Khalilian , Norwati Mustapha

Data Distillation: A Survey

The popularity of deep learning has led to the curation of a vast number of massive and multifarious datasets. Despite having close-to-human performance on individual tasks, training parameter-hungry models on large datasets poses…

Machine Learning · Computer Science 2023-09-27 Noveen Sachdeva , Julian McAuley

Graph-based Ontology Summarization: A Survey

Ontologies have been widely used in numerous and varied applications, e.g., to support data modeling, information integration, and knowledge management. With the increasing size of ontologies, ontology understanding, which is playing an…

Information Retrieval · Computer Science 2018-05-17 Seyedamin Pouriyeh , Mehdi Allahyari , Qingxia Liu , Gong Cheng , Hamid Reza Arabnia , Yuzhong Qu , Krys Kochut

Hierarchical Clustering for Finding Symmetries and Other Patterns in Massive, High Dimensional Datasets

Data analysis and data mining are concerned with unsupervised pattern finding and structure determination in data sets. "Structure" can be understood as symmetry and a range of symmetries are expressed by hierarchy. Such symmetries directly…

Machine Learning · Statistics 2015-03-17 Fionn Murtagh , Pedro Contreras

A Supervised Approach to Extractive Summarisation of Scientific Papers

Automatic summarisation is a popular approach to reduce a document to its main arguments. Recent research in the area has focused on neural approaches to summarisation, which can be very data-hungry. However, few large datasets exist and…

Computation and Language · Computer Science 2017-06-14 Ed Collins , Isabelle Augenstein , Sebastian Riedel

Towards Personalized and Human-in-the-Loop Document Summarization

The ubiquitous availability of computing devices and the widespread use of the internet have generated a large amount of data continuously. Therefore, the amount of available information on any given topic is far beyond humans' processing…

Artificial Intelligence · Computer Science 2023-07-11 Samira Ghodratnama

Machine Learning of Generic and User-Focused Summarization

A key problem in text summarization is finding a salience function which determines what information in the source should be included in the summary. This paper describes the use of machine learning on a training corpus of documents and…

Computation and Language · Computer Science 2007-05-23 Inderjeet Mani , Eric Bloedorn

Clustering For Point Pattern Data

Clustering is one of the most common unsupervised learning tasks in machine learning and data mining. Clustering algorithms have been used in a plethora of applications across several scientific fields. However, there has been limited…

Machine Learning · Computer Science 2017-02-09 Quang N. Tran , Ba-Ngu Vo , Dinh Phung , Ba-Tuong Vo

Time-to-Pattern: Information-Theoretic Unsupervised Learning for Scalable Time Series Summarization

Data summarization is the process of generating interpretable and representative subsets from a dataset. Existing time series summarization approaches often search for recurring subsequences using a set of manually devised similarity…

Machine Learning · Computer Science 2023-08-29 Alireza Ghods , Trong Nghia Hoang , Diane Cook