English
Related papers

Related papers: Fast and Memory-Efficient Significant Pattern Mini…

200 papers

Itemset mining has been an active area of research due to its successful application in various data mining scenarios including finding association rules. Though most of the past work has been on finding frequent itemsets, infrequent…

Databases · Computer Science 2012-07-23 Ashish Gupta , Akshay Mittal , Arnab Bhattacharya

Test statistics are often strongly dependent in large-scale multiple testing applications. Most corrections for multiplicity are unduly conservative for correlated test statistics, resulting in a loss of power to detect true positives. We…

Statistics Theory · Mathematics 2012-03-20 Nicolai Meinshausen , Marloes H. Maathuis , Peter Bühlmann

Frequent Subgraph Mining (FSM) is the key task in many graph mining and machine learning applications. Numerous systems have been proposed for FSM in the past decade. Although these systems show good performance for small patterns (with no…

Databases · Computer Science 2021-02-09 Peng Jiang , Rujia Wang , Bo Wu

The discovery of new and interesting patterns in large datasets, known as data mining, draws more and more interest as the quantities of available data are exploding. Data mining techniques may be applied to different domains and fields…

Software Engineering · Computer Science 2012-09-17 Mehdi Adda , Lei Wu , Sharon White , Yi Feng

Learning of interpretable classification models has been attracting much attention for the last few years. Discovery of succinct and contrasting patterns that can highlight the differences between the two classes is very important. Such…

Databases · Computer Science 2020-04-20 Hiroaki Iwashita , Takuya Takagi , Hirofumi Suzuki , Keisuke Goto , Kotaro Ohori , Hiroki Arimura

It is conventionally believed that a permutation test should ideally use all permutations. If this is computationally unaffordable, it is believed one should use the largest affordable Monte Carlo sample or (algebraic) subgroup of…

Statistics Theory · Mathematics 2023-11-27 Nick W. Koning

Frequent Pattern Mining is a one field of the most significant topics in data mining. In recent years, many algorithms have been proposed for mining frequent itemsets. A new algorithm has been presented for mining frequent itemsets based on…

Distributed, Parallel, and Cluster Computing · Computer Science 2017-05-23 Arkan A. G. Al-Hamodi , Songfeng Lu

Maximal frequent patterns superset checking plays an important role in the efficient mining of complete Maximal Frequent Itemsets (MFI) and maximal search space pruning. In this paper we present a new indexing approach, FastLMFI for local…

Databases · Computer Science 2016-11-17 Shariq Bashir , Abdul Rauf Baig

We develop a new, powerful method for counting elements in a multiset. As a first application, we use this algorithm to study the number of occurrences of patterns in a permutation. For patterns of length 3 there are two Wilf classes, and…

Combinatorics · Mathematics 2024-03-05 Andrew R Conway , Anthony J Guttmann

Efficient discovery of frequent itemsets in large datasets is a crucial task of data mining. In recent years, several approaches have been proposed for generating high utility patterns, they arise the problems of producing a large number of…

Databases · Computer Science 2012-12-04 B. Adinarayana Reddy , O. Srinivasa Rao , M. H. M. Krishna Prasad

In this paper we introduce a new type of pattern -- a flipping correlation pattern. The flipping patterns are obtained from contrasting the correlations between items at different levels of abstraction. They represent surprising…

Databases · Computer Science 2015-03-19 Marina Barsky , Sangkyum Kim , Tim Weninger , Jiawei Han

In this paper we propose the Ray-Patch querying, a novel model to efficiently query transformers to decode implicit representations into target views. Our Ray-Patch decoding reduces the computational footprint and increases inference speed…

Computer Vision and Pattern Recognition · Computer Science 2023-08-21 T. Berriel Martins , Javier Civera

Contrast pattern mining (CPM) aims to discover patterns whose support increases significantly from a background dataset compared to a target dataset. CPM is particularly useful for characterising changes in evolving systems, e.g., in…

Networking and Internet Architecture · Computer Science 2020-12-01 Elaheh AlipourChavary , Sarah M. Erfani , Christopher Leckie

With the widespread application of efficient pattern mining algorithms, sequential patterns that allow gap constraints have become a valuable tool to discover knowledge from biological data such as DNA and protein sequences. Among all kinds…

Databases · Computer Science 2023-06-13 Zefeng Chen , Wensheng Gan , Gengsen Huang , Zhenlian Qi , Yan Li , Philip S. Yu

The need to analyze information from streams arises in a variety of applications. One of its fundamental research directions is to mine sequential patterns over data streams. Current studies mine series of items based on the presence of the…

Databases · Computer Science 2022-04-12 Thomas Guyet , Wenbin Zhang , Albert Bifet

Action repetition counting is to estimate the occurrence times of the repetitive motion in one action, which is a relatively new, important but challenging measurement problem. To solve this problem, we propose a new method superior to the…

Computer Vision and Pattern Recognition · Computer Science 2020-03-17 Jianqin Yin , Yanchun Wu , Huaping Liu , Yonghao Dang , Zhiyi Liu , Jun Liu

Real world datasets are sparse, dirty and contain hundreds of items. In such situations, discovering interesting rules (results) using traditional frequent itemset mining approach by specifying a user defined input support threshold is not…

Databases · Computer Science 2009-04-22 Shariq Bashir , Zahoor Jan , Abdul Rauf Baig

Learning permutations is fundamental to sorting, ranking, and matching, but existing differentiable methods based on entropy-regularized Sinkhorn produce a single softened solution and collapse under ambiguity. We present PermFlow, a…

Machine Learning · Computer Science 2026-05-19 Yimeng Min , Carla P. Gomes

We consider the problem of detecting abrupt changes in the distribution of a multi-dimensional time series, with limited computing power and memory. In this paper, we propose a new, simple method for model-free online change-point detection…

Machine Learning · Computer Science 2020-04-02 Nicolas Keriven , Damien Garreau , Iacopo Poli

Sequence discovery tools play a central role in several fields of computational biology. In the framework of Transcription Factor binding studies, motif finding algorithms of increasingly high performance are required to process the big…

Quantitative Methods · Quantitative Biology 2014-08-27 Nicolò Colombo , Nikos Vlassis
‹ Prev 1 2 3 10 Next ›