Related papers: Applying the Cluster Method to Count Occurrences o…

A lifting of the Goulden-Jackson cluster method to the Malvenuto-Reutenauer algebra

The Goulden-Jackson cluster method is a powerful tool for counting words by occurrences of prescribed subwords, and was adapted by Elizalde and Noy for counting permutations by occurrences of prescribed consecutive patterns. In this paper,…

Combinatorics · Mathematics 2023-01-12 Yan Zhuang

Counting occurrences of patterns in permutations

We develop a new, powerful method for counting elements in a multiset. As a first application, we use this algorithm to study the number of occurrences of patterns in a permutation. For patterns of length 3 there are two Wilf classes, and…

Combinatorics · Mathematics 2024-03-05 Andrew R Conway , Anthony J Guttmann

Block patterns in generalized Euler Permutations

Goulden and Jackson introduced a very powerful method to study the distributions of certain consecutive patterns in permutations, words, and other combinatorial objects which is now called the cluster method. There are a number of natural…

Combinatorics · Mathematics 2017-06-06 Ran Pan , Jeffrey Brian Remmel

Determining the Number of Clusters via Iterative Consensus Clustering

We use a cluster ensemble to determine the number of clusters, k, in a group of data. A consensus similarity matrix is formed from the ensemble using multiple algorithms and several values for k. A random walk is induced on the graph…

Machine Learning · Statistics 2014-08-06 Shaina Race , Carl Meyer , Kevin Valakuzhy

Counting occurrences of a pattern of type (1,2) or (2,1) in permutations

Babson and Steingr\'{\i}msson introduced generalized permutation patterns that allow the requirement that two adjacent letters in a pattern must be adjacent in the permutation. Claesson presented a complete solution for the number of…

Combinatorics · Mathematics 2007-05-23 Anders Claesson , Toufik Mansour

Statistics on clusters and $r$-Stirling permutations

The Goulden$\unicode{x2013}$Jackson cluster method, adapted to permutations by Elizalde and Noy, reduces the problem of counting permutations by occurrences of a prescribed consecutive pattern to that of counting clusters, which are special…

Combinatorics · Mathematics 2023-05-19 Sergi Elizalde , Justin M. Troyka , Yan Zhuang

Clusters, generating functions and asymptotics for consecutive patterns in permutations

We use the cluster method to enumerate permutations avoiding consecutive patterns. We reprove and generalize in a unified way several known results and obtain new ones, including some patterns of length 4 and 5, as well as some infinite…

Combinatorics · Mathematics 2012-10-24 Sergi Elizalde , Marc Noy

Describing West-3-stack-sortable permutations with permutation patterns

We describe a new method for finding patterns in permutations that produce a given pattern after the permutation has been passed once through a stack. We use this method to describe West-3-stack-sortable permutations, that is, permutations…

Combinatorics · Mathematics 2012-03-13 Henning Úlfarsson

Clustering For Point Pattern Data

Clustering is one of the most common unsupervised learning tasks in machine learning and data mining. Clustering algorithms have been used in a plethora of applications across several scientific fields. However, there has been limited…

Machine Learning · Computer Science 2017-02-09 Quang N. Tran , Ba-Ngu Vo , Dinh Phung , Ba-Tuong Vo

Generalized Negative Binomial Processes and the Representation of Cluster Structures

The paper introduces the concept of a cluster structure to define a joint distribution of the sample size and its exchangeable random partitions. The cluster structure allows the probability distribution of the random partitions of a subset…

Methodology · Statistics 2013-10-08 Mingyuan Zhou

Clustering Co-occurrence of Maximal Frequent Patterns in Streams

One way of getting a better view of data is using frequent patterns. In this paper frequent patterns are subsets that occur a minimal number of times in a stream of itemsets. However, the discovery of frequent patterns in streams has always…

Artificial Intelligence · Computer Science 2007-05-23 Edgar H. de Graaf , Joost N. Kok , Walter A. Kosters

Clustering -- Basic concepts and methods

We review clustering as an analysis tool and the underlying concepts from an introductory perspective. What is clustering and how can clusterings be realised programmatically? How can data be represented and prepared for a clustering task?…

Machine Learning · Computer Science 2022-12-05 Jan-Oliver Felix Kapp-Joswig , Bettina G. Keller

Model-based Clustering

Mixture models extend the toolbox of clustering methods available to the data analyst. They allow for an explicit definition of the cluster shapes and structure within a probabilistic framework and exploit estimation and inference…

Methodology · Statistics 2025-09-15 Bettina Grün

Permutation tests under a rotating sampling plan with clustered data

Consider a population consisting of clusters of sampling units, evolving temporally, spatially, or according to other dynamics. We wish to monitor the evolution of its means, medians, or other parameters. For administrative convenience and…

Methodology · Statistics 2020-04-30 Jiahua Chen , Yukun Liu , James Zidek

Estimating the number of clusters using cross-validation

Many clustering methods, including k-means, require the user to specify the number of clusters as an input parameter. A variety of methods have been devised to choose the number of clusters automatically, but they often rely on strong…

Methodology · Statistics 2017-02-10 Wei Fu , Patrick O. Perry

Refining enumeration schemes to count according to permutation statistics

We consider the question of computing the distribution of a permutation statistics over restricted permutations via enumeration schemes. The restricted permutations are those avoiding sets of vincular patterns (which include both classical…

Combinatorics · Mathematics 2014-01-03 Andrew M. Baxter

A Flexible Iterative Framework for Consensus Clustering

A novel framework for consensus clustering is presented which has the ability to determine both the number of clusters and a final solution using multiple algorithms. A consensus similarity matrix is formed from an ensemble using multiple…

Machine Learning · Statistics 2014-08-06 Shaina Race , Carl Meyer

Model based clustering of multinomial count data

We consider the problem of inferring an unknown number of clusters in replicated multinomial data. Under a model based clustering point of view, this task can be treated by estimating finite mixtures of multinomial distributions with or…

Methodology · Statistics 2023-07-07 Panagiotis Papastamoulis

Estimating the Mean Number of K-Means Clusters to Form

Utilizing the sample size of a dataset, the random cluster model is employed in order to derive an estimate of the mean number of K-Means clusters to form during classification of a dataset.

Machine Learning · Computer Science 2016-02-12 Robert A. Murphy

Clustering Mixed Numeric and Categorical Data: A Cluster Ensemble Approach

Clustering is a widely used technique in data mining applications for discovering patterns in underlying data. Most traditional clustering algorithms are limited to handling datasets that contain either numeric or categorical attributes.…

Artificial Intelligence · Computer Science 2007-05-23 Zengyou He , Xiaofei Xu , Shengchun Deng