Related papers: Generalizations of the Goulden-Jackson Cluster Met…

The Goulden-Jackson Cluster Method: Extensions, Applications and Implementations

The powerful (and so far under-utilized) Goulden-Jackson Cluster method for finding the generating function for the number of words avoiding, as factors, the members of a prescribed set of `dirty words', is tutorialized and extended in…

Combinatorics · Mathematics 2007-05-23 John Noonan , Doron Zeilberger

A generalized Goulden-Jackson cluster method and lattice path enumeration

The Goulden-Jackson cluster method is a powerful tool for obtaining generating functions for counting words in a free monoid by occurrences of a set of subwords. We introduce a generalization of the cluster method for monoid networks, which…

Combinatorics · Mathematics 2018-02-20 Yan Zhuang

Words restricted by patterns with at most 2 distinct letters

We find generating functions for the number of words avoiding certain patterns or sets of patterns on at most 2 distinct letters and determine which of them are equally avoided. We also find the exact number of words avoiding certain…

Combinatorics · Mathematics 2007-05-23 Alexander Burstein , Toufik Mansour

A lifting of the Goulden-Jackson cluster method to the Malvenuto-Reutenauer algebra

The Goulden-Jackson cluster method is a powerful tool for counting words by occurrences of prescribed subwords, and was adapted by Elizalde and Noy for counting permutations by occurrences of prescribed consecutive patterns. In this paper,…

Combinatorics · Mathematics 2023-01-12 Yan Zhuang

Words restricted by 3-letter generalized multipermutation patterns

We find exact formulas and/or generating functions for the number of words avoiding 3-letter generalized multipermutation patterns and find which of them are equally avoided.

Combinatorics · Mathematics 2007-05-23 Alexander Burstein , Toufik Mansour

Generating functions for generalized binomial distributions

In a recent article a generalization of the binomial distribution associated with a sequence of positive numbers was examined. The analysis of the nonnegativeness of the formal expressions was a key-point to allow to give them a statistical…

Mathematical Physics · Physics 2015-06-04 H. Bergeron , E. M. F. Curado , J. P. Gazeau , Ligia M. C. S. Rodrigues

Counting occurrences of some subword patterns

We find generating functions the number of strings (words) containing a specified number of occurrences of certain types of order-isomorphic classes of substrings called subword patterns. In particular, we find generating functions for the…

Combinatorics · Mathematics 2007-05-23 A. Burstein , T. Mansour

Counting words with Laguerre series

We develop a method for counting words subject to various restrictions by finding a combinatorial interpretation for a product of weighted sums of Laguerre polynomials with parameter \alpha = -1. We describe how such a series can be…

Combinatorics · Mathematics 2013-06-27 Jair Taylor

Applying the Cluster Method to Count Occurrences of Generalized Permutation Patterns

We apply ideas from the cluster method to q-count the permutations of a multiset according to the number of occurrences of certain generalized patterns, as defined by Babson and Steingrimsson. In particular, we consider those patterns with…

Combinatorics · Mathematics 2009-06-01 Andrew M. Baxter

Generating function for Naturalized Series: The case of Ordered Motzkin Words

We continue to consider the ordered lexicographic sequence, which is constructed according to the formal characteristics of a series of natural numbers. For analysis, we selected balanced parentheses with zeros, Motzkin words. As you know,…

Combinatorics · Mathematics 2020-02-20 Gennady Eremin

Distributional Clustering of English Words

We describe and experimentally evaluate a method for automatically clustering words according to their distribution in particular syntactic contexts. Deterministic annealing is used to find lowest distortion sets of clusters. As the…

cmp-lg · Computer Science 2008-02-03 Fernando Pereira , Naftali Tishby , Lillian Lee

Generalized Negative Binomial Processes and the Representation of Cluster Structures

The paper introduces the concept of a cluster structure to define a joint distribution of the sample size and its exchangeable random partitions. The cluster structure allows the probability distribution of the random partitions of a subset…

Methodology · Statistics 2013-10-08 Mingyuan Zhou

Information-Theoretic Generative Clustering of Documents

We present {\em generative clustering} (GC) for clustering a set of documents, $\mathrm{X}$, by using texts $\mathrm{Y}$ generated by large language models (LLMs) instead of by clustering the original documents $\mathrm{X}$. Because LLMs…

Machine Learning · Computer Science 2024-12-19 Xin Du , Kumiko Tanaka-Ishii

restricted 1-3-2 permutations and generalized patterns

Recently, Babson and Steingrimsson (see [BS]) introduced generalized permutations patterns that allow the requirement that two adjacent letters in a pattern must be adjacent in the permutation. We study generating functions for the number…

Combinatorics · Mathematics 2007-05-23 T. Mansour

Generating trees for permutations avoiding generalized patterns

We construct generating trees with one, two, and three labels for some classes of permutations avoiding generalized patterns of length 3 and 4. These trees are built by adding at each level an entry to the right end of the permutation,…

Combinatorics · Mathematics 2007-08-01 Sergi Elizalde

Hierarchical Latent Word Clustering

This paper presents a new Bayesian non-parametric model by extending the usage of Hierarchical Dirichlet Allocation to extract tree structured word clusters from text data. The inference algorithm of the model collects words in a cluster if…

Computation and Language · Computer Science 2016-01-22 Halid Ziya Yerebakan , Fitsum Reda , Yiqiang Zhan , Yoshihisa Shinagawa

Automation Strategies for Unconstrained Crossword Puzzle Generation

An unconstrained crossword puzzle is a generalization of the constrained crossword problem. In this problem, only the word vocabulary, and optionally the grid dimensions are known. Hence, it not only requires the algorithm to determine the…

Artificial Intelligence · Computer Science 2020-07-10 Charu Agarwal , Rushikesh K. Joshi

Block patterns in generalized Euler Permutations

Goulden and Jackson introduced a very powerful method to study the distributions of certain consecutive patterns in permutations, words, and other combinatorial objects which is now called the cluster method. There are a number of natural…

Combinatorics · Mathematics 2017-06-06 Ran Pan , Jeffrey Brian Remmel

Bubble-Flip -- A New Generation Algorithm for Prefix Normal Words

We present a new recursive generation algorithm for prefix normal words. These are binary strings with the property that no substring has more 1s than the prefix of the same length. The new algorithm uses two operations on binary strings,…

Data Structures and Algorithms · Computer Science 2024-04-16 Ferdinando Cicalese , Zsuzsanna Lipták , Massimiliano Rossi

Brute force searching, the typical set and Guesswork

Consider the situation where a word is chosen probabilistically from a finite list. If an attacker knows the list and can inquire about each word in turn, then selecting the word via the uniform distribution maximizes the attacker's…

Information Theory · Computer Science 2013-05-14 Mark M. Christiansen , Ken R. Duffy , Flavio du Pin Calmon , Muriel Medard