Related papers: The structure of verbal sequences analyzed with un…

200 papers

Data mining is the task of discovering interesting patterns from large amounts of data. There are many data mining tasks, such as classification, clustering, association rule mining, and sequential pattern mining. Sequential pattern mining…

Databases · Computer Science 2010-02-08 Mahdi Esmaeili , Fazekas Gabor

When looking at the structure of natural language, "phrases" and "words" are central notions. We consider the problem of identifying such "meaningful subparts" of language of any length and underlying composition principles in a completely…

Computation and Language · Computer Science 2016-02-19 Stefan Gerdjikov , Klaus U. Schulz

Summarizing event sequences is a key aspect of data mining. Most existing methods neglect conditional dependencies and focus on discovering sequential patterns only. In this paper, we study the problem of discovering both conditional and…

Artificial Intelligence · Computer Science 2025-05-12 Aleena Siji , Joscha Cüppers , Osman Ali Mian , Jilles Vreeken

Data of sequential nature arise in many application domains in forms of, e.g. textual data, DNA sequences, and software execution traces. Different research disciplines have developed methods to learn sequence models from such datasets: (i)…

Machine Learning · Statistics 2018-11-02 Niek Tax , Irene Teinemaa , Sebastiaan J. van Zelst

In this paper we introduce a method to detect words or phrases in a given sequence of alphabets without knowing the lexicon. Our linear time unsupervised algorithm relies entirely on statistical relationships among alphabets in the input…

Computation and Language · Computer Science 2013-12-31 Tamal Chowdhury , Rabindra Rakshit , Arko Banerjee

In this paper we introduce a method for visually analyzing contextualized embeddings produced by deep neural network-based language models. Our approach is inspired by linguistic probes for natural language processing, where tasks are…

Human-Computer Interaction · Computer Science 2020-09-08 Matthew Berger

Nowadays data sets are available in very complex and heterogeneous ways. Mining of such data collections is essential to support many real-world applications ranging from healthcare to marketing. In this work, we focus on the analysis of…

Artificial Intelligence · Computer Science 2015-04-10 Aleksey Buzmakov , Elias Egho , Nicolas Jay , Sergei O. Kuznetsov , Amedeo Napoli , Chedy Raïssi

Unsupervised machine learning, and in particular data clustering, is a powerful approach for the analysis of datasets and identification of characteristic features occurring throughout a dataset. It is gaining popularity across scientific…

Mesoscale and Nanoscale Physics · Physics 2021-03-23 Maria El Abbassi , Jan Overbeck , Oliver Braun , Michel Calame , Herre S. J. van der Zant , Mickael L. Perrin

Language segmentation consists in finding the boundaries where one language ends and another language begins in a text written in more than one language. This is important for all natural language processing tasks. The problem can be solved…

Computation and Language · Computer Science 2015-10-07 David Alfter

We describe a technique to learn the underlying structure of collider events directly from the data, without having a particular theoretical model in mind. It allows to infer aspects of the theoretical model that may have given rise to this…

High Energy Physics - Phenomenology · Physics 2020-12-02 Barry M. Dillon , Darius A. Faroughy , Jernej F. Kamenik , Manuel Szewc

Inducing a meaningful structural representation from one or a set of dialogues is a crucial but challenging task in computational linguistics. Advancement made in this area is critical for dialogue system design and discourse analysis. It…

Computation and Language · Computer Science 2021-03-15 Liang Qiu , Yizhou Zhao , Weiyan Shi , Yuan Liang , Feng Shi , Tao Yuan , Zhou Yu , Song-Chun Zhu

Sequential modelling entails making sense of sequential data, which naturally occurs in a wide array of domains. One example is systems that interact with users, log user actions and behaviour, and make recommendations of items of potential…

Information Retrieval · Computer Science 2021-09-15 Christian Hansen

Clustering is an unsupervised machine learning methodology where unlabeled elements/objects are grouped together aiming to the construction of well-established clusters that their elements are classified according to their similarity. The…

Machine Learning · Statistics 2023-10-20 Dimitrios Saligkaras , Vasileios E. Papageorgiou

Most state-of-the-art systems today produce morphological analysis based only on orthographic patterns. In contrast, we propose a model for unsupervised morphological analysis that integrates orthographic and semantic views of words. We…

Computation and Language · Computer Science 2015-03-10 Karthik Narasimhan , Regina Barzilay , Tommi Jaakkola

Computing universal distributed representations of sentences is a fundamental task in natural language processing. We propose ConsSent, a simple yet surprisingly powerful unsupervised method to learn such representations by enforcing…

Computation and Language · Computer Science 2019-01-25 Siddhartha Brahma

We analyze here a particular kind of linguistic network where vertices represent words and edges stand for syntactic relationships between words. The statistical properties of these networks have been recently studied and various features…

Statistical Mechanics · Physics 2007-05-23 Ramon Ferrer i Cancho , Andrea Capocci , Guido Caldarelli

We present an unsupervised word segmentation model, in which the learning objective is to maximize the generation probability of a sentence given its all possible segmentation. Such generation probability can be factorized into the…

Computation and Language · Computer Science 2021-03-03 Lihao Wang , Zongyi Li , Xiaoqing Zheng

The concept of symbolic sequences plays an important role in the study of complex systems. In this work we are interested in the ultrametric structure of the set of cyclic sequences naturally arising in the theory of dynamical systems. Aimed at construction…

Mathematical Physics · Physics 2016-06-22 Vladimir Al. Osipov

Discovering the semantics of multimodal utterances is essential for understanding human language and enhancing human-machine interactions. Existing methods manifest limitations in leveraging nonverbal information for discerning complex…

Multimedia · Computer Science 2024-05-22 Hanlei Zhang , Hua Xu , Fei Long , Xin Wang , Kai Gao

Unsupervised extractive document summarization aims to select important sentences from a document without using labeled summaries during training. Existing methods are mostly graph-based with sentences as nodes and edge weights measured by…

Computation and Language · Computer Science 2021-12-14 Shusheng Xu , Xingxing Zhang , Yi Wu , Furu Wei , Ming Zhou