English
Related papers

Related papers: Stream Clustering using Probabilistic Data Structu…

200 papers

The non-stationary nature of data streams strongly challenges traditional machine learning techniques. Although some solutions have been proposed to extend traditional machine learning techniques for handling data streams, these approaches…

Machine Learning · Computer Science 2021-06-23 Xuyang Yan , Abdollah Homaifar , Mrinmoy Sarkar , Abenezer Girma , Edward Tunstel

The data stream model has been defined for new classes of applications involving massive data being generated at a fast pace. Web click stream analysis and detection of network intrusions are two examples. Cluster analysis on data streams…

Databases · Computer Science 2007-05-23 Zengyou He , Xiaofei Xu , Shengchun Deng , Joshua Zhexue Huang

Number of connected devices is steadily increasing and these devices continuously generate data streams. Real-time processing of data streams is arousing interest despite many challenges. Clustering is one of the most suitable methods for…

Machine Learning · Computer Science 2020-07-22 Alaettin Zubaroğlu , Volkan Atalay

With the dawn of the Big Data era, data sets are growing rapidly. Data is streaming from everywhere - from cameras, mobile phones, cars, and other electronic devices. Clustering streaming data is a very challenging problem. Unlike the…

Machine Learning · Computer Science 2019-02-08 Shlomo Bugdary , Shay Maymon

We introduce a novel algorithm to perform graph clustering in the edge streaming setting. In this model, the graph is presented as a sequence of edges that can be processed strictly once. Our streaming algorithm has an extremely low memory…

Machine Learning · Computer Science 2017-12-13 Alexandre Hollocou , Julien Maudet , Thomas Bonald , Marc Lelarge

In recent years, data streaming has gained prominence due to advances in technologies that enable many applications to generate continuous flows of data. This increases the need to develop algorithms that are able to efficiently process…

Data Structures and Algorithms · Computer Science 2015-03-20 Vaneet Aggarwal , Shankar Krishnan

In this paper, we consider sparse networks consisting of a finite number of non-overlapping communities, i.e. disjoint clusters, so that there is higher density within clusters than across clusters. Both the intra- and inter-cluster edge…

Social and Information Networks · Computer Science 2014-11-06 Se-Young Yun , Marc Lelarge , Alexandre Proutiere

We propose the DPSM method, a density-based node clustering approach that automatically determines the number of clusters and can be applied in both data space and graph space. Unlike traditional density-based clustering methods, which…

Machine Learning · Computer Science 2024-11-05 Feiping Nie , Yitao Song , Jingjing Xue , Rong Wang , Xuelong Li

This article explores and analyzes the unsupervised clustering of large partially observed graphs. We propose a scalable and provable randomized framework for clustering graphs generated from the stochastic block model. The clustering is…

Social and Information Networks · Computer Science 2022-12-06 Mostafa Rahmani , Andre Beckus , Adel Karimian , George Atia

Spectral clustering has been one of the widely used methods for community detection in networks. However, large-scale networks bring computational challenges to the eigenvalue decomposition therein. In this paper, we study the spectral…

Social and Information Networks · Computer Science 2022-01-07 Hai Zhang , Xiao Guo , Xiangyu Chang

In data stream clustering, systematic theory of stream clustering algorithms remains relatively scarce. Recently, density-based methods have gained attention. However, existing algorithms struggle to simultaneously handle arbitrarily…

Machine Learning · Computer Science 2026-05-07 Qifen Zeng , Haomin Bao , Yuanzhuo Hu , Zirui Zhang , Yuheng Zheng , Luosheng Wen

One way of getting a better view of data is using frequent patterns. In this paper frequent patterns are subsets that occur a minimal number of times in a stream of itemsets. However, the discovery of frequent patterns in streams has always…

Artificial Intelligence · Computer Science 2007-05-23 Edgar H. de Graaf , Joost N. Kok , Walter A. Kosters

The analysis of data streams has received considerable attention over the past few decades due to sensors, social media, etc. It aims to recognize patterns in an unordered, infinite, and evolving stream of observations. Clustering this type…

Machine Learning · Computer Science 2022-01-14 Mohammed Oualid Attaoui , Hanene Azzag , Mustapha Lebbah , Nabil Keskes

The problem of analyzing data streams of very large volumes is important and is very desirable for many application domains. In this paper we present and demonstrate effective working of an algorithm to find clusters and anomalous data…

Machine Learning · Computer Science 2025-03-25 Aniket Bhanderi , Raj Bhatnagar

Cut-based directed graph (digraph) clustering often focuses on finding dense within-cluster or sparse between-cluster connections, similar to cut-based undirected graph clustering methods. In contrast, for flow-based clusterings the edges…

Machine Learning · Computer Science 2022-03-04 Koby Hayashi , Sinan G. Aksoy , Haesun Park

Stream clustering is a fundamental problem in many streaming data analysis applications. Comparing to classical batch-mode clustering, there are two key challenges in stream clustering: (i) Given that input data are changing continuously,…

Databases · Computer Science 2017-10-04 Shufeng Gong , Yanfeng Zhang , Ge Yu

In data stream applications, one of the critical issues is to estimate the frequency of each item in the specific multiset. The multiset means that each item in this set can appear multiple times. The data streams in many applications are…

Data Structures and Algorithms · Computer Science 2020-01-07 Ning Li

The amount of data coming from different sources such as IoT-sensors, social networks, cellular networks, has increased exponentially during the last few years. Probabilistic Data Structures (PDS) are efficient alternatives to deterministic…

Data Structures and Algorithms · Computer Science 2022-11-02 Remy Scholler , Jean-Francois Couchot , Oumaima Alaoui-Ismaili , Denis Renaud , Eric Ballot

Practical tools for clustering streaming data must be fast enough to handle the arrival rate of the observations. Typically, they also must adapt on the fly to possible lack of stationarity; i.e., the data statistics may be time-dependent…

Machine Learning · Computer Science 2022-03-01 Or Dinari , Oren Freifeld

We introduce Density sketches (DS): a succinct online summary of the data distribution. DS can accurately estimate point wise probability density. Interestingly, DS also provides a capability to sample unseen novel data from the underlying…

Data Structures and Algorithms · Computer Science 2021-02-25 Aditya Desai , Benjamin Coleman , Anshumali Shrivastava
‹ Prev 1 2 3 10 Next ›