相关论文: Performance Evaluation for Clustering Algorithms i…

Simulation of clustering algorithms in OODBs in order to evaluate their performances

A good object clustering is critical to the performance of object-oriented databases. However, it always involves some kind of overhead for the system. The aim of this paper is to propose a modelling methodology in order to evaluate the…

数据库 · 计算机科学 2017-01-01 Jérôme Darmont , Amar Attoui , Michel Gourgand

A comparison study of object-oriented database clustering techniques

It is widely acknowledged that a good object clustering is critical to the performance of OODBs. Clustering means storing related objects close together on secondary storage so that when one object is accessed from disk, all its related…

数据库 · 计算机科学 2017-01-01 Jérôme Darmont , Le Gruenwald

OCB: A Generic Benchmark to Evaluate the Performances of Object-Oriented Database Systems

We present in this paper a generic object-oriented benchmark (the Object Clustering Benchmark) that has been designed to evaluate the performances of clustering policies in object-oriented databases. OCB is generic because its sample…

数据库 · 计算机科学 2007-05-23 Jérôme Darmont , Bertrand Petit , Michel Schneider

Performance Comparison for Scientific Computations on the Edge via Relative Performance

In a typical Internet-of-Things setting that involves scientific applications, a target computation can be evaluated in many different ways depending on the split of computations among various devices. On the one hand, different…

性能 · 计算机科学 2022-08-09 Aravind Sankaran , Paolo Bientinesi

An Analytical Approach to Document Clustering Based on Internal Criterion Function

Fast and high quality document clustering is an important task in organizing information, search engine results obtaining from user query, enhancing web crawling and information retrieval. With the large amount of data available and with a…

信息检索 · 计算机科学 2010-03-11 Alok Ranjan , Harish Verma , Eatesh Kandpal , Joydip Dhar

Estimating the Optimal Number of Clusters in Categorical Data Clustering by Silhouette Coefficient

The problem of estimating the number of clusters (say k) is one of the major challenges for the partitional clustering. This paper proposes an algorithm named k-SCC to estimate the optimal k in categorical data clustering. For the…

机器学习 · 计算机科学 2025-01-28 Duy-Tai Dinh , Tsutomu Fujinami , Van-Nam Huynh

Performance Evaluation of Incremental K-means Clustering Algorithm

The incremental K-means clustering algorithm has already been proposed and analysed in paper [Chakraborty and Nagwani, 2011]. It is a very innovative approach which is applicable in periodically incremental environment and dealing with a…

信息检索 · 计算机科学 2014-06-19 Sanjay Chakraborty , N. K. Nagwani

Benchmarking OODBs with a Generic Tool

We present in this paper a generic object-oriented benchmark (OCB: the Object Clustering Benchmark) that has been designed to evaluate the performances of Object-Oriented Data-bases (OODBs), and more specifically the performances of…

数据库 · 计算机科学 2016-11-29 Jérôme Darmont , Michel Schneider

Comparison of Clustering Algorithms for Statistical Features of Vibration Data Sets

Vibration-based condition monitoring systems are receiving increasing attention due to their ability to accurately identify different conditions by capturing dynamic features over a broad frequency range. However, there is little research…

机器学习 · 计算机科学 2023-05-12 Philipp Sepin , Jana Kemnitz , Safoura Rezapour Lakani , Daniel Schall

Improved Performance of Unsupervised Method by Renovated K-Means

Clustering is a separation of data into groups of similar objects. Every group called cluster consists of objects that are similar to one another and dissimilar to objects of other groups. In this paper, the K-Means algorithm is implemented…

机器学习 · 计算机科学 2013-04-03 P. Ashok , G. M Kadhar Nawaz , E. Elayaraja , V. Vadivel

CAS Condensed and Accelerated Silhouette: An Efficient Method for Determining the Optimal K in K-Means Clustering

Clustering is a critical component of decision-making in todays data-driven environments. It has been widely used in a variety of fields such as bioinformatics, social network analysis, and image processing. However, clustering accuracy…

机器学习 · 计算机科学 2025-07-14 Krishnendu Das , Sumit Gupta , Awadhesh Kumar

Efficient techniques for mining spatial databases

Clustering is one of the major tasks in data mining. In the last few years, Clustering of spatial data has received a lot of research attention. Spatial databases are components of many advanced information systems like geographic…

数据库 · 计算机科学 2012-06-04 Mohamed A. El-Zawawy

Rethinking Recommender Systems: Cluster-based Algorithm Selection

Cluster-based algorithm selection deals with selecting recommendation algorithms on clusters of users to obtain performance gains. No studies have been attempted for many combinations of clustering approaches and recommendation algorithms.…

信息检索 · 计算机科学 2024-05-29 Andreas Lizenberger , Ferdinand Pfeifer , Bastian Polewka

Dynamic Clustering in Object-Oriented Databases: An Advocacy for Simplicity

We present in this paper three dynamic clustering techniques for Object-Oriented Databases (OODBs). The first two, Dynamic, Statistical & Tunable Clustering (DSTC) and StatClust, exploit both comprehensive usage statistics and the…

数据库 · 计算机科学 2007-05-23 Jérôme Darmont , Christophe Fromantin , Stéphane Régnier , Le Gruenwald , Michel Schneider

POCS-based Clustering Algorithm

A novel clustering technique based on the projection onto convex set (POCS) method, called POCS-based clustering algorithm, is proposed in this paper. The proposed POCS-based clustering algorithm exploits a parallel projection method of…

机器学习 · 计算机科学 2023-03-24 Le-Anh Tran , Henock M. Deberneh , Truong-Dong Do , Thanh-Dat Nguyen , My-Ha Le , Dong-Chul Park

Point-Set Kernel Clustering

Measuring similarity between two objects is the core operation in existing clustering algorithms in grouping similar objects into clusters. This paper introduces a new similarity measure called point-set kernel which computes the similarity…

机器学习 · 计算机科学 2022-01-07 Kai Ming Ting , Jonathan R. Wells , Ye Zhu

An Efficient $k$-modes Algorithm for Clustering Categorical Datasets

Mining clusters from data is an important endeavor in many applications. The $k$-means method is a popular, efficient, and distribution-free approach for clustering numerical-valued data, but does not apply for categorical-valued…

统计方法学 · 统计学 2021-08-24 Karin S. Dorman , Ranjan Maitra

Comparative Analysis of Optimization Strategies for K-means Clustering in Big Data Contexts: A Review

This paper presents a comparative analysis of different optimization techniques for the K-means algorithm in the context of big data. K-means is a widely used clustering algorithm, but it can suffer from scalability issues when dealing with…

机器学习 · 计算机科学 2024-05-21 Ravil Mussabayev , Rustam Mussabayev

A sampling-based approach for efficient clustering in large datasets

We propose a simple and efficient clustering method for high-dimensional data with a large number of clusters. Our algorithm achieves high-performance by evaluating distances of datapoints with a subset of the cluster centres. Our…

机器学习 · 计算机科学 2022-03-30 Georgios Exarchakis , Omar Oubari , Gregor Lenz

HMS-OS: Improving the Human Mental Search Optimisation Algorithm by Grouping in both Search and Objective Space

The human mental search (HMS) algorithm is a relatively recent population-based metaheuristic algorithm, which has shown competitive performance in solving complex optimisation problems. It is based on three main operators: mental search,…

神经与进化计算 · 计算机科学 2021-12-06 Seyed Jalaleddin Mousavirad , Gerald Schaefer , Iakov Korovin , Diego Oliva , Mahshid Helali Moghadam , Mehrdad Saadatmand