Related papers: Approximation algorithms for stochastic clustering

Clustering under Local Stability: Bridging the Gap between Worst-Case and Beyond Worst-Case Analysis

Recently, there has been substantial interest in clustering research that takes a beyond worst-case approach to the analysis of algorithms. The typical idea is to design a clustering algorithm that outputs a near-optimal solution, provided…

Data Structures and Algorithms · Computer Science 2018-12-31 Maria-Florina Balcan, Colin White

Local algorithms for interactive clustering

We study the design of interactive clustering algorithms for data sets satisfying natural stability assumptions. Our algorithms start with any initial clustering and only make local changes in each step; both are desirable features in many…

Data Structures and Algorithms · Computer Science 2015-03-23 Pranjal Awasthi, Maria-Florina Balcan, Konstantin Voevodski

A Short Survey on Data Clustering Algorithms

With rapidly increasing data, clustering algorithms are important tools for data analytics in modern research. They have been successfully applied to a wide range of domains; for instance, bioinformatics, speech recognition, and financial…

Data Structures and Algorithms · Computer Science 2015-12-01 Ka-Chun Wong

A sampling-based approach for efficient clustering in large datasets

We propose a simple and efficient clustering method for high-dimensional data with a large number of clusters. Our algorithm achieves high performance by evaluating distances of datapoints with a subset of the cluster centres. Our…

Machine Learning · Computer Science 2022-03-30 Georgios Exarchakis, Omar Oubari, Gregor Lenz

A Rapid Review of Clustering Algorithms

Clustering algorithms aim to organize data into groups or clusters based on the inherent patterns and similarities within the data. They play an important role in today's life, such as in marketing and e-commerce, healthcare, data…

Machine Learning · Computer Science 2024-01-17 Hui Yin, Amir Aryani, Stephen Petrie, Aishwarya Nambissan, Aland Astudillo, Shengyuan Cao

Rethinking Recommender Systems: Cluster-based Algorithm Selection

Cluster-based algorithm selection deals with selecting recommendation algorithms on clusters of users to obtain performance gains. No studies have been attempted for many combinations of clustering approaches and recommendation algorithms.…

Information Retrieval · Computer Science 2024-05-29 Andreas Lizenberger, Ferdinand Pfeifer, Bastian Polewka

Clustering with fair-center representation: parameterized approximation algorithms and heuristics

We study a variant of classical clustering formulations in the context of algorithmic fairness, known as diversity-aware clustering. In this variant we are given a collection of facility subsets, and a solution must contain at least a…

Data Structures and Algorithms · Computer Science 2022-10-25 Suhas Thejaswi, Ameet Gadekar, Bruno Ordozgoiti, Michal Osadnik

Fair Hierarchical Clustering

As machine learning has become more prevalent, researchers have begun to recognize the necessity of ensuring machine learning systems are fair. Recently, there has been an interest in defining a notion of fairness that mitigates…

Data Structures and Algorithms · Computer Science 2020-06-22 Sara Ahmadian, Alessandro Epasto, Marina Knittel, Ravi Kumar, Mohammad Mahdian, Benjamin Moseley, Philip Pham, Sergei Vassilvitskii, Yuyan Wang

Fair Labeled Clustering

Numerous algorithms have been produced for the fundamental problem of clustering under many different notions of fairness. Perhaps the most common family of notions currently studied is group fairness, in which proportional group…

Machine Learning · Computer Science 2023-06-06 Seyed A. Esmaeili, Sharmila Duppala, John P. Dickerson, Brian Brubach

Clustering is difficult only when it does not matter

Numerous papers ask how difficult it is to cluster data. We suggest that the more relevant and interesting question is how difficult it is to cluster data sets that can be clustered well. More generally, despite the ubiquity and the…

Machine Learning · Computer Science 2012-05-23 Amit Daniely, Nati Linial, Michael Saks

Improved Approximation for Fair Correlation Clustering

Correlation clustering is a ubiquitous paradigm in unsupervised machine learning where addressing unfairness is a major challenge. Motivated by this, we study Fair Correlation Clustering where the data points may belong to different…

Machine Learning · Computer Science 2022-06-13 Sara Ahmadian, Maryam Negahbani

Hierarchical Clustering with Structural Constraints

Hierarchical clustering is a popular unsupervised data analysis method. For many real-world applications, we would like to exploit prior information about the data that imposes constraints on the clustering hierarchy, and is not captured by…

Data Structures and Algorithms · Computer Science 2018-07-17 Vaggos Chatziafratis, Rad Niazadeh, Moses Charikar

Discriminating Equivalent Algorithms via Relative Performance

In scientific computing, it is common that a mathematical expression can be computed by many different algorithms (sometimes over hundreds), each identifying a specific sequence of library calls. Although mathematically equivalent, those…

Performance · Computer Science 2021-09-15 Aravind Sankaran, Paolo Bientinesi

Natural Hierarchical Cluster Analysis by Nearest Neighbors with Near-Linear Time Complexity

We propose a nearest neighbor based clustering algorithm that results in a naturally defined hierarchy of clusters. In contrast to the agglomerative and divisive hierarchical clustering algorithms, our approach is not dependent on the…

Data Structures and Algorithms · Computer Science 2022-03-16 Kaan Gokcesu, Hakan Gokcesu

New Algorithms And Fast Implementations To Approximate Stochastic Processes

We present new algorithms and fast implementations to find efficient approximations for modelling stochastic processes. For many numerical computations it is essential to develop finite approximations for stochastic processes. While the…

Optimization and Control · Mathematics 2020-12-03 Kipngeno Benard Kirui, Georg Ch. Pflug, Alois Pichler

Stochastic Optimization Algorithms

When looking for a solution, deterministic methods have the enormous advantage that they do find global optima. Unfortunately, they are very CPU-intensive, and are useless on intractable NP-hard problems that would require thousands of…

Neural and Evolutionary Computing · Computer Science 2011-12-20 Pierre Collet, Jean-Philippe Rennard

A Clustering Approach to Solving Large Stochastic Matching Problems

In this work we focus on efficient heuristics for solving a class of stochastic planning problems that arise in a variety of business, investment, and industrial applications. The problem is best described in terms of future buy and sell…

Artificial Intelligence · Computer Science 2013-01-14 Milos Hauskrecht, Eli Upfal

Stochastic optimization with decision-dependent distributions

Stochastic optimization problems often involve data distributions that change in reaction to the decision variables. This is the case for example when members of the population respond to a deployed classifier by manipulating their features…

Optimization and Control · Mathematics 2020-12-15 Dmitriy Drusvyatskiy, Lin Xiao

Consensus clustering in complex networks

The community structure of complex networks reveals both their organization and hidden relationships among their constituents. Most community detection methods currently available are not deterministic, and their results typically depend on…

Physics and Society · Physics 2012-03-29 Andrea Lancichinetti, Santo Fortunato

Clustering Algorithms: A Comparative Approach

Many real-world systems can be studied in terms of pattern recognition tasks, so that proper use (and understanding) of machine learning methods in practical applications becomes essential. While a myriad of classification methods have been…

Machine Learning · Computer Science 2016-12-28 Mayra Z. Rodriguez, Cesar H. Comin, Dalcimar Casanova, Odemir M. Bruno, Diego R. Amancio, Francisco A. Rodrigues, Luciano da F. Costa