Related papers: Approximation Algorithms for Clustering with Dynam…

Fair Clustering for Data Summarization: Improved Approximation Algorithms and Complexity Insights

Data summarization tasks are often modeled as $k$-clustering problems, where the goal is to choose $k$ data points, called cluster centers, that best represent the dataset by minimizing a clustering objective. A popular objective is to…

Machine Learning · Computer Science 2024-10-18 Ameet Gadekar, Aristides Gionis, Suhas Thejaswi

A Constant Approximation Algorithm for Sequential Random-Order No-Substitution k-Median Clustering

We study k-median clustering under the sequential no-substitution setting. In this setting, a data stream is sequentially observed, and some of the points are selected by the algorithm as cluster centers. However, a point can be selected as…

Machine Learning · Computer Science 2022-04-14 Tom Hess, Michal Moshkovitz, Sivan Sabato

Faster Balanced Clusterings in High Dimension

The problem of constrained clustering has attracted significant attention in the past decades. In this paper, we study the balanced $k$-center, $k$-median, and $k$-means clustering problems where the size of each cluster is constrained by…

Computational Geometry · Computer Science 2018-09-11 Hu Ding

On Clustering with Discounts

We study the $k$-median with discounts problem, wherein we are given clients with non-negative discounts and seek to open at most $k$ facilities. The goal is to minimize the sum of distances from each client to its nearest open facility…

Data Structures and Algorithms · Computer Science 2021-11-19 Shichuan Deng

Dynamic Consistent $k$-Center Clustering with Optimal Recourse

Given points from an arbitrary metric space and a sequence of point updates sent by an adversary, what is the minimum recourse per update (i.e., the minimum number of changes needed to the set of centers after an update), in order to…

Data Structures and Algorithms · Computer Science 2025-06-04 Sebastian Forster, Antonis Skarlatos

Approximation Algorithms for K-Modes Clustering

In this paper, we study clustering with respect to the k-modes objective function, a natural formulation of clustering for categorical data. One of the main contributions of this paper is to establish the connection between k-modes and…

Artificial Intelligence · Computer Science 2007-05-23 Zengyou He

Fully Dynamic Consistent $k$-Center Clustering

We study the consistent k-center clustering problem. In this problem, the goal is to maintain a constant factor approximate $k$-center solution during a sequence of $n$ point insertions and deletions while minimizing the recourse, i.e., the…

Data Structures and Algorithms · Computer Science 2023-07-27 Jakub Łącki, Bernhard Haeupler, Christoph Grunau, Václav Rozhoň, Rajesh Jayaram

Fully Dynamic k-Center Clustering in Doubling Metrics

Clustering is one of the most fundamental problems in unsupervised learning with a large number of applications. However, classical clustering algorithms assume that the data is static, thus failing to capture many real-world applications…

Data Structures and Algorithms · Computer Science 2020-02-11 Gramoz Goranci, Monika Henzinger, Dariusz Leniowski, Christian Schulz, Alexander Svozil

Balanced $k$-Center Clustering When $k$ Is A Constant

The problem of constrained $k$-center clustering has attracted significant attention in the past decades. In this paper, we study balanced $k$-center clustering, where the size of each cluster is constrained by the given lower and upper bounds.…

Computational Geometry · Computer Science 2017-04-11 Hu Ding

Fully Dynamic $k$-Clustering with Fast Update Time and Small Recourse

In the dynamic metric $k$-median problem, we wish to maintain a set of $k$ centers $S \subseteq V$ in an input metric space $(V, d)$ that gets updated via point insertions/deletions, so as to minimize the objective $\sum_{x \in V} \min_{y…

Data Structures and Algorithms · Computer Science 2024-08-05 Sayan Bhattacharya, Martín Costa, Naveen Garg, Silvio Lattanzi, Nikos Parotsidis

Near-Optimal Clustering in the $k$-machine model

The clustering problem, in its many variants, has numerous applications in operations research and computer science (e.g., in bioinformatics, image processing, and social network analysis). As sizes of data sets have grown…

Distributed, Parallel, and Cluster Computing · Computer Science 2017-10-24 Sayan Bandyapadhyay, Tanmay Inamdar, Shreyas Pai, Sriram V. Pemmaraju

Fully Dynamic $k$-Median with Near-Optimal Update Time and Recourse

In metric $k$-clustering, we are given as input a set of $n$ points in a general metric space, and we have to pick $k$ centers and cluster the input points around these chosen centers, so as to minimize an appropriate objective function. In…

Data Structures and Algorithms · Computer Science 2024-11-06 Sayan Bhattacharya, Martín Costa, Ermiya Farokhnejad

Fair k-Center Clustering for Data Summarization

In data summarization we want to choose $k$ prototypes in order to summarize a data set. We study a setting where the data set comprises several demographic groups and we are restricted to choose $k_i$ prototypes belonging to group $i$. A…

Machine Learning · Statistics 2019-05-14 Matthäus Kleindessner, Pranjal Awasthi, Jamie Morgenstern

Consistent k-Clustering for General Metrics

Given a stream of points in a metric space, is it possible to maintain a constant approximate clustering by changing the cluster centers only a small number of times during the entire execution of the algorithm? This question received…

Data Structures and Algorithms · Computer Science 2020-11-16 Hendrik Fichtenberger, Silvio Lattanzi, Ashkan Norouzi-Fard, Ola Svensson

Approximation Algorithm for Constrained $k$-Center Clustering: A Local Search Approach

Clustering is a long-standing research problem and a fundamental tool in AI and data analysis. The traditional k-center problem, a fundamental theoretical challenge in clustering, has a best possible approximation ratio of 2, and any…

Machine Learning · Computer Science 2026-04-28 Chaoqi Jia, Longkun Guo, Kewen Liao, Zhigang Lu, Chao Chen, Jason Xue

On Approximability of Clustering Problems Without Candidate Centers

The k-means objective is arguably the most widely-used cost function for modeling clustering tasks in a metric space. In practice and historically, k-means is thought of in a continuous setting, namely where the centers can be located…

Computational Complexity · Computer Science 2020-10-08 Vincent Cohen-Addad, Karthik C. S., Euiwoong Lee

Dynamic k-center clustering with lifetimes

The $k$-center problem is a fundamental clustering variant with applications in learning systems and data summarization. In several real-world scenarios, the dataset to be clustered is not static, but evolves over time, as new data points…

Data Structures and Algorithms · Computer Science 2026-03-25 Simone Moretti, Paolo Pellizzoni, Andrea Pietracaprina, Geppino Pucci

Fast Approximate $K$-Means via Cluster Closures

$K$-means, a simple and effective clustering algorithm, is one of the most widely used algorithms in the multimedia and computer vision communities. Traditional $k$-means is an iterative algorithm: in each iteration new cluster centers are…

Computer Vision and Pattern Recognition · Computer Science 2013-12-12 Jingdong Wang, Jing Wang, Qifa Ke, Gang Zeng, Shipeng Li

Clustering with fair-center representation: parameterized approximation algorithms and heuristics

We study a variant of classical clustering formulations in the context of algorithmic fairness, known as diversity-aware clustering. In this variant we are given a collection of facility subsets, and a solution must contain at least a…

Data Structures and Algorithms · Computer Science 2022-10-25 Suhas Thejaswi, Ameet Gadekar, Bruno Ordozgoiti, Michal Osadnik

Near-Optimal Algorithms for Constrained k-Center Clustering with Instance-level Background Knowledge

Center-based clustering has attracted significant research interest from both theory and practice. In many practical applications, input data often contain background knowledge that can be used to improve clustering results. In this work,…

Machine Learning · Computer Science 2025-06-13 Longkun Guo, Chaoqi Jia, Kewen Liao, Zhigang Lu, Minhui Xue