Related papers: Efficient solutions for weight-balanced partitioni…

A Polynomial Algorithm for Balanced Clustering via Graph Partitioning

The objective of clustering is to discover natural groups in datasets and to identify geometrical structures which might reside there, without assuming any prior knowledge on the characteristics of the data. The problem can be seen as…

Computational Geometry · Computer Science 2018-01-26 Luis-Evaristo Caraballo , José-Miguel Díaz-Báñez , Nadine Kroher

On resolution coresets for constrained clustering

Specific data compression techniques, formalized by the concept of coresets, proved to be powerful for many optimization problems. In fact, while tightly controlling the approximation error, coresets may lead to significant speed up of the…

Optimization and Control · Mathematics 2022-04-05 Maximilian Fiedler , Peter Gritzmann , Fabian Klemm

Weighted total variation based convex clustering

Data clustering is a fundamental problem with a wide range of applications. Standard methods, eg the $k$-means method, usually require solving a non-convex optimization problem. Recently, total variation based convex relaxation to the…

Optimization and Control · Mathematics 2018-08-29 Guodong Xu , Yu Xia , Hui Ji

Approximate Minimum Tree Cover in All Symmetric Monotone Norms Simultaneously

We study the problem of partitioning a set of $n$ objects in a metric space into $k$ clusters $V_1,\dots,V_k$. The quality of the clustering is measured by considering the vector of cluster costs and then minimizing some monotone symmetric…

Data Structures and Algorithms · Computer Science 2025-01-10 Matthias Kaul , Kelin Luo , Matthias Mnich , Heiko Röglin

Biconvex Clustering

Convex clustering has recently garnered increasing interest due to its attractive theoretical and computational properties, but its merits become limited in the face of high-dimensional data. In such settings, pairwise affinity terms that…

Methodology · Statistics 2021-04-02 Saptarshi Chakraborty , Jason Xu

Optimal Variable Clustering for High-Dimensional Matrix Valued Data

Matrix valued data has become increasingly prevalent in many applications. Most of the existing clustering methods for this type of data are tailored to the mean model and do not account for the dependence structure of the features, which…

Machine Learning · Statistics 2023-12-07 Inbeom Lee , Siyi Deng , Yang Ning

Convex Matroid Optimization

We consider a problem of optimizing convex functionals over matroid bases. It is richly expressive and captures certain quadratic assignment and clustering problems. While generally NP-hard, we show it is polynomial time solvable when a…

Combinatorics · Mathematics 2018-08-21 Shmuel Onn

Clustered Multi-Task Learning: A Convex Formulation

In multi-task learning several related tasks are considered simultaneously, with the hope that by an appropriate sharing of information across tasks, each task may benefit from the others. In the context of learning linear functions for…

Machine Learning · Computer Science 2008-09-12 Laurent Jacob , Francis Bach , Jean-Philippe Vert

Faster Balanced Clusterings in High Dimension

The problem of constrained clustering has attracted significant attention in the past decades. In this paper, we study the balanced $k$-center, $k$-median, and $k$-means clustering problems where the size of each cluster is constrained by…

Computational Geometry · Computer Science 2018-09-11 Hu Ding

A balanced k-means algorithm for weighted point sets

The classical $k$-means algorithm for partitioning $n$ points in $\mathbb{R}^d$ into $k$ clusters is one of the most popular and widely spread clustering methods. The need to respect prescribed lower bounds on the cluster sizes has been…

Optimization and Control · Mathematics 2016-08-04 Steffen Borgwardt , Andreas Brieden , Peter Gritzmann

Meta-learning of shared linear representations beyond well-specified linear regression

Motivated by multi-task and meta-learning approaches, we consider the problem of learning structure shared by tasks or users, such as shared low-rank representations or clustered structures. While all previous works focus on well-specified…

Machine Learning · Computer Science 2025-02-14 Mathieu Even , Laurent Massoulié

A Fast Algorithm for Clustering High Dimensional Feature Vectors

We propose an algorithm for clustering high dimensional data. If $P$ features for $N$ objects are represented in an $N\times P$ matrix ${\bf X}$, where $N\ll P$, the method is based on exploiting the cluster-dependent structure of the…

Machine Learning · Statistics 2018-11-05 Shahina Rahman , Valen E. Johnson

On Convex Clustering Solutions

Convex clustering is an attractive clustering algorithm with favorable properties such as efficiency and optimality owing to its convex formulation. It is thought to generalize both k-means clustering and agglomerative clustering. However,…

Machine Learning · Statistics 2021-05-19 Canh Hao Nguyen , Hiroshi Mamitsuka

Bicriteria Polygon Aggregation with Arbitrary Shapes

We study the problem of aggregating polygons by covering them with disjoint representative regions, thereby inducing a clustering of the polygons. Our objective is to minimize a weighted sum of the total area and the total perimeter of the…

Computational Geometry · Computer Science 2025-07-17 Lotte Blank , David Eppstein , Jan-Henrik Haunert , Herman Haverkort , Benedikt Kolbe , Philip Mayer , Petra Mutzel , Alexander Naumann , Jonas Sauer

Weighted Clustering Ensemble: A Review

Clustering ensemble, or consensus clustering, has emerged as a powerful tool for improving both the robustness and the stability of results from individual clustering methods. Weighted clustering ensemble arises naturally from clustering…

Computer Vision and Pattern Recognition · Computer Science 2021-12-14 Mimi Zhang

The Why and How of Convex Clustering

This survey reviews a clustering method based on solving a convex optimization problem. Despite the plethora of existing clustering methods, convex clustering has several uncommon features that distinguish it from prior art. The…

Methodology · Statistics 2025-09-19 Eric C. Chi , Aaron J. Molstad , Zheming Gao , Jocelyn T. Chi

Neural Capacitated Clustering

Recent work on deep clustering has found new promising methods also for constrained clustering problems. Their typically pairwise constraints often can be used to guide the partitioning of the data. Many problems however, feature…

Machine Learning · Computer Science 2023-05-22 Jonas K. Falkner , Lars Schmidt-Thieme

Multiple Partitions Aligned Clustering

Multi-view clustering is an important yet challenging task due to the difficulty of integrating the information from multiple representations. Most existing multi-view clustering methods explore the heterogeneous information in the space…

Machine Learning · Computer Science 2019-09-16 Zhao Kang , Zipeng Guo , Shudong Huang , Siying Wang , Wenyu Chen , Yuanzhang Su , Zenglin Xu

Hard Clusters Maximize Mutual Information

In this paper, we investigate mutual information as a cost function for clustering, and show in which cases hard, i.e., deterministic, clusters are optimal. Using convexity properties of mutual information, we show that certain formulations…

Information Theory · Computer Science 2017-06-13 Bernhard C. Geiger , Rana Ali Amjad

Convex Clustering: Model, Theoretical Guarantee and Efficient Algorithm

Clustering is a fundamental problem in unsupervised learning. Popular methods like K-means, may suffer from poor performance as they are prone to get stuck in its local minima. Recently, the sum-of-norms (SON) model (also known as the…

Machine Learning · Computer Science 2018-10-08 Defeng Sun , Kim-Chuan Toh , Yancheng Yuan