Related papers: Maximizing diversity over clustered data

200 papers

Result diversification is an important aspect in web-based search, document summarization, facility location, portfolio management and other applications. Given a set of ranked results for a set of objects (e.g. web documents, facilities,…

Data Structures and Algorithms · Computer Science 2016-11-28 Allan Borodin , Aadhar Jain , Hyun Chul Lee , Yuli Ye

Diversity maximization is a fundamental problem in web search and data mining. For a given dataset $S$ of $n$ elements, the problem requires determining a subset of $S$ containing $k\ll n$ "representatives" which maximize some diversity…

Distributed, Parallel, and Cluster Computing · Computer Science 2020-02-11 Matteo Ceccarello , Andrea Pietracaprina , Geppino Pucci

Diversity maximization is a well-studied problem where the goal is to find $k$ diverse items. Fair diversity maximization aims to select a diverse subset of $k$ items from a large dataset, while requiring that each group of items be…

Data Structures and Algorithms · Computer Science 2025-06-11 Florian Adriaens , Nikolaj Tatti

Diversity maximization aims to select a diverse and representative subset of items from a large dataset. It is a fundamental optimization task that finds applications in data summarization, feature selection, web search, recommender…

Data Structures and Algorithms · Computer Science 2023-04-27 Yanhao Wang , Michael Mathioudakis , Jia Li , Francesco Fabbri

Finding diverse solutions to optimization problems has been of practical interest for several decades, and recently enjoyed increasing attention in research. While submodular optimization has been rigorously studied in many fields, its…

Data Structures and Algorithms · Computer Science 2023-07-18 Anh Viet Do , Mingyu Guo , Aneta Neumann , Frank Neumann

Diversity is an important principle in data selection and summarization, facility location, and recommendation systems. Our work focuses on maximizing diversity in data selection, while offering fairness guarantees. In particular, we offer…

Data Structures and Algorithms · Computer Science 2020-10-20 Zafeiria Moumoulidou , Andrew McGregor , Alexandra Meliou

Maximizing a submodular function has a wide range of applications in machine learning and data mining. One such application is data summarization whose goal is to select a small set of representative and diverse data items from a large…

Machine Learning · Computer Science 2023-03-10 Jing Yuan , Shaojie Tang

We introduce a novel criterion in clustering that seeks clusters with limited range of values associated with each cluster's elements. In clustering or classification the objective is to partition a set of objects into subsets, called…

Data Structures and Algorithms · Computer Science 2018-05-15 Dorit S. Hochbaum

Streaming submodular maximization is a natural model for the task of selecting a representative subset from a large-scale dataset. If datapoints have sensitive attributes such as gender or race, it becomes important to enforce fairness to…

Machine Learning · Computer Science 2025-11-25 Marwa El Halabi , Federico Fusco , Ashkan Norouzi-Fard , Jakab Tardos , Jakub Tarnawski

Submodular maximization subject to matroid constraints is a central problem with many applications in machine learning. As algorithms are increasingly used in decision-making over datapoints with sensitive attributes such as gender or race,…

Data Structures and Algorithms · Computer Science 2026-01-16 Sepideh Mahabadi , Sherry Sarkar , Jakub Tarnawski

There are many problems in machine learning and data mining which are equivalent to selecting a non-redundant, high "quality" set of objects. Recommender systems, feature selection, and data summarization are among many applications of…

Machine Learning · Computer Science 2019-04-19 Mehrdad Ghadiri , Mark Schmidt

In many applications, such as web-based search, document summarization, and facility location, the results should preferably be both representative and diversified subsets of documents. The goal of this study is to select…

Machine Learning · Computer Science 2015-11-10 Sepehr Abbasi Zadeh , Mehrdad Ghadiri

We present approximation algorithms for some variants of center-based clustering and related problems in the fully dynamic setting, where the pointset evolves through an arbitrary sequence of insertions and deletions. Specifically, we…

Data Structures and Algorithms · Computer Science 2023-09-06 Paolo Pellizzoni , Andrea Pietracaprina , Geppino Pucci

Submodular function optimization has numerous applications in machine learning and data analysis, including data summarization which aims to identify a concise and diverse set of data points from a large dataset. It is important to…

Data Structures and Algorithms · Computer Science 2023-04-11 Shaojie Tang , Jing Yuan , Twumasi Mensah-Boateng

We consider the clustering with diversity problem: given a set of colored points in a metric space, partition them into clusters such that each cluster has at least $\ell$ points, all of which have distinct colors. We give a…

Data Structures and Algorithms · Computer Science 2010-04-22 Jian Li , Ke Yi , Qin Zhang

A wide variety of problems in machine learning, including exemplar clustering, document summarization, and sensor placement, can be cast as constrained submodular maximization problems. A lot of recent effort has been devoted to developing…

Data Structures and Algorithms · Computer Science 2016-08-15 Rafael da Ponte Barbosa , Alina Ene , Huy L. Nguyen , Justin Ward

Finding a single best solution is the most common objective in combinatorial optimization problems. However, such a single solution may not be applicable to real-world problems as objective functions and constraints are only…

Data Structures and Algorithms · Computer Science 2022-01-25 Tesshu Hanaka , Masashi Kiyomi , Yasuaki Kobayashi , Yusuke Kobayashi , Kazuhiro Kurita , Yota Otachi

In this work, we study the problem of monotone non-submodular maximization with a partition matroid constraint. Although a generalization of this problem has been studied in the literature, our work focuses on leveraging properties of partition…

Data Structures and Algorithms · Computer Science 2022-05-02 Lan N. Nguyen , My T. Thai

In this paper, we study the classic submodular maximization problem subject to a group equality constraint under both non-adaptive and adaptive settings. It has been shown that the utility function of many machine learning applications,…

Machine Learning · Computer Science 2023-08-30 Shaojie Tang , Jing Yuan

In this work, we study diversity-aware clustering problems where the data points are associated with multiple attributes resulting in intersecting groups. A clustering solution needs to ensure that the number of chosen cluster centers from…

Data Structures and Algorithms · Computer Science 2025-05-21 Suhas Thejaswi , Ameet Gadekar , Bruno Ordozgoiti , Aristides Gionis