Related papers: Fair and Diverse DPP-based Data Summarization

200 papers

Determinantal Point Processes (DPPs) are probabilistic models that arise in quantum physics and random matrix theory and have recently found numerous applications in computer science. DPPs define distributions over subsets of a given ground…

Data Structures and Algorithms · Computer Science 2017-04-25 L. Elisa Celis, Amit Deshpande, Tarun Kathuria, Damian Straszak, Nisheeth K. Vishnoi

Subsampling from a large data set is useful in many supervised learning contexts to provide a global view of the data based on only a fraction of the observations. Diverse (or space-filling) subsampling is an appealing subsampling approach…

Methodology · Statistics 2023-11-27 Boyang Shang, Daniel W. Apley, Sanjay Mehrotra

Fairness in multi-document summarization of user-generated content remains a critical challenge in natural language processing (NLP). Existing summarization methods often fail to ensure equitable representation across different social…

Computation and Language · Computer Science 2025-03-19 Sina Bagheri Nezhad, Sayan Bandyapadhyay, Ameeta Agrawal

Due to the recent cases of algorithmic bias in data-driven decision-making, machine learning methods are being put under the microscope in order to understand the root cause of these biases and how to correct them. Here, we consider a basic…

Machine Learning · Computer Science 2016-10-25 L. Elisa Celis, Amit Deshpande, Tarun Kathuria, Nisheeth K. Vishnoi

Submodular maximization has become established as the method of choice for the task of selecting representative and diverse summaries of data. However, if datapoints have sensitive attributes such as gender or age, such machine learning…

Machine Learning · Computer Science 2020-10-20 Marwa El Halabi, Slobodan Mitrović, Ashkan Norouzi-Fard, Jakab Tardos, Jakub Tarnawski

We propose a novel diverse feature selection method based on determinantal point processes (DPPs). Our model enables one to flexibly define diversity based on the covariance of features (similar to orthogonal matching pursuit) or…

Machine Learning · Computer Science 2014-11-25 Nematollah Kayhan Batmanghelich, Gerald Quon, Alex Kulesza, Manolis Kellis, Polina Golland, Luke Bornn

The most important obstacles facing multi-document summarization include excessive redundancy in source descriptions and the looming shortage of training data. These obstacles prevent encoder-decoder models from being used directly, but…

Computation and Language · Computer Science 2019-06-04 Sangwoo Cho, Logan Lebanoff, Hassan Foroosh, Fei Liu

Determinantal Point Processes (DPPs) are elegant probabilistic models of repulsion and diversity over discrete sets of items. But their applicability to large sets is hindered by expensive cubic-complexity matrix operations for basic tasks…

Machine Learning · Computer Science 2016-05-31 Chengtao Li, Stefanie Jegelka, Suvrit Sra

In some practical learning tasks, such as traffic video analysis, the number of available training samples is restricted by different factors, such as limited communication bandwidth and computation power. Determinantal Point Process (DPP)…

Machine Learning · Computer Science 2023-08-17 Xiwen Chen, Huayu Li, Rahul Amin, Abolfazl Razi

Determinantal point processes (DPPs), which arise in random matrix theory and quantum physics, are natural models for subset selection problems where diversity is preferred. Among many remarkable properties, DPPs offer tractable algorithms…

Machine Learning · Computer Science 2012-02-20 Alex Kulesza, Ben Taskar

Determinantal point processes (DPPs) are well known models for diverse subset selection problems, including recommendation tasks, document summarization and image search. In this paper, we discuss a greedy deterministic adaptation of k-DPP.…

Machine Learning · Computer Science 2021-05-31 Joachim Schreurs, Michaël Fanuel, Johan A. K. Suykens

Dynamic data selection accelerates training by sampling a changing subset of the dataset while preserving accuracy. We rethink two core notions underlying sample evaluation: representativeness and diversity. Instead of local geometric…

Artificial Intelligence · Computer Science 2026-03-06 Yuzhe Zhou, Zhenglin Hua, Haiyun Guo, Yuheng Jia

Subset selection problems ask for a small, diverse yet representative subset of the given data. When pairwise similarities are captured by a kernel, the determinants of submatrices provide a measure of diversity or independence of items…

Data Structures and Algorithms · Computer Science 2016-07-07 Tarun Kathuria, Amit Deshpande

People from different social and demographic groups express diverse perspectives and conflicting opinions on a broad set of topics such as product reviews, healthcare, law, and politics. A fair summary should provide a comprehensive…

Computation and Language · Computer Science 2024-04-02 Yusen Zhang, Nan Zhang, Yixin Liu, Alexander Fabbri, Junru Liu, Ryo Kamoi, Xiaoxin Lu, Caiming Xiong, Jieyu Zhao, Dragomir Radev, Kathleen McKeown, Rui Zhang

It is now much easier than ever before to produce videos. While the ubiquitous video data is a great source for information discovery and extraction, the computational challenges are unparalleled. Automatically summarizing the videos has…

Machine Learning · Computer Science 2018-10-26 Aidean Sharghi, Ali Borji, Chengtao Li, Tianbao Yang, Boqing Gong

The goal of diversity sampling is to select a representative subset of data in a way that maximizes information contained in the subset while keeping its cardinality small. We introduce the ordered diverse sampling problem based on a new…

Computation and Language · Computer Science 2025-03-17 Ashish Tiwari, Mukul Singh, Ananya Singha, Arjun Radhakrishna

Online feature selection has been an active research area in recent years. We propose a novel diverse online feature selection method based on Determinantal Point Processes (DPP). Our model aims to provide diverse features which can be…

Machine Learning · Statistics 2019-04-26 Chapman Siu, Richard Yi Da Xu

Determinantal point processes (DPPs) are an important concept in random matrix theory and combinatorics. They have also recently attracted interest in the study of numerical methods for machine learning, as they offer an elegant "missing…

Machine Learning · Computer Science 2018-04-18 Philipp Hennig, Roman Garnett

We study core-set construction algorithms for the task of Diversity Maximization under fairness/partition constraint. Given a set of points $P$ in a metric space partitioned into $m$ groups, and given $k_1,\ldots,k_m$, the goal of this…

Data Structures and Algorithms · Computer Science 2023-10-13 Sepideh Mahabadi, Stojan Trajanovski

Condensing large datasets into smaller synthetic counterparts has demonstrated its promise for image classification. However, previous research has overlooked a crucial concern in image recognition: ensuring that models trained on condensed…

Computer Vision and Pattern Recognition · Computer Science 2025-10-14 Qihang Zhou, Shenhao Fang, Shibo He, Wenchao Meng, Jiming Chen