Related papers: apricot: Submodular selection for data summarizati…

200 papers

Submodular functions are a special class of set functions which naturally model the notion of representativeness, diversity, coverage etc. and have been shown to be computationally very efficient. A lot of past work has applied submodular…

Machine Learning · Computer Science 2022-02-24 Vishal Kaushal, Ganesh Ramakrishnan, Rishabh Iyer

Submodular functions can model many real-world optimisation problems. This paper introduces approaches for computing diverse sets of high quality solutions for submodular optimisation problems. We first present diversifying greedy…

Artificial Intelligence · Computer Science 2020-10-23 Aneta Neumann, Jakob Bossek, Frank Neumann

We propose subsampling as a unified algorithmic technique for submodular maximization in centralized and online settings. The idea is simple: independently sample elements from the ground set, and use simple combinatorial techniques (such…

Data Structures and Algorithms · Computer Science 2021-04-08 Christopher Harshaw, Ehsan Kazemi, Moran Feldman, Amin Karbasi

Modern datasets span billions of samples, making training on all available data infeasible. Selecting a high quality subset helps in reducing training costs and enhancing model quality. Submodularity, a discrete analogue of convexity, is…

Machine Learning · Computer Science 2025-04-04 Maximilian Böther, Abraham Sebastian, Pranjal Awasthi, Ana Klimovic, Srikumar Ramalingam

Submodularity is a key property in discrete optimization. Submodularity has been widely used for analyzing the greedy algorithm to give performance bounds and providing insight into the construction of valid inequalities for mixed-integer…

Optimization and Control · Mathematics 2022-05-24 Temitayo Ajayi, Taewoo Lee, Andrew Schaefer

In today's information systems, the availability of massive amounts of data necessitates the development of fast and accurate algorithms to summarize these data and represent them in a succinct format. One crucial problem in big data…

Data Structures and Algorithms · Computer Science 2013-12-27 Ahmed K. Farahat, Ahmed Elgohary, Ali Ghodsi, Mohamed S. Kamel

The problem of column subset selection has recently attracted a large body of research, with feature selection serving as one obvious and important application. Among the techniques that have been applied to solve this problem, the greedy…

Data Structures and Algorithms · Computer Science 2021-11-16 Jason Altschuler, Aditya Bhaskara, Gang Fu, Vahab Mirrokni, Afshin Rostamizadeh, Morteza Zadimoghaddam

Greedy algorithms are widely used for problems in machine learning such as feature selection and set function optimization. Unfortunately, for large datasets, the running time of even greedy algorithms can be quite high. This is because for…

Machine Learning · Statistics 2017-03-09 Rajiv Khanna, Ethan Elenberg, Alexandros G. Dimakis, Sahand Negahban, Joydeep Ghosh

This paper addresses the problem of sequential submodular maximization: selecting and ranking items in a sequence to optimize some composite submodular function. In contrast to most of the previous works, which assume access to the utility…

Machine Learning · Computer Science 2024-09-10 Jing Yuan, Shaojie Tang

Subset selection is an interesting and important topic in the field of evolutionary multi-objective optimization (EMO). In particular, in an EMO algorithm with an unbounded external archive, subset selection is an essential post-processing…

Neural and Evolutionary Computing · Computer Science 2021-09-23 Weiyu Chen, Hisao Ishibuchi, Ke Shang

Subset selection has been a popular topic in recent years, and a number of subset selection methods have been proposed. Among those methods, hypervolume subset selection is widely used. Greedy hypervolume subset selection algorithms can achieve…

Neural and Evolutionary Computing · Computer Science 2020-07-07 Weiyu Chen, Hisao Ishibuchi, Ke Shang

Automatic prompt optimization reduces manual prompt engineering, but relies on task performance measured on a small, often randomly sampled evaluation subset as its main source of feedback signal. Despite this, how to select that evaluation…

Computation and Language · Computer Science 2026-01-08 Jinming Nian, Zhiyuan Peng, Hongwei Shang, Dae Hoon Park, Yi Fang

Submodularity is an important concept in integer and combinatorial optimization. A classical submodular set function models the utility of selecting homogenous items from a single ground set, and such selections can be represented by binary…

Optimization and Control · Mathematics 2023-04-06 Simge Küçükyavuz, Qimeng Yu

We present pomegranate, an open source machine learning package for probabilistic modeling in Python. Probabilistic modeling encompasses a wide range of methods that explicitly describe uncertainty using probability distributions. Three…

Artificial Intelligence · Computer Science 2018-03-01 Jacob Schreiber

In the classical selection problem, the input consists of a collection of elements and the goal is to pick a subset of elements from the collection such that some objective function $f$ is maximized. This problem has been studied…

Data Structures and Algorithms · Computer Science 2021-09-06 Sofia Maria Nikolakaki, Alina Ene, Evimaria Terzi

We introduce a new library named abess that implements a unified framework of best-subset selection for solving diverse machine learning problems, e.g., linear regression, classification, and principal component analysis. Particularly, the…

Machine Learning · Statistics 2024-04-02 Jin Zhu, Xueqin Wang, Liyuan Hu, Junhao Huang, Kangkang Jiang, Yanhang Zhang, Shiyun Lin, Junxian Zhu

Large-scale subset selection asks for a small useful set of examples, features, sensors, seed users, or context passages from an enormous ground set. Submodular maximization is a canonical model for such diminishing-returns problems, but…

Data Structures and Algorithms · Computer Science 2026-05-07 Alan Kuhnle

In this paper, we propose a new framework for designing fast parallel algorithms for fundamental statistical subset selection tasks that include feature selection and experimental design. Such tasks are known to be weakly submodular and are…

Machine Learning · Computer Science 2021-04-02 Sharon Qian, Yaron Singer

Subset selection, which aims to select a subset from a ground set to maximize some objective function, arises in various applications such as influence maximization and sensor placement. In real-world scenarios, however, one often needs to…

Neural and Evolutionary Computing · Computer Science 2022-05-10 Chao Bian, Yawen Zhou, Chao Qian

In this manuscript, we offer a gentle review of submodularity and supermodularity and their properties. We offer a plethora of submodular definitions; a full description of a number of example submodular functions and their generalizations;…

Machine Learning · Computer Science 2022-10-05 Jeff Bilmes