Related papers: Fast Parallel Algorithms for Statistical Subset Se…

Scalable Greedy Feature Selection via Weak Submodularity

Greedy algorithms are widely used for problems in machine learning such as feature selection and set function optimization. Unfortunately, for large datasets, the running time of even greedy algorithms can be quite high. This is because for…

Machine Learning · Statistics 2017-03-09 Rajiv Khanna , Ethan Elenberg , Alexandros G. Dimakis , Sahand Negahban , Joydeep Ghosh

Fast Feature Selection with Fairness Constraints

We study the fundamental problem of selecting optimal features for model construction. This problem is computationally challenging on large datasets, even with the use of greedy algorithm variants. To address this challenge, we extend the…

Machine Learning · Computer Science 2023-02-06 Francesco Quinzan , Rajiv Khanna , Moshik Hershcovitch , Sarel Cohen , Daniel G. Waddington , Tobias Friedrich , Michael W. Mahoney

Adaptive Submodularity: Theory and Applications in Active Learning and Stochastic Optimization

Solving stochastic optimization problems under partial observability, where one needs to adaptively make decisions with uncertain outcomes, is a fundamental but notoriously difficult challenge. In this paper, we introduce the concept of…

Machine Learning · Computer Science 2017-12-07 Daniel Golovin , Andreas Krause

An Exponential Speedup in Parallel Running Time for Submodular Maximization without Loss in Approximation

In this paper we study the adaptivity of submodular maximization. Adaptivity quantifies the number of sequential rounds that an algorithm makes when function evaluations can be executed in parallel. Adaptivity is a fundamental concept that…

Data Structures and Algorithms · Computer Science 2018-04-18 Eric Balkanski , Aviad Rubinstein , Yaron Singer

Distributed Submodular Maximization with Parallel Execution

The submodular maximization problem is widely applicable in many engineering problems where objectives exhibit diminishing returns. While this problem is known to be NP-hard for certain subclasses of objective functions, there is a greedy…

Distributed, Parallel, and Cluster Computing · Computer Science 2020-07-01 Haoyuan Sun , David Grimsman , Jason R Marden

GreedyML: A Parallel Algorithm for Maximizing Constrained Submodular Functions

We describe a parallel approximation algorithm for maximizing monotone submodular functions subject to hereditary constraints on distributed memory multiprocessors. Our work is motivated by the need to solve submodular optimization problems…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-02-18 Shivaram Gopal , S M Ferdous , Hemanta K. Maji , Alex Pothen

Submodular meets Spectral: Greedy Algorithms for Subset Selection, Sparse Approximation and Dictionary Selection

We study the problem of selecting a subset of k random variables from a large set, in order to obtain the best linear prediction of another variable of interest. This problem can be viewed in the context of both feature selection and sparse…

Machine Learning · Statistics 2011-02-28 Abhimanyu Das , David Kempe

Performance-Complexity Tradeoffs in Greedy Weak Submodular Maximization with Random Sampling

Many problems in signal processing and machine learning can be formalized as weak submodular optimization tasks. For such problems, a simple greedy algorithm (\textsc{Greedy}) is guaranteed to find a solution achieving the objective with a…

Discrete Mathematics · Computer Science 2021-11-24 Abolfazl Hashemi , Haris Vikalo , Gustavo de Veciana

Distributed Maximization of Submodular plus Diversity Functions for Multi-label Feature Selection on Huge Datasets

There are many problems in machine learning and data mining which are equivalent to selecting a non-redundant, high "quality" set of objects. Recommender systems, feature selection, and data summarization are among many applications of…

Machine Learning · Computer Science 2019-04-19 Mehrdad Ghadiri , Mark Schmidt

Parallelizing greedy for submodular set function maximization in matroids and beyond

We consider parallel, or low adaptivity, algorithms for submodular function maximization. This line of work was recently initiated by Balkanski and Singer and has already led to several interesting results on the cardinality constraint and…

Data Structures and Algorithms · Computer Science 2018-12-03 Chandra Chekuri , Kent Quanrud

Greedy Column Subset Selection: New Bounds and Distributed Algorithms

The problem of column subset selection has recently attracted a large body of research, with feature selection serving as one obvious and important application. Among the techniques that have been applied to solve this problem, the greedy…

Data Structures and Algorithms · Computer Science 2021-11-16 Jason Altschuler , Aditya Bhaskara , Gang Fu , Vahab Mirrokni , Afshin Rostamizadeh , Morteza Zadimoghaddam

Beyond Adaptive Submodularity: Approximation Guarantees of Greedy Policy with Adaptive Submodularity Ratio

We propose a new concept named adaptive submodularity ratio to study the greedy policy for sequential decision making. While the greedy policy is known to perform well for a wide variety of adaptive stochastic optimization problems in…

Machine Learning · Computer Science 2019-04-25 Kaito Fujii , Shinsaku Sakaue

Fast Semidifferential-based Submodular Function Optimization

We present a practical and powerful new framework for both unconstrained and constrained submodular function optimization based on discrete semidifferentials (sub- and super-differentials). The resulting algorithms, which repeatedly compute…

Data Structures and Algorithms · Computer Science 2013-08-13 Rishabh Iyer , Stefanie Jegelka , Jeff Bilmes

Weakly Submodular Maximization Beyond Cardinality Constraints: Does Randomization Help Greedy?

Submodular functions are a broad class of set functions, which naturally arise in diverse areas. Many algorithms have been suggested for the maximization of these functions. Unfortunately, once the function deviates from submodularity, the…

Discrete Mathematics · Computer Science 2017-07-17 Lin Chen , Moran Feldman , Amin Karbasi

Restricted Strong Convexity Implies Weak Submodularity

We connect high-dimensional subset selection and submodular maximization. Our results extend the work of Das and Kempe (2011) from the setting of linear regression to arbitrary objective functions. For greedy feature selection, this…

Machine Learning · Statistics 2017-10-13 Ethan R. Elenberg , Rajiv Khanna , Alexandros G. Dimakis , Sahand Negahban

An Efficient Framework for Balancing Submodularity and Cost

In the classical selection problem, the input consists of a collection of elements and the goal is to pick a subset of elements from the collection such that some objective function $f$ is maximized. This problem has been studied…

Data Structures and Algorithms · Computer Science 2021-09-06 Sofia Maria Nikolakaki , Alina Ene , Evimaria Terzi

On Distributed Larger-Than-Memory Subset Selection With Pairwise Submodular Functions

Modern datasets span billions of samples, making training on all available data infeasible. Selecting a high quality subset helps in reducing training costs and enhancing model quality. Submodularity, a discrete analogue of convexity, is…

Machine Learning · Computer Science 2025-04-04 Maximilian Böther , Abraham Sebastian , Pranjal Awasthi , Ana Klimovic , Srikumar Ramalingam

Submodular meets Structured: Finding Diverse Subsets in Exponentially-Large Structured Item Sets

To cope with the high level of ambiguity faced in domains such as Computer Vision or Natural Language processing, robust prediction methods often search for a diverse set of high-quality candidate solutions or proposals. In structured…

Machine Learning · Computer Science 2014-11-10 Adarsh Prasad , Stefanie Jegelka , Dhruv Batra

Lazy Greedy Hypervolume Subset Selection from Large Candidate Solution Sets

Subset selection is a popular topic in recent years and a number of subset selection methods have been proposed. Among those methods, hypervolume subset selection is widely used. Greedy hypervolume subset selection algorithms can achieve…

Neural and Evolutionary Computing · Computer Science 2020-07-07 Weiyu Chen , Hisao Ishibuhci , Ke Shang

A Parallel Double Greedy Algorithm for Submodular Maximization

We study parallel algorithms for the problem of maximizing a non-negative submodular function. Our main result is an algorithm that achieves a nearly-optimal $1/2 -\epsilon$ approximation using $O(\log(1/\epsilon) / \epsilon)$ parallel…

Data Structures and Algorithms · Computer Science 2018-12-05 Alina Ene , Huy L. Nguyen , Adrian Vladu