Related papers: Discovering Valuable Items from Massive Data

Selecting the top-quality item through crowd scoring

We investigate crowdsourcing algorithms for finding the top-quality item within a large collection of objects with unknown intrinsic quality values. This is an important problem with many relevant applications, for example in networked…

Human-Computer Interaction · Computer Science 2017-10-03 Alessandro Nordio , Alberto Tarable , Emilio Leonardi , Marco Ajmone Marsan

The robust selection problem with information discovery

We explore a multiple-stage variant of the min-max robust selection problem with budgeted uncertainty that includes queries. First, one queries a subset of items and gets the exact values of their uncertain parameters. Given this…

Optimization and Control · Mathematics 2025-01-07 Xiaoyu Chen , Marc Goerigk , Michael Poss

Joint Optimization and Variable Selection of High-dimensional Gaussian Processes

Maximizing high-dimensional, non-convex functions through noisy observations is a notoriously hard problem, but one that arises in many applications. In this paper, we tackle this challenge by modeling the unknown function as a sample from…

Machine Learning · Computer Science 2012-07-03 Bo Chen , Rui Castro , Andreas Krause

Fast and Scalable Spike and Slab Variable Selection in High-Dimensional Gaussian Processes

Variable selection in Gaussian processes (GPs) is typically undertaken by thresholding the inverse lengthscales of automatic relevance determination kernels, but in high-dimensional datasets this approach can be unreliable. A more…

Machine Learning · Statistics 2022-02-25 Hugh Dance , Brooks Paige

Maximizing diversity over clustered data

Maximum diversity aims at selecting a diverse set of high-quality objects from a collection, which is a fundamental problem and has a wide range of applications, e.g., in Web search. Diversity under a uniform or partition matroid constraint…

Data Structures and Algorithms · Computer Science 2021-04-13 Guangyi Zhang , Aristides Gionis

Finding Good Itemsets by Packing Data

The problem of selecting small groups of itemsets that represent the data well has recently gained a lot of attention. We approach the problem by searching for the itemsets that compress the data efficiently. As a compression technique we…

Data Structures and Algorithms · Computer Science 2019-02-08 Nikolaj Tatti , Jilles Vreeken

Deep Kernels with Probabilistic Embeddings for Small-Data Learning

Gaussian Processes (GPs) are known to provide accurate predictions and uncertainty estimates even with small amounts of labeled data by capturing similarity between data points through their kernel function. However traditional GP kernels…

Machine Learning · Computer Science 2021-11-16 Ankur Mallick , Chaitanya Dwivedi , Bhavya Kailkhura , Gauri Joshi , T. Yong-Jin Han

Statistical mechanics analysis of general multi-dimensional knapsack problems

Knapsack problem (KP) is a representative combinatorial optimization problem that aims to maximize the total profit by selecting a subset of items under given constraints on the total weights. In this study, we analyze a generalized version…

Optimization and Control · Mathematics 2022-08-23 Yuta Nakamura , Takashi Takahashi , Yoshiyuki Kabashima

A tutorial on learning from preferences and choices with Gaussian Processes

Preference modelling lies at the intersection of economics, decision theory, machine learning and statistics. By understanding individuals' preferences and how they make choices, we can build products that closely match their expectations,…

Machine Learning · Computer Science 2026-05-19 Alessio Benavoli , Dario Azzimonti

Distributed Maximization of Submodular plus Diversity Functions for Multi-label Feature Selection on Huge Datasets

There are many problems in machine learning and data mining which are equivalent to selecting a non-redundant, high "quality" set of objects. Recommender systems, feature selection, and data summarization are among many applications of…

Machine Learning · Computer Science 2019-04-19 Mehrdad Ghadiri , Mark Schmidt

Optimizing Revenue over Data-driven Assortments

We revisit the problem of large-scale assortment optimization under the multinomial logit choice model without any assumptions on the structure of the feasible assortments. Scalable real-time assortment optimization has become essential in…

Optimization and Control · Mathematics 2018-05-02 Deeksha Sinha , Theja Tulabandhula

Active embedding search via noisy paired comparisons

Suppose that we wish to estimate a user's preference vector $w$ from paired comparisons of the form "does user $w$ prefer item $p$ or item $q$?," where both the user and items are embedded in a low-dimensional Euclidean space with distances…

Machine Learning · Statistics 2019-05-27 Gregory H. Canal , Andrew K. Massimino , Mark A. Davenport , Christopher J. Rozell

Learning Choice Functions with Gaussian Processes

In consumer theory, ranking available objects by means of preference relations yields the most common description of individual choices. However, preference-based models assume that individuals: (1) give their preferences only between pairs…

Machine Learning · Computer Science 2023-02-02 Alessio Benavoli , Dario Azzimonti , Dario Piga

The Sample Complexity of Best-$k$ Items Selection from Pairwise Comparisons

This paper studies the sample complexity (aka number of comparisons) bounds for the active best-$k$ items selection from pairwise comparisons. From a given set of items, the learner can make pairwise comparisons on every pair of items, and…

Machine Learning · Computer Science 2021-08-02 Wenbo Ren , Jia Liu , Ness B. Shroff

Efficient Multiscale Gaussian Process Regression using Hierarchical Clustering

Standard Gaussian Process (GP) regression, a powerful machine learning tool, is computationally expensive when it is applied to large datasets, and potentially inaccurate when data points are sparsely distributed in a high-dimensional…

Machine Learning · Computer Science 2016-03-08 Z. Zhang , K. Duraisamy , N. A. Gumerov

Feature Selection for Value Function Approximation Using Bayesian Model Selection

Feature selection in reinforcement learning (RL), i.e. choosing basis functions such that useful approximations of the unkown value function can be obtained, is one of the main challenges in scaling RL to real-world applications. Here we…

Artificial Intelligence · Computer Science 2012-02-01 Tobias Jung , Peter Stone

On the Complexity and Approximation of the Maximum Expected Value All-or-Nothing Subset

An unconstrained nonlinear binary optimization problem of selecting a maximum expected value subset of items is considered. Each item is associated with a profit and probability. Each of the items succeeds or fails independently with the…

Computational Complexity · Computer Science 2017-06-23 Noam Goldberg , Gabor Rudolf

Optimizing Revenue while showing Relevant Assortments at Scale

Scalable real-time assortment optimization has become essential in e-commerce operations due to the need for personalization and the availability of a large variety of items. While this can be done when there are simplistic assortment…

Artificial Intelligence · Computer Science 2021-03-03 Theja Tulabandhula , Deeksha Sinha , Saketh Karra

Measuring the robustness of Gaussian processes to kernel choice

Gaussian processes (GPs) are used to make medical and scientific decisions, including in cardiac care and monitoring of atmospheric carbon dioxide levels. Notably, the choice of GP kernel is often somewhat arbitrary. In particular,…

Machine Learning · Statistics 2022-03-15 William T. Stephenson , Soumya Ghosh , Tin D. Nguyen , Mikhail Yurochkin , Sameer K. Deshpande , Tamara Broderick

Feature Selection Facilitates Learning Mixtures of Discrete Product Distributions

Feature selection can facilitate the learning of mixtures of discrete random variables as they arise, e.g. in crowdsourcing tasks. Intuitively, not all workers are equally reliable but, if the less reliable ones could be eliminated, then…

Machine Learning · Statistics 2017-11-28 Vincent Zhao , Steven W. Zucker