Related papers: Interactive Set Discovery

Learning to Prune: Speeding up Repeated Computations

It is common to encounter situations where one must solve a sequence of similar computational problems. Running a standard algorithm with worst-case runtime guarantees on each instance will fail to take advantage of valuable structure…

Machine Learning · Computer Science 2019-04-29 Daniel Alabi , Adam Tauman Kalai , Katrina Ligett , Cameron Musco , Christos Tzamos , Ellen Vitercik

Finding Favourite Tuples on Data Streams with Provably Few Comparisons

One of the most fundamental tasks in data science is to assist a user with unknown preferences in finding high-utility tuples within a large database. To accurately elicit the unknown user preferences, a widely-adopted way is by asking the…

Databases · Computer Science 2023-07-07 Guangyi Zhang , Nikolaj Tatti , Aristides Gionis

Guided Data Discovery in Interactive Visualizations via Active Search

Recent advances in visual analytics have enabled us to learn from user interactions and uncover analytic goals. These innovations set the foundation for actively guiding users during data exploration. Providing such guidance will become…

Human-Computer Interaction · Computer Science 2022-07-19 Shayan Monadjemi , Sunwoo Ha , Quan Nguyen , Henry Chai , Roman Garnett , Alvitta Ottley

Efficient Biological Data Acquisition through Inference Set Design

In drug discovery, highly automated high-throughput laboratories are used to screen a large number of compounds in search of effective drugs. These experiments are expensive, so one might hope to reduce their cost by only experimenting on a…

Machine Learning · Computer Science 2025-04-15 Ihor Neporozhnii , Julien Roy , Emmanuel Bengio , Jason Hartford

Finding Good Itemsets by Packing Data

The problem of selecting small groups of itemsets that represent the data well has recently gained a lot of attention. We approach the problem by searching for the itemsets that compress the data efficiently. As a compression technique we…

Data Structures and Algorithms · Computer Science 2019-02-08 Nikolaj Tatti , Jilles Vreeken

Learning to search efficiently for causally near-optimal treatments

Finding an effective medical treatment often requires a search by trial and error. Making this search more efficient by minimizing the number of unnecessary trials could lower both costs and patient suffering. We formalize this problem as…

Machine Learning · Computer Science 2021-02-18 Samuel Håkansson , Viktor Lindblom , Omer Gottesman , Fredrik D. Johansson

Guided Visual Exploration of Relations in Data Sets

Efficient explorative data analysis systems must take into account both what a user knows and wants to know. This paper proposes a principled framework for interactive visual exploration of relations in data, through views most informative…

Machine Learning · Statistics 2021-07-02 Kai Puolamäki , Emilia Oikarinen , Andreas Henelius

Scientific Dataset Discovery via Topic-level Recommendation

Data intensive research requires the support of appropriate datasets. However, it is often time-consuming to discover usable datasets matching a specific research topic. We formulate the dataset discovery problem on an attributed…

Information Retrieval · Computer Science 2021-06-08 Basmah Altaf , Shichao Pei , Xiangliang Zhang

Exploiting Semidefinite Relaxations in Constraint Programming

Constraint programming uses enumeration and search tree pruning to solve combinatorial optimization problems. In order to speed up this solution process, we investigate the use of semidefinite relaxations within constraint programming. In…

Discrete Mathematics · Computer Science 2007-05-23 Willem Jan van Hoeve

Interactive Data Exploration with Smart Drill-Down

We present {\em smart drill-down}, an operator for interactively exploring a relational table to discover and summarize "interesting" groups of tuples. Each group of tuples is described by a {\em rule}. For instance, the rule $(a, b, \star,…

Databases · Computer Science 2016-12-20 Manas Joglekar , Hector Garcia-Molina , Aditya Parameswaran

Graph Pruning for Enumeration of Minimal Unsatisfiable Subsets

Finding Minimal Unsatisfiable Subsets (MUSes) of binary constraints is a common problem in infeasibility analysis of over-constrained systems. However, because of the exponential search space of the problem, enumerating MUSes is extremely…

Artificial Intelligence · Computer Science 2024-02-27 Panagiotis Lymperopoulos , Liping Liu

Determinantal Beam Search

Beam search is a go-to strategy for decoding neural sequence models. The algorithm can naturally be viewed as a subset optimization problem, albeit one where the corresponding set function does not reflect interactions between candidates.…

Computation and Language · Computer Science 2023-06-26 Clara Meister , Martina Forster , Ryan Cotterell

Guided Exploration of Sequential Rules

In pattern mining, sequential rules provide a formal framework to capture the temporal relationships and inferential dependencies between items. However, the discovery process is computationally intensive. To obtain mining results…

Databases · Computer Science 2026-02-20 Wensheng Gan , Gengsen Huang , Junyu Ren , Philip S. Yu

Efficient Locally Optimal Number Set Partitioning for Scheduling, Allocation and Fair Selection

We study the optimization version of the set partition problem (where the difference between the partition sums are minimized), which has numerous applications in decision theory literature. While the set partitioning problem is NP-hard and…

Data Structures and Algorithms · Computer Science 2021-09-13 Kaan Gokcesu , Hakan Gokcesu

Pruning variable selection ensembles

In the context of variable selection, ensemble learning has gained increasing interest due to its great potential to improve selection accuracy and to reduce false discovery rate. A novel ordering-based selective ensemble learning strategy…

Machine Learning · Statistics 2017-04-28 Chunxia Zhang , Yilei Wu , Mu Zhu

Discovery of Maximal Frequent Item Sets using Subset Creation

Data mining is the practice to search large amount of data to discover data patterns. Data mining uses mathematical algorithms to group the data and evaluate the future events. Association rule is a research area in the field of knowledge…

Databases · Computer Science 2013-02-08 Jnanamurthy H. K.

Active Learning Meets Optimized Item Selection

Designing recommendation systems with limited or no available training data remains a challenge. To that end, a new combinatorial optimization problem is formulated to generate optimized item selection for experimentation with the goal to…

Information Retrieval · Computer Science 2021-12-07 Bernard Kleynhans , Xin Wang , Serdar Kadıoğlu

Optimal Algorithms for Crawling a Hidden Database in the Web

A hidden database refers to a dataset that an organization makes accessible on the web by allowing users to issue queries through a search interface. In other words, data acquisition from such a source is not by following static…

Databases · Computer Science 2012-08-02 Cheng Sheng , Nan Zhang , Yufei Tao , Xin Jin

Ensemble pruning via an integer programming approach with diversity constraints

Ensemble learning combines multiple classifiers in the hope of obtaining better predictive performance. Empirical studies have shown that ensemble pruning, that is, choosing an appropriate subset of the available classifiers, can lead to…

Machine Learning · Computer Science 2022-05-03 Marcelo Antônio Mendes Bastos , Humberto Brandão César de Oliveira , Cristiano Arbex Valle

Frequent Itemset-driven Search for Finding Minimum Node Separators in Complex Networks

Finding an optimal set of critical nodes in a complex network has been a long-standing problem in the fields of both artificial intelligence and operations research. Potential applications include epidemic control, network security, carbon…

Neural and Evolutionary Computing · Computer Science 2022-01-19 Yangming Zhou , Xiaze Zhang , Na Geng , Zhibin Jiang , Mengchu Zhou