Related papers: Safe projections of binary data sets

Sparse Data-Driven Random Projection in Regression for High-Dimensional Data

We examine the linear regression problem in a challenging high-dimensional setting with correlated predictors where the vector of coefficients can vary from sparse to dense. In this setting, we propose a combination of probabilistic…

Methodology · Statistics 2025-05-13 Roman Parzer, Peter Filzmoser, Laura Vana-Gür

Likelihood Ratio Confidence Sets for Sequential Decision Making

Certifiable, adaptive uncertainty estimates for unknown quantities are an essential ingredient of sequential decision-making algorithms. Standard approaches rely on problem-dependent concentration results and are limited to a specific…

Machine Learning · Computer Science 2023-11-09 Nicolas Emmenegger, Mojmír Mutný, Andreas Krause

Stochastic Packing Integer Programs with Few Queries

We consider a stochastic variant of the packing-type integer linear programming problem, which contains random variables in the objective vector. We are allowed to reveal each entry of the objective vector by conducting a query, and the…

Data Structures and Algorithms · Computer Science 2019-03-14 Takanori Maehara, Yutaro Yamaguchi

Expressing and visualizing model uncertainty in Bayesian variable selection using Cartesian credible sets

Modern regression applications can involve hundreds or thousands of variables which motivates the use of variable selection methods. Bayesian variable selection defines a posterior distribution on the possible subsets of the variables…

Methodology · Statistics 2024-10-16 J. E. Griffin

From Data to Uncertainty Sets: a Machine Learning Approach

Existing approaches of prescriptive analytics -- where inputs of an optimization model can be predicted by leveraging covariates in a machine learning model -- often attempt to optimize the mean value of an uncertain objective. However,…

Machine Learning · Computer Science 2025-03-05 Dimitris Bertsimas, Benjamin Boucher

Random Logic Programs: Linear Model

This paper proposes a model, the linear model, for randomly generating logic programs with low density of rules and investigates statistical properties of such random logic programs. It is mathematically shown that the average number of…

Artificial Intelligence · Computer Science 2015-10-07 Kewen Wang, Lian Wen, Kedian Mu

Set Selection under Explorable Stochastic Uncertainty via Covering Techniques

Given subsets of uncertain values, we study the problem of identifying the subset of minimum total value (sum of the uncertain values) by querying as few values as possible. This set selection problem falls into the field of explorable…

Data Structures and Algorithms · Computer Science 2023-06-16 Nicole Megow, Jens Schlöter

Consistent and Flexible Selectivity Estimation for High-Dimensional Data

Selectivity estimation aims at estimating the number of database objects that satisfy a selection criterion. Answering this problem accurately and efficiently is essential to many applications, such as density estimation, outlier detection,…

Databases · Computer Science 2021-05-28 Yaoshu Wang, Chuan Xiao, Jianbin Qin, Rui Mao, Onizuka Makoto, Wei Wang, Rui Zhang, Yoshiharu Ishikawa

Binary Random Projections with Controllable Sparsity Patterns

Random projection is often used to project higher-dimensional vectors onto a lower-dimensional space, while approximately preserving their pairwise distances. It has emerged as a powerful tool in various data processing tasks and has…

Machine Learning · Computer Science 2020-06-30 Wenye Li, Shuzhong Zhang

A Theory of Selective Prediction

We consider a model of selective prediction, where the prediction algorithm is given a data sequence in an online fashion and asked to predict a pre-specified statistic of the upcoming data points. The algorithm is allowed to choose when to…

Machine Learning · Computer Science 2019-05-30 Mingda Qiao, Gregory Valiant

Gap Safe screening rules for sparsity enforcing penalties

In high-dimensional regression settings, sparsity-enforcing penalties have proved useful to regularize the data-fitting term. A recently introduced technique called screening rules proposes to ignore some variables in the optimization…

Machine Learning · Statistics 2017-12-29 Eugene Ndiaye, Olivier Fercoq, Alexandre Gramfort, Joseph Salmon

Fair Regression: Quantitative Definitions and Reduction-based Algorithms

In this paper, we study the prediction of a real-valued target, such as a risk score or recidivism rate, while guaranteeing a quantitative notion of fairness with respect to a protected attribute such as gender or race. We call this class…

Machine Learning · Computer Science 2019-05-31 Alekh Agarwal, Miroslav Dudík, Zhiwei Steven Wu

Safe Screening Rules for Generalized Double Sparsity Learning

In a high-dimensional setting, sparse models have shown their power in computational and statistical efficiency. We consider the variable selection problem with a broad class of simultaneous sparsity regularization, enforcing both feature-wise and…

Optimization and Control · Mathematics 2021-09-27 Xinyu Zhang

Reachability of Linear Uncertain Systems: Sampling Based Approaches

In this work, we perform safety analysis of linear dynamical systems with uncertainties. Instead of computing a conservative overapproximation of the reachable set, our approach involves computing a statistical approximate reachable set. As…

Systems and Control · Electrical Eng. & Systems 2021-09-17 Bineet Ghosh, Parasara Sridhar Duggirala

Subset Selection for Multiple Linear Regression via Optimization

Subset selection in multiple linear regression aims to choose a subset of candidate explanatory variables that trades off fitting error (explanatory power) against model complexity (number of variables selected). We build mathematical programming…

Machine Learning · Statistics 2020-09-04 Young Woong Park, Diego Klabjan

Selective prediction-set models with coverage guarantees

Though black-box predictors are state-of-the-art for many complex tasks, they often fail to properly quantify predictive uncertainty and may provide inappropriate predictions for unfamiliar data. Instead, we can learn more reliable models…

Machine Learning · Statistics 2021-12-14 Jean Feng, Arjun Sondhi, Jessica Perry, Noah Simon

Bayesian subset selection and variable importance for interpretable prediction and classification

Subset selection is a valuable tool for interpretable learning, scientific discovery, and data compression. However, classical subset selection is often avoided due to selection instability, lack of regularization, and difficulties with…

Machine Learning · Statistics 2022-02-17 Daniel R. Kowal

Consistency and Uncertainty: Identifying Unreliable Responses From Black-Box Vision-Language Models for Selective Visual Question Answering

The goal of selective prediction is to allow a model to abstain when it may not be able to deliver a reliable prediction, which is important in safety-critical contexts. Existing approaches to selective prediction typically require…

Computer Vision and Pattern Recognition · Computer Science 2024-04-17 Zaid Khan, Yun Fu

Structured Variable Selection with Sparsity-Inducing Norms

We consider the empirical risk minimization problem for linear supervised learning, with regularization by structured sparsity-inducing norms. These are defined as sums of Euclidean norms on certain subsets of variables, extending the usual…

Machine Learning · Statistics 2011-11-23 Rodolphe Jenatton, Jean-Yves Audibert, Francis Bach

Safe Reinforcement Learning via Projection on a Safe Set: How to Achieve Optimality?

For all its successes, Reinforcement Learning (RL) still struggles to deliver formal guarantees on the closed-loop behavior of the learned policy. Among other things, guaranteeing the safety of RL with respect to safety-critical systems is…

Systems and Control · Electrical Eng. & Systems 2020-04-03 Sebastien Gros, Mario Zanon, Alberto Bemporad