English
Related papers

Related papers: A Statistical Approach Towards Robust Progress Est…

200 papers

The ability to estimate resource consumption of SQL queries is crucial for a number of tasks in a database system such as admission control, query scheduling and costing during query optimization. Recent work has explored the use of…

Databases · Computer Science 2012-08-02 Jiexing Li , Arnd Christian König , Vivek Narasayya , Surajit Chaudhuri

The last decade has seen a number of advances in computationally efficient algorithms for statistical methods subject to robustness constraints. An estimator may be robust in a number of different ways: to contamination of the dataset, to…

Machine Learning · Statistics 2025-09-08 Gautam Kamath

Formulating efficient SQL queries requires several cycles of tuning and execution, particularly for inexperienced users. We examine methods that can accelerate and improve this interaction by providing insights about SQL queries prior to…

Databases · Computer Science 2020-02-24 Zainab Zolaktaf , Mostafa Milani , Rachel Pottinger

Statistical and structural modeling represent two distinct approaches to data analysis. In this paper, we propose a set of novel methods for combining statistical and structural models for improved prediction and causal inference. Our first…

Econometrics · Economics 2020-06-11 Jiaming Mao , Jingzhi Xu

An important challenge in statistical analysis lies in controlling the bias of estimators due to the ever-increasing data size and model complexity. Approximate numerical methods and data features like censoring and misclassification often…

Statistics Theory · Mathematics 2020-11-17 Stéphane Guerrier , Mucyo Karemera , Samuel Orso , Maria-Pia Victoria-Feser , Yuming Zhang

The aim of this paper is to present a new estimation procedure that can be applied in many statistical frameworks including density and regression and which leads to both robust and optimal (or nearly optimal) estimators. In density…

Statistics Theory · Mathematics 2017-01-23 Yannick Baraud , Lucien Birgé , Mathieu Sart

As the use of machine learning in high impact domains becomes widespread, the importance of evaluating safety has increased. An important aspect of this is evaluating how robust a model is to changes in setting or population, which…

Machine Learning · Computer Science 2021-03-16 Adarsh Subbaswamy , Roy Adams , Suchi Saria

There has been increasing interest in recent years in the development of approaches to estimate causal effects when the number of potential confounders is prohibitively large. This growth in interest has led to a number of potential…

Methodology · Statistics 2020-02-05 Joseph Antonelli , Matthew Cefalu

Fully robust versions of the elastic net estimator are introduced for linear and logistic regression. The algorithms to compute the estimators are based on the idea of repeatedly applying the non-robust classical estimators to data subsets…

Methodology · Statistics 2017-03-16 Fatma Sevinc Kurnaz , Irene Hoffmann , Peter Filzmoser

With the wide adoption of machine learning techniques, requirements have evolved beyond sheer high performance, often requiring models to be trustworthy. A common approach to increase the trustworthiness of such systems is to allow them to…

Machine Learning · Computer Science 2023-11-16 Andrea Pugnana , Carlos Mougan , Dan Saattrup Nielsen

Model performance evaluation is a critical and expensive task in machine learning and computer vision. Without clear guidelines, practitioners often estimate model accuracy using a one-time completely random selection of the data. However,…

Computer Vision and Pattern Recognition · Computer Science 2024-07-19 Riccardo Fogliato , Pratik Patil , Mathew Monfort , Pietro Perona

Considering the increasing size of available data, the need for statistical methods that control the finite sample bias is growing. This is mainly due to the frequent settings where the number of variables is large and allowed to increase…

Statistics Theory · Mathematics 2018-10-12 Stéphane Guerrier , Mucyo Karemera , Samuel Orso , Maria-Pia Victoria-Feser

Local decision rules are commonly understood to be more explainable, due to the local nature of the patterns involved. With numerical optimization methods such as gradient boosting, ensembles of local decision rules can gain good predictive…

Machine Learning · Computer Science 2025-08-27 Xin Du , Subramanian Ramamoorthy , Wouter Duivesteijn , Jin Tian , Mykola Pechenizkiy

Active statistical inference is a new method for inference with AI-assisted data collection. Given a budget on the number of labeled data points that can be collected and assuming access to an AI predictive model, the basic idea is to…

Machine Learning · Statistics 2025-11-13 Puheng Li , Tijana Zrnic , Emmanuel Candès

This paper considers an empirical likelihood inference for parameters defined by general estimating equations, when data are missing at random. The efficiency of existing estimators depends critically on correctly specifying the conditional…

Methodology · Statistics 2016-12-06 Tianqing Liu , Xiaohui Yuan , Zhaohai Li , Aiyi Liu

In many situations, data are recorded over a period of time and may be regarded as realizations of a stochastic process. In this paper, robust estimators for the principal components are considered by adapting the projection pursuit…

Statistics Theory · Mathematics 2012-03-12 Juan Lucas Bali , Graciela Boente , David E. Tyler , Jane-Ling Wang

State estimation or filtering serves as a fundamental task to enable intelligent decision-making in applications such as autonomous vehicles, robotics, healthcare monitoring, smart grids, intelligent transportation, and predictive…

Machine Learning · Computer Science 2025-06-16 Aamir Hussain Chughtai

This paper addresses the problem of providing robust estimators under a functional logistic regression model. Logistic regression is a popular tool in classification problems with two populations. As in functional linear regression,…

Methodology · Statistics 2023-08-16 Graciela Boente , Marina Valdora

In many instances, the application of approximate Bayesian methods is hampered by two practical features: 1) the requirement to project the data down to low-dimensional summary, including the choice of this projection, which ultimately…

Methodology · Statistics 2020-06-26 David T. Frazier

Ordinal user-provided ratings across multiple items are frequently encountered in both scientific and commercial applications. Whilst recommender systems are known to do well on these type of data from a predictive point of view, their…

Methodology · Statistics 2025-03-05 Sjoerd Hermes
‹ Prev 1 2 3 10 Next ›