Related papers: Automatic Bayesian Density Analysis

Statistical Models for the Analysis of Optimization Algorithms with Benchmark Functions

Frequentist statistical methods, such as hypothesis testing, are standard practice in papers that provide benchmark comparisons. Unfortunately, these methods have often been misused, e.g., without testing for their statistical test…

Methodology · Statistics 2021-05-18 David Issa Mattos, Jan Bosch, Helena Holmström Olsson

Bayesian data analysis in empirical software engineering: The case of missing data

Bayesian data analysis (BDA) is today used by a multitude of research disciplines. These disciplines use BDA as a way to embrace uncertainty by using multilevel models and making use of all available information at hand. In this chapter, we…

Software Engineering · Computer Science 2020-01-03 Richard Torkar, Robert Feldt, Carlo A. Furia

AIDE: An Automated Sample-based Approach for Interactive Data Exploration

In this paper, we argue that database systems be augmented with an automated data exploration service that methodically steers users through the data in a meaningful way. Such an automated system is crucial for deriving insights from…

Databases · Computer Science 2015-11-02 Kyriaki Dimitriadou, Olga Papaemmanouil, Yanlei Diao

DADA: Differentiable Automatic Data Augmentation

Data augmentation (DA) techniques aim to increase data variability, and thus train deep networks with better generalisation. The pioneering AutoAugment automated the search for optimal DA policies with reinforcement learning. However,…

Computer Vision and Pattern Recognition · Computer Science 2020-07-31 Yonggang Li, Guosheng Hu, Yongtao Wang, Timothy Hospedales, Neil M. Robertson, Yongxin Yang

Big Data Dimensional Analysis

The ability to collect and analyze large amounts of data is a growing problem within the scientific community. The growing gap between data and users calls for innovative tools that address the challenges faced by big data volume, velocity…

Databases · Computer Science 2016-08-01 Vijay Gadepally, Jeremy Kepner

Efficient Adaptive Data Analysis over Dense Distributions

Modern data workflows are inherently adaptive, repeatedly querying the same dataset to refine and validate sequential decisions, but such adaptivity can lead to overfitting and invalid statistical inference. Adaptive Data Analysis (ADA)…

Machine Learning · Computer Science 2026-02-10 Joon Suk Huh

Automated Bioinformatics Analysis via AutoBA

With the fast-growing and evolving omics data, the demand for streamlined and adaptable tools to handle the analysis continues to grow. In response to this need, we introduce Auto Bioinformatics Analysis (AutoBA), an autonomous AI agent…

Genomics · Quantitative Biology 2023-09-08 Juexiao Zhou, Bin Zhang, Xiuying Chen, Haoyang Li, Xiaopeng Xu, Siyuan Chen, Xin Gao

Approximate Bayesian Computation with Domain Expert in the Loop

Approximate Bayesian computation (ABC) is a popular likelihood-free inference method for models with intractable likelihood functions. As ABC methods usually rely on comparing summary statistics of observed and simulated data, the choice of…

Machine Learning · Statistics 2022-06-22 Ayush Bharti, Louis Filstroff, Samuel Kaski

New models for symbolic data analysis

Symbolic data analysis (SDA) is an emerging area of statistics concerned with understanding and modelling data that takes distributional form (i.e. symbols), such as random lists, intervals and histograms. It was developed under the premise…

Computation · Statistics 2020-04-09 Boris Beranger, Huan Lin, Scott A. Sisson

The Landscape of R Packages for Automated Exploratory Data Analysis

The increasing availability of large but noisy data sets with a large number of heterogeneous variables leads to the increasing interest in the automation of common tasks for data analysis. The most time-consuming part of this process is…

Computation · Statistics 2019-09-19 Mateusz Staniak, Przemyslaw Biecek

Automatic Bayesian inference for LISA data analysis strategies

We demonstrate the use of automatic Bayesian inference for the analysis of LISA data sets. In particular we describe a new automatic Reversible Jump Markov Chain Monte Carlo method to evaluate the posterior probability density functions of…

General Relativity and Quantum Cosmology · Physics 2009-11-11 Alexander Stroeer, Jonathan Gair, Alberto Vecchio

Augmented Data Science: Towards Industrialization and Democratization of Data Science

Conversion of raw data into insights and knowledge requires substantial amounts of effort from data scientists. Despite breathtaking advances in Machine Learning (ML) and Artificial Intelligence (AI), data scientists still spend the…

Artificial Intelligence · Computer Science 2019-09-13 Huseyin Uzunalioglu, Jin Cao, Chitra Phadke, Gerald Lehmann, Ahmet Akyamac, Ran He, Jeongran Lee, Maria Able

High-dimensional Statistical Inference and Variable Selection Using Sufficient Dimension Association

Simultaneous variable selection and statistical inference is challenging in high-dimensional data analysis. Most existing post-selection inference methods require explicitly specified regression models, which are often linear, as well as…

Methodology · Statistics 2026-03-19 Shangyuan Ye, Shauna Rakshe, Ye Liang

Bayesian Anomaly Detection and Classification

Statistical uncertainties are rarely incorporated in machine learning algorithms, especially for anomaly detection. Here we present the Bayesian Anomaly Detection And Classification (BADAC) formalism, which provides a unified statistical…

Machine Learning · Statistics 2019-02-26 Ethan Roberts, Bruce A. Bassett, Michelle Lochner

Fully Adaptive Bayesian Algorithm for Data Analysis, FABADA

This paper describes a novel non-parametric noise reduction technique, based on Bayesian inference, that may automatically improve the signal-to-noise ratio of one- and two-dimensional data, such as…

Instrumentation and Methods for Astrophysics · Physics 2023-07-07 Pablo M Sanchez-Alarcon, Yago Ascasibar Sequeiros

General Latent Feature Modeling for Data Exploration Tasks

This paper introduces a general Bayesian non-parametric latent feature model suitable to perform automatic exploratory analysis of heterogeneous datasets, where the attributes describing each object can be either discrete, continuous or…

Machine Learning · Statistics 2017-07-27 Isabel Valera, Melanie F. Pradier, Zoubin Ghahramani

Directly Handling Missing Data in Linear Discriminant Analysis for Enhancing Classification Accuracy and Interpretability

As the adoption of Artificial Intelligence (AI) models expands into critical real-world applications, ensuring the explainability of these models becomes paramount, particularly in sensitive fields such as medicine and finance. Linear…

Machine Learning · Computer Science 2024-10-10 Tuan L. Vo, Uyen Dang, Thu Nguyen

Sensitivity-Aware Amortized Bayesian Inference

Sensitivity analyses reveal the influence of various modeling choices on the outcomes of statistical analyses. While theoretically appealing, they are overwhelmingly inefficient for complex Bayesian models. In this work, we propose…

Machine Learning · Statistics 2024-08-29 Lasse Elsemüller, Hans Olischläger, Marvin Schmitt, Paul-Christian Bürkner, Ullrich Köthe, Stefan T. Radev

Explaining Predictions by Approximating the Local Decision Boundary

Constructing accurate model-agnostic explanations for opaque machine learning models remains a challenging task. Classification models for high-dimensional data, like images, are often inherently complex. To reduce this complexity,…

Machine Learning · Computer Science 2020-10-26 Georgios Vlassopoulos, Tim van Erven, Henry Brighton, Vlado Menkovski

Analysing symbolic data by pseudo-marginal methods

Symbolic data analysis (SDA) aggregates large individual-level datasets into a small number of distributional summaries, such as random rectangles or random histograms. The inference is carried out using these summaries in place of the…

Methodology · Statistics 2026-04-02 Yu Yang, Matias Quiroz, Boris Beranger, Robert Kohn, Scott A. Sisson