Related papers: Technical Report: Partial Dependence through Strat…

Automated Dependence Plots

In practical applications of machine learning, it is necessary to look beyond standard metrics such as test accuracy in order to validate various qualitative properties of a model. Partial dependence plots (PDP), including instance-specific…

Machine Learning · Computer Science 2020-07-31 David I. Inouye , Liu Leqi , Joon Sik Kim , Bryon Aragam , Pradeep Ravikumar

Fast Estimation of Partial Dependence Functions using Trees

Many existing interpretation methods are based on Partial Dependence (PD) functions that, for a pre-trained machine learning model, capture how a subset of the features affects the predictions by averaging over the remaining features.…

Machine Learning · Computer Science 2025-06-05 Jinyang Liu , Tessa Steensgaard , Marvin N. Wright , Niklas Pfister , Munir Hiabu

Relating the Partial Dependence Plot and Permutation Feature Importance to the Data Generating Process

Scientists and practitioners increasingly rely on machine learning to model data and draw conclusions. Compared to statistical modeling approaches, machine learning makes fewer explicit assumptions about data structures, such as linearity.…

Machine Learning · Statistics 2023-11-09 Christoph Molnar , Timo Freiesleben , Gunnar König , Giuseppe Casalicchio , Marvin N. Wright , Bernd Bischl

Hybrid additive modeling with partial dependence for supervised regression and dynamical systems forecasting

Learning processes by exploiting restricted domain knowledge is an important task across a plethora of scientific areas, with more and more hybrid training methods additively combining data-driven and model-based approaches. Although the…

Machine Learning · Computer Science 2025-01-17 Yann Claes , Vân Anh Huynh-Thu , Pierre Geurts

Learning Multi-Frequency Partial Correlation Graphs

Despite the large research effort devoted to learning dependencies between time series, the state of the art still faces a major limitation: existing methods learn partial correlations but fail to discriminate across distinct frequency…

Machine Learning · Computer Science 2024-07-08 Gabriele D'Acunto , Paolo Di Lorenzo , Francesco Bonchi , Stefania Sardellitti , Sergio Barbarossa

Demystifying Functional Random Forests: Novel Explainability Tools for Model Transparency in High-Dimensional Spaces

The advent of big data has raised significant challenges in analysing high-dimensional datasets across various domains such as medicine, ecology, and economics. Functional Data Analysis (FDA) has proven to be a robust framework for…

Machine Learning · Statistics 2024-08-23 Fabrizio Maturo , Annamaria Porreca

From XAI to MLOps: Explainable Concept Drift Detection with Profile Drift Detection

Predictive models often degrade in performance due to evolving data distributions, a phenomenon known as data drift. Among its forms, concept drift, where the relationship between explanatory variables and the response variable changes, is…

Machine Learning · Statistics 2026-05-18 Ugur Dar , Mustafa Cavus

Time-Series Forecasting: Unleashing Long-Term Dependencies with Fractionally Differenced Data

This study introduces a novel forecasting strategy that leverages the power of fractional differencing (FD) to capture both short- and long-term dependencies in time series data. Unlike traditional integer differencing methods, FD preserves…

Machine Learning · Computer Science 2023-12-05 Sarit Maitra , Vivek Mishra , Srashti Dwivedi , Sukanya Kundu , Goutam Kumar Kundu

Covariate-adjusted statistical dependence representation through partial copulas: bounds and new insights

In this paper, we revisit the notion of partial copula, originally introduced to test conditional independence, highlighting its capability to represent the dependence between two random variables after removing their dependence with a…

Methodology · Statistics 2026-05-26 Vinícius Litvinoff Justus , Felipe Fontana Vieira

Causal Dependence Plots

Explaining artificial intelligence or machine learning models is increasingly important. To use such data-driven systems wisely we must understand how they interact with the world, including how they depend causally on data inputs. In this…

Machine Learning · Computer Science 2023-07-06 Joshua R. Loftus , Lucius E. J. Bynum , Sakina Hansen

Covariate-informed reconstruction of partially observed functional data via factor models

This paper studies linear reconstruction of partially observed functional data which are recorded on a discrete grid. We propose a novel estimation approach based on approximate factor models with increasing rank taking into account…

Statistics Theory · Mathematics 2024-05-22 Maximilian Ofner , Siegfried Hörmann

How Much Can We See? A Note on Quantifying Explainability of Machine Learning Models

One of the most popular approaches to understanding feature effects of modern black box machine learning models are partial dependence plots (PDP). These plots are easy to understand but only able to visualize low order dependencies. The…

Machine Learning · Statistics 2019-12-17 Gero Szepannek

A discretization scheme for path-dependent FBSDEs and PDEs

This study develops a numerical scheme for path-dependent FBSDEs and PDEs. We introduce a Picard iteration method for solving path-dependent FBSDEs, prove its convergence to the true solution, and establish its rate of convergence. A key…

Probability · Mathematics 2025-10-01 Jiuk Jang , Hyungbin Park

Operator-Based Uncertainty Quantification of Stochastic Fractional PDEs

Fractional calculus provides a rigorous mathematical framework to describe anomalous stochastic processes by generalizing the notion of classical differential equations to their fractional-order counterparts. By introducing the fractional…

Numerical Analysis · Mathematics 2018-06-04 Ehsan Kharazmi , Mohsen Zayernouri

Statistical inference for semiparametric varying-coefficient partially linear models with error-prone linear covariates

We study semiparametric varying-coefficient partially linear models when some linear covariates are not observed, but ancillary variables are available. Semiparametric profile least-square based estimation procedures are developed for…

Statistics Theory · Mathematics 2009-03-04 Yong Zhou , Hua Liang

Regression Discontinuity Designs: A Decision Theoretic Approach

The regression discontinuity design (RDD) is a quasi-experimental design that can be used to identify and estimate the causal effect of a treatment using observational data. In an RDD, a pre-specified rule is used for treatment assignment,…

Methodology · Statistics 2016-01-05 Panayiota Constantinou , Aidan G. O'Keeffe

Estimating False Discovery Proportion Under Arbitrary Covariance Dependence

Multiple hypothesis testing is a fundamental problem in high dimensional inference, with wide applications in many scientific fields. In genome-wide association studies, tens of thousands of tests are performed simultaneously to find if any…

Methodology · Statistics 2011-11-16 Jianqing Fan , Xu Han , Weijie Gu

Efficient Web-based Data Imputation with Graph Model

A challenge for data imputation is the lack of knowledge. In this paper, we attempt to address this challenge by involving extra knowledge from web. To achieve high-performance web-based imputation, we use the dependency, i.e.FDs and CFDs,…

Databases · Computer Science 2016-11-15 Yiwen Tang , Hongzhi Wang , Shiwei Zhang , Huijun Zhang , Ruoxi Shi

Measuring Approximate Functional Dependencies: a Comparative Study

Approximate functional dependencies (AFDs) are functional dependencies (FDs) that "almost" hold in a relation. While various measures have been proposed to quantify the level to which an FD holds approximately, they are difficult to compare…

Databases · Computer Science 2023-12-12 Marcel Parciak , Sebastiaan Weytjens , Niel Hens , Frank Neven , Liesbet M. Peeters , Stijn Vansummeren

Asymptotic Uncertainty of False Discovery Proportion for Dependent $t$-Tests

Multiple testing is a fundamental problem in high-dimensional statistical inference. Although many methods have been proposed to control false discoveries, it is still a challenging task when the tests are correlated to each other. To…

Statistics Theory · Mathematics 2022-07-06 Meng Mei , Yuan Jiang