Related papers: Model Averaging and Double Machine Learning

ddml: Double/debiased machine learning in Stata

We introduce the package ddml for Double/Debiased Machine Learning (DDML) in Stata. Estimators of causal parameters for five different econometric models are supported, allowing for flexible estimation of causal effects of endogenous…

Econometrics · Economics 2024-01-09 Achim Ahrens , Christian B. Hansen , Mark E. Schaffer , Thomas Wiemann

xtdml: Double Machine Learning Estimation to Static Panel Data Models with Fixed Effects in R

The double machine learning (DML) method combines the predictive power of machine learning with statistical estimation to conduct inference about the structural parameter of interest. This paper presents the R package `xtdml`, which…

Econometrics · Economics 2025-12-19 Annalivia Polselli

Multiway Cluster Robust Double/Debiased Machine Learning

This paper investigates double/debiased machine learning (DML) under multiway clustered sampling environments. We propose a novel multiway cross fitting algorithm and a multiway DML estimator based on this algorithm. We also develop a…

Econometrics · Economics 2020-03-05 Harold D. Chiang , Kengo Kato , Yukun Ma , Yuya Sasaki

Double/Debiased Machine Learning for Treatment and Causal Parameters

Most modern supervised statistical/machine learning (ML) methods are explicitly designed to solve prediction problems very well. Achieving this goal does not imply that these methods automatically deliver good estimators of causal…

Machine Learning · Statistics 2024-11-05 Victor Chernozhukov , Denis Chetverikov , Mert Demirer , Esther Duflo , Christian Hansen , Whitney Newey , James Robins

Adaptive debiased machine learning using data-driven model selection techniques

Debiased machine learning estimators for smooth functionals in nonparametric models can exhibit substantial variability and instability, often leading practitioners to instead rely on parametric or semiparametric working models. Such…

Methodology · Statistics 2026-03-20 Lars van der Laan , Marco Carone , Alex Luedtke , Mark van der Laan

Using stacking to average Bayesian predictive distributions

The widely recommended procedure of Bayesian model averaging is flawed in the M-open setting in which the true data-generating process is not one of the candidate models being fit. We take the idea of stacking from the point estimation…

Methodology · Statistics 2018-10-15 Yuling Yao , Aki Vehtari , Daniel Simpson , Andrew Gelman

Enhancing binary classification: A new stacking method via leveraging computational geometry

Stacking, a potent ensemble learning method, leverages a meta-model to harness the strengths of multiple base models, thereby enhancing prediction accuracy. Traditional stacking techniques typically utilize established learning models, such…

Machine Learning · Computer Science 2024-10-31 Wei Wu , Liang Tang , Zhongjie Zhao , Chung-Piaw Teo

Double Machine Learning meets Panel Data -- Promises, Pitfalls, and Potential Solutions

Estimating causal effect using machine learning (ML) algorithms can help to relax functional form assumptions if used within appropriate frameworks. However, most of these frameworks assume settings with cross-sectional data, whereas…

Econometrics · Economics 2024-09-04 Jonathan Fuhr , Dominik Papies

Neighborhood Stability in Double/Debiased Machine Learning with Dependent Data

This paper studies double/debiased machine learning (DML) methods applied to weakly dependent data. We allow observations to be situated in a general metric space that accommodates spatial and network data. Existing work implements…

Econometrics · Economics 2025-11-17 Jianfei Cao , Michael P. Leung

An Introduction to Double/Debiased Machine Learning

This paper provides an introduction to Double/Debiased Machine Learning (DML). DML is a general approach to performing inference about a target parameter in the presence of nuisance functions: objects that are needed to identify the target…

Econometrics · Economics 2026-02-16 Achim Ahrens , Victor Chernozhukov , Christian Hansen , Damian Kozbur , Mark Schaffer , Thomas Wiemann

Model Averaging by Cross-validation for Partially Linear Functional Additive Models

In this paper, we propose a model averaging approach for addressing model uncertainty in the context of partial linear functional additive models. These models are designed to describe the relation between a response and mixed-types of…

Methodology · Statistics 2023-06-12 Shishi Liu , Jingxiao Zhang

Improving the Finite Sample Estimation of Average Treatment Effects using Double/Debiased Machine Learning with Propensity Score Calibration

In the last decade, machine learning techniques have gained popularity for estimating causal effects. One machine learning approach that can be used for estimating an average treatment effect is Double/debiased machine learning (DML)…

Econometrics · Economics 2025-01-17 Daniele Ballinari , Nora Bearth

Double Machine Learning for Static Panel Models with Fixed Effects

Recent advances in causal inference have seen the development of methods which make use of the predictive power of machine learning algorithms. In this paper, we develop novel double machine learning (DML) procedures for panel data in which…

Econometrics · Economics 2025-01-03 Paul S. Clarke , Annalivia Polselli

Model Averaging for Support Vector Machine by Cross-Validation

Support vector machine (SVM) is a well-known statistical technique for classification problems in machine learning and other fields. An important question for SVM is the selection of covariates (or features) for the model. Many studies have…

Methodology · Statistics 2022-02-22 Jiahui Zou , Chaoxia Yuan , Xinyu Zhang , Guohua Zou , Alan T. K. Wan

Bayesian hierarchical stacking: Some models are (somewhere) useful

Stacking is a widely used model averaging technique that asymptotically yields optimal predictions among linear averages. We show that stacking is most effective when model predictive performance is heterogeneous in inputs, and we can…

Methodology · Statistics 2021-10-29 Yuling Yao , Gregor Pirš , Aki Vehtari , Andrew Gelman

Debiasing Algorithm through Model Adaptation

Large language models are becoming the go-to solution for the ever-growing number of tasks. However, with growing capacity, models are prone to rely on spurious correlations stemming from biases and stereotypes present in the training data.…

Computation and Language · Computer Science 2024-05-30 Tomasz Limisiewicz , David Mareček , Tomáš Musil

Bootstrap consistency for general double/debiased machine learning estimators

Double/debiased machine learning (DML) provides a general framework for inference with high-dimensional or otherwise complex nuisance parameters by combining Neyman-orthogonal scores with cross-fitting, thereby circumventing classical…

Statistics Theory · Mathematics 2026-04-21 Ziming Lin , Fang Han

Multi-layer Stack Ensembles for Time Series Forecasting

Ensembling is a powerful technique for improving the accuracy of machine learning models, with methods like stacking achieving strong results in tabular tasks. In time series forecasting, however, ensemble methods remain underutilized, with…

Machine Learning · Computer Science 2025-11-20 Nathanael Bosch , Oleksandr Shchur , Nick Erickson , Michael Bohlke-Schneider , Caner Türkmen

A Bayes interpretation of stacking for M-complete and M-open settings

In M-open problems where no true model can be conceptualized, it is common to back off from modeling and merely seek good prediction. Even in M-complete problems, taking a predictive approach can be very useful. Stacking is a model…

Statistics Theory · Mathematics 2016-02-17 Tri Le , Bertrand Clarke

A Generalized Stacking for Implementing Ensembles of Gradient Boosting Machines

The gradient boosting machine is one of the powerful tools for solving regression problems. In order to cope with its shortcomings, an approach for constructing ensembles of gradient boosting models is proposed. The main idea behind the…

Machine Learning · Computer Science 2020-10-14 Andrei V. Konstantinov , Lev V. Utkin