Related papers: Do We Really Even Need Data?

200 papers

As artificial intelligence and machine learning tools become more accessible, and scientists face new obstacles to data collection (e.g., rising costs, declining survey response rates), researchers increasingly use predictions from…

Machine Learning · Statistics 2025-12-08 Stephen Salerno, Kentaro Hoffman, Awan Afiaz, Anna Neufeld, Tyler H. McCormick, Jeffrey T. Leek

In this paper we present a discussion of the basic aspects of the well-known problem of prediction and inference in physics, with specific attention to the role of models, the use of data and the application of recent developments in…

General Physics · Physics 2024-10-07 Luca Gammaitoni, Angelo Vulpiani

Prediction, where observed data is used to quantify uncertainty about a future observation, is a fundamental problem in statistics. Prediction sets with coverage probability guarantees are a common solution, but these do not provide…

Statistics Theory · Mathematics 2022-11-22 Leonardo Cella, Ryan Martin

Researchers now routinely use AI or other machine learning methods to estimate latent variables of economic interest, then plug-in the estimates as covariates in a regression. We show both theoretically and empirically that naively treating…

Econometrics · Economics 2025-05-01 Laura Battaglia, Timothy Christensen, Stephen Hansen, Szymon Sacher

This paper discusses the fundamental principles of causal inference - the area of statistics that estimates the effect of specific occurrences, treatments, interventions, and exposures on a given outcome from experimental and observational…

Methodology · Statistics 2021-12-03 Francesca Dominici, Falco J. Bargagli-Stoffi, Fabrizia Mealli

Machine Learning explainability techniques have been proposed as a means of `explaining' or interrogating a model in order to understand why a particular decision or prediction has been made. Such an ability is especially important at a…

Machine Learning · Statistics 2022-02-28 Matthew J. Vowels

Data science has become increasingly essential for the production of official statistics, as it enables the automated collection, processing, and analysis of large amounts of data. With such data science practices in place, it enables more…

Machine Learning · Statistics 2023-06-08 Cedric De Boom, Michael Reusens

Supervised machine learning and predictive models have achieved an impressive standard today, enabling us to answer questions that were inconceivable a few years ago. Besides these successes, it becomes clear, that beyond pure prediction,…

Machine Learning · Statistics 2025-01-29 Cornelia Gruber, Patrick Oliver Schenk, Malte Schierholz, Frauke Kreuter, Göran Kauermann

Predictive algorithms inform consequential decisions in settings with selective labels: outcomes are observed only for units selected by past decision makers. This creates an identification problem under unobserved confounding -- when…

Econometrics · Economics 2025-11-07 Ashesh Rambachan, Amanda Coston, Edward Kennedy

Predictions about people, such as their expected educational achievement or their credit risk, can be performative and shape the outcome that they aim to predict. Understanding the causal effect of these predictions on the eventual outcomes…

Machine Learning · Statistics 2022-10-19 Celestine Mendler-Dünner, Frances Ding, Yixin Wang

A data science task can be deemed as making sense of the data or testing a hypothesis about it. The conclusions inferred from data can greatly guide us to make informative decisions. Big data has enabled us to carry out countless prediction…

Machine Learning · Computer Science 2022-01-12 Wenhao Zhang, Ramin Ramezani, Arash Naeim

High-dimensional multivariate longitudinal data, which arise when many outcome variables are measured repeatedly over time, are becoming increasingly common in social, behavioral and health sciences. We propose a latent variable model for…

Methodology · Statistics 2025-12-09 Sze Ming Lee, Yunxiao Chen, Tony Sit

Machine learning is the science of discovering statistical dependencies in data, and the use of those dependencies to perform predictions. During the last decade, machine learning has made spectacular progress, surpassing human performance…

Machine Learning · Statistics 2016-07-13 David Lopez-Paz

Causal inference is a critical research topic across many domains, such as statistics, computer science, education, public policy and economics, for decades. Nowadays, estimating causal effect from observational data has become an appealing…

Methodology · Statistics 2020-02-10 Liuyi Yao, Zhixuan Chu, Sheng Li, Yaliang Li, Jing Gao, Aidong Zhang

Clinical researchers often select among and evaluate risk prediction models using standard machine learning metrics based on confusion matrices. However, if these models are used to allocate interventions to patients, standard metrics…

Machine Learning · Statistics 2020-06-03 Alejandro Schuler, Aashish Bhardwaj, Vincent Liu

In this work, we empirically examine human-AI decision-making in the presence of explanations based on predicted outcomes. This type of explanation provides a human decision-maker with expected consequences for each decision alternative at…

Human-Computer Interaction · Computer Science 2022-08-31 Johannes Jakubik, Jakob Schöffer, Vincent Hoge, Michael Vössing, Niklas Kühl

The research area of algorithms with predictions has seen recent success showing how to incorporate machine learning into algorithm design to improve performance when the predictions are correct, while retaining worst-case guarantees when…

Machine Learning · Computer Science 2022-12-06 Michael Dinitz, Sungjin Im, Thomas Lavastida, Benjamin Moseley, Sergei Vassilvitskii

The emergence of generative AI models has dramatically expanded the availability and use of synthetic data across scientific, industrial, and policy domains. While these developments open new possibilities for data analysis, they also raise…

Machine Learning · Statistics 2026-03-06 Ahmad Abdel-Azim, Ruoyu Wang, Xihong Lin

Recent work has focused on the very common practice of prediction-based inference: that is, (i) using a pre-trained machine learning model to predict an unobserved response variable, and then (ii) conducting inference on the association…

Machine Learning · Statistics 2024-01-02 Keshav Motwani, Daniela Witten

Causal inference from observational data often assumes "ignorability," that all confounders are observed. This assumption is standard yet untestable. However, many scientific studies involve multiple causes, different variables whose…

Machine Learning · Statistics 2019-04-16 Yixin Wang, David M. Blei