English
Related papers

Related papers: Collaborative causal inference on distributed data

200 papers

Observational studies enable causal inferences when randomized controlled trials (RCTs) are not feasible. However, integrating sensitive medical data across multiple institutions introduces significant privacy challenges. The data…

Data sharing barriers are paramount challenges arising from multicenter clinical trials where multiple data sources are stored in a distributed fashion at different local study sites. Merging such data sources into a common data storage for…

Methodology · Statistics 2022-04-05 Mengtong Hu , Xu Shi , Peter X. -K. Song

The estimation of conditional average treatment effects (CATEs) is an important topic in many scientific fields. CATEs can be estimated with high accuracy if data distributed across multiple parties are centralized. However, it is difficult…

Methodology · Statistics 2025-07-28 Yuji Kawamata , Ryoki Motai , Yukihiko Okada , Akira Imakura , Tetsuya Sakurai

Having a large number of covariates can have a negative impact on the quality of causal effect estimation since confounding adjustment becomes unreliable when the number of covariates is large relative to the samples available. Propensity…

Methodology · Statistics 2020-09-15 Debo Cheng , Jiuyong Li , Lin Liu , Jixue Liu

Causal discovery serves a pivotal role in mitigating model uncertainty through recovering the underlying causal mechanisms among variables. In many practical domains, such as healthcare, access to the data gathered by individual entities is…

Machine Learning · Computer Science 2024-02-13 Amin Abyaneh , Nino Scherrer , Patrick Schwab , Stefan Bauer , Bernhard Schölkopf , Arash Mehrjou

Multi-source data fusion, in which multiple data sources are jointly analyzed to obtain improved information, has considerable research attention. For the datasets of multiple medical institutions, data confidentiality and…

Machine Learning · Computer Science 2022-09-01 Akira Imakura , Tetsuya Sakurai , Yukihiko Okada , Tomoya Fujii , Teppei Sakamoto , Hiroyuki Abe

Missing data is a common problem in clinical data collection, which causes difficulty in the statistical analysis of such data. To overcome problems caused by incomplete data, we propose a new imputation method called projective resampling…

Methodology · Statistics 2021-06-17 Zishu Zhan , Xiangjie Li , Jingxiao Zhang

The sharing of patient-level data necessary for covariate-adjusted survival analysis between medical institutions is difficult due to privacy protection restrictions. We propose a privacy-preserving framework that estimates balanced…

Utilizing covariate information has been a powerful approach to improve the efficiency and accuracy for causal inference, which support massive amount of randomized experiments run on data-driven enterprises. However, state-of-art…

Methodology · Statistics 2023-11-06 Yuhang Wu , Jinghai He , Zeyu Zheng

A number of methods have been proposed for causal effect estimation, yet few have demonstrated efficacy in handling data with complex structures, such as images. To fill this gap, we propose Causal Multi-task Deep Ensemble (CMDE), a novel…

Machine Learning · Computer Science 2023-05-30 Ziyang Jiang , Zhuoran Hou , Yiling Liu , Yiman Ren , Keyu Li , David Carlson

Estimating individual-level treatment effect from observational data is a fundamental problem in causal inference and has attracted increasing attention in the fields of education, healthcare, and public policy.In this work, we concentrate…

Machine Learning · Computer Science 2025-07-10 Hui Meng , Keping Yang , Xuyu Peng , Bo Zheng

Distributed data analysis without revealing the individual data has recently attracted significant attention in several applications. A collaborative data analysis through sharing dimensionality reduced representations of data has been…

Machine Learning · Computer Science 2021-01-28 Akira Imakura , Anna Bogdanova , Takaya Yamazoe , Kazumasa Omote , Tetsuya Sakurai

This work considers the problem of Distributed Mean Estimation (DME) over networks with intermittent connectivity, where the goal is to learn a global statistic over the data samples localized across distributed nodes with the help of a…

Information Theory · Computer Science 2023-03-02 Rajarshi Saha , Mohamed Seif , Michal Yemini , Andrea J. Goldsmith , H. Vincent Poor

Data sharing barriers are paramount challenges arising from multicenter clinical studies where multiple data sources are stored in a distributed fashion at different local study sites. Particularly in the case of time-to-event analysis when…

Applications · Statistics 2024-09-10 Mengtong Hu , Xu Shi , Peter X. -K. Song

Quality Estimation (QE) models evaluate the quality of machine translations without reference translations, serving as the reward models for the translation task. Due to the data scarcity, synthetic data generation has emerged as a…

Computation and Language · Computer Science 2025-06-19 Xiang Geng , Zhejian Lai , Jiajun Chen , Hao Yang , Shujian Huang

Decentralized data sources are prevalent in real-world applications, posing a formidable challenge for causal inference. These sources cannot be consolidated into a single entity owing to privacy constraints. The presence of dissimilar data…

Machine Learning · Computer Science 2024-05-31 Thanh Vinh Vo , Young lee , Tze-Yun Leong

With the development of big data and machine learning, privacy concerns have become increasingly critical, especially when handling heterogeneous datasets containing sensitive personal information. Differential privacy provides a rigorous…

Machine Learning · Statistics 2025-08-08 Ziliang Shen , Caixing Wang , Shaoli Wang , Yibo Yan

Model quantization, which aims to compress deep neural networks and accelerate inference speed, has greatly facilitated the development of cumbersome models on mobile and edge devices. There is a common assumption in quantization methods…

Computer Vision and Pattern Recognition · Computer Science 2023-09-26 Yuzhang Shang , Bingxin Xu , Gaowen Liu , Ramana Kompella , Yan Yan

In observational studies, the causal effect of a treatment may be confounded with variables that are related to both the treatment and the outcome of interest. In order to identify a causal effect, such studies often rely on the…

Methodology · Statistics 2017-10-17 Emma Persson , Jenny Häggström , Ingeborg Waernbaum , Xavier de Luna

The increased availability of massive data sets provides a unique opportunity to discover subtle patterns in their distributions, but also imposes overwhelming computational challenges. To fully utilize the information contained in big…

Statistics Theory · Mathematics 2018-04-12 Stanislav Volgushev , Shih-Kang Chao , Guang Cheng
‹ Prev 1 2 3 10 Next ›