Related papers: Decentralized Nonparametric Multiple Testing

200 papers

Parallel and distributed processing is becoming the de facto industry standard, and a large part of current research targets how to make computing scalable and distributed, dynamically, without allocating the resources on…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-04-10 Rajendra Purohit, K R Chowdhary, S D Purohit

We consider the nonparametric sequential hypothesis testing problem when the distribution under the null hypothesis is fully known but the alternative hypothesis corresponds to some other unknown distribution with some loose constraints. We…

Information Theory · Computer Science 2013-11-15 Shouvik Ganguly, K Sahasranand, Vinod Sharma

The problem of simultaneously testing the marginal distributions of sequentially monitored, independent data streams is considered. The decisions for the various testing problems can be made at different times, using data from all streams,…

Methodology · Statistics 2023-04-21 Yiming Xing, Georgios Fellouris

A key motivation in the development of Distributed Model Predictive Control (DMPC) is to accelerate centralized Model Predictive Control (MPC) for large-scale systems. DMPC has the prospect of scaling well by parallelizing computations…

Optimization and Control · Mathematics 2025-04-16 Gösta Stomberg, Maurice Raetsch, Alexander Engelmann, Timm Faulwasser

It is frequently of interest to jointly analyze multiple sequences of multiple tests in order to identify simultaneous signals, defined as features tested in multiple studies whose test statistics are non-null in each. In many problems,…

Methodology · Statistics 2019-01-16 Sihai Dave Zhao, Yet Tien Nguyen

We consider the nonparametric or universal sequential hypothesis testing problem when the distribution under the null hypothesis is fully known but the alternative hypothesis corresponds to some other unknown distribution. These algorithms are…

Information Theory · Computer Science 2013-08-30 Jithin K. Sreedharan, Vinod Sharma

Test of independence is of fundamental importance in modern data analysis, with broad applications in variable selection, graphical models, and causal inference. When the data is high dimensional and the potential dependence signal is…

Methodology · Statistics 2023-06-13 Zhanrui Cai, Jing Lei, Kathryn Roeder

We consider statistical procedures for hypothesis testing of real valued functionals of matched pairs with missing values. In order to improve the accuracy of existing methods, we propose a novel multiplication combination procedure.…

Statistics Theory · Mathematics 2018-01-29 Lubna Amro, Frank Konietschke, Markus Pauly

Testing the equality in distributions of multiple samples is a common task in many fields. However, this problem for high-dimensional or non-Euclidean data has not been well explored. In this paper, we propose new nonparametric tests based…

Methodology · Statistics 2022-05-30 Hoseung Song, Hao Chen

Due to rapid data growth, statistical analysis of massive datasets often has to be carried out in a distributed fashion, either because several datasets stored in separate physical locations are all relevant to a given problem, or simply to…

Computation · Statistics 2016-02-08 Matthias Katzfuss, Dorit Hammerling

Conformalized multiple testing offers a model-free way to control predictive uncertainty in decision-making. Existing methods typically use only part of the available data to build score functions tailored to specific settings. We propose a…

Methodology · Statistics 2026-05-22 Yuyang Huo, Xiaoyang Wu, Changliang Zou, Haojie Ren

A large amount of data is often required to train and deploy useful machine learning models in industry. Smaller enterprises do not have the luxury of accessing enough data for machine learning. For privacy-sensitive fields such as banking,…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-09-05 Felix Ongati, Eng. Lawrence Muchemi

Identifying dependency in multivariate data is a common inference task that arises in numerous applications. However, existing nonparametric independence tests typically require computation that scales at least quadratically with the sample…

Methodology · Statistics 2021-07-08 Shai Gorsky, Li Ma

In this article, we consider the problem of simultaneous testing of hypotheses when the individual test statistics are not necessarily independent. Specifically, we consider the problem of simultaneous testing of point null hypotheses…

Statistics Theory · Mathematics 2018-07-17 Prasenjit Ghosh, Arijit Chakrabarti

Large-scale multiple testing under static factor models is widely used to detect sparse signals in high-dimensional data. However, static factor models are arguably too stringent because they ignore serial correlation, which seriously…

Statistics Theory · Mathematics 2025-04-04 Xinxin Yang, Lilun Du

Testing independence among a number of (ultra) high-dimensional random samples is a fundamental and challenging problem. By arranging $n$ identically distributed $p$-dimensional random vectors into a $p \times n$ data matrix, we investigate…

Statistics Theory · Mathematics 2017-03-28 Xi Chen, Weidong Liu

We investigate distributed memory parallel sorting algorithms that scale to the largest available machines and are robust with respect to input size and distribution of the input elements. The main outcome is that four sorting algorithms…

Distributed, Parallel, and Cluster Computing · Computer Science 2020-01-17 Michael Axtmann, Peter Sanders

In this paper, we propose a data collaboration analysis method for distributed datasets. The proposed method performs centralized machine learning while the training datasets and models remain distributed across several institutions. Recently, data…

Machine Learning · Computer Science 2019-02-21 Akira Imakura, Tetsuya Sakurai

This paper considers the problem of multi-sample nonparametric comparison of counting processes with panel count data, which arise naturally when recurrent events are considered. Such data frequently occur in medical follow-up studies and…

Statistics Theory · Mathematics 2009-04-21 N. Balakrishnan, Xingqiu Zhao

Data is a precious resource in today's society, and is generated at an unprecedented and constantly growing pace. The need to store, analyze, and make data promptly available to a multitude of users introduces formidable challenges in…

Distributed, Parallel, and Cluster Computing · Computer Science 2023-06-08 Alessandro Margara, Gianpaolo Cugola, Nicolò Felicioni, Stefano Cilloni