English
Related papers

Related papers: Robust Testing and Estimation under Manipulation A…

200 papers

We study the problem of robust mean estimation and introduce a novel Hamming distance-based measure of distribution shift for coordinate-level corruptions. We show that this measure yields adversary models that capture more realistic…

Machine Learning · Computer Science 2021-06-14 Zifan Liu , Jongho Park , Theodoros Rekatsinas , Christos Tzamos

We consider the problem of hypothesis testing for discrete distributions. In the standard model, where we have sample access to an underlying distribution $p$, extensive research has established optimal bounds for uniformity testing,…

Machine Learning · Computer Science 2024-12-03 Maryam Aliakbarpour , Piotr Indyk , Ronitt Rubinfeld , Sandeep Silwal

We study the problem of testing discrete distributions with a focus on the high probability regime. Specifically, given samples from one or more discrete distributions, a property $\mathcal{P}$, and parameters $0< \epsilon, \delta <1$, we…

Data Structures and Algorithms · Computer Science 2020-09-15 Ilias Diakonikolas , Themis Gouleakis , Daniel M. Kane , John Peebles , Eric Price

It is of soaring demand to develop statistical analysis tools that are robust against contamination as well as preserving individual data owners' privacy. In spite of the fact that both topics host a rich body of literature, to the best of…

Statistics Theory · Mathematics 2022-09-01 Mengchu Li , Thomas B. Berrett , Yi Yu

We initiate a systematic investigation of distribution testing in the framework of algorithmic replicability. Specifically, given independent samples from a collection of probability distributions, the goal is to characterize the sample…

Machine Learning · Computer Science 2025-07-04 Ilias Diakonikolas , Jingyi Gao , Daniel Kane , Sihan Liu , Christopher Ye

We study robust mean estimation in an online and distributed scenario in the presence of adversarial data attacks. At each time step, each agent in a network receives a potentially corrupted data point, where the data points were originally…

Cryptography and Security · Computer Science 2022-09-21 Tong Yao , Shreyas Sundaram

Training models that perform well under distribution shifts is a central challenge in machine learning. In this paper, we introduce a modeling framework where, in addition to training data, we have partial structural knowledge of the…

Machine Learning · Computer Science 2021-10-28 Tobias Sutter , Andreas Krause , Daniel Kuhn

Robust estimators, like the median of a point set, are important for data analysis in the presence of outliers. We study robust estimators for locationally uncertain points with discrete distributions. That is, each point in a data set has…

Discrete Mathematics · Computer Science 2018-03-14 Kevin Buchin , Jeff M. Phillips , Pingfan Tang

Distributed diffusion is a powerful algorithm for multi-task state estimation which enables networked agents to interact with neighbors to process input data and diffuse information across the network. Compared to a centralized approach,…

Multiagent Systems · Computer Science 2020-03-27 Jiani Li , Xenofon Koutsoukos

Distributed learning paradigms, such as federated or decentralized learning, allow a collection of agents to solve global learning and optimization problems through limited local interactions. Most such strategies rely on a mixture of local…

Machine Learning · Computer Science 2023-10-27 Christian A. Schroth , Stefan Vlaski , Abdelhak M. Zoubir

Large data sets often require performing distributed statistical estimation, with a full data set split across multiple machines and limited communication between machines. To study such scenarios, we define and study some refinements of…

Information Theory · Computer Science 2014-06-24 John C. Duchi , Michael I. Jordan , Martin J. Wainwright , Yuchen Zhang

In this work, we give a novel general approach for distribution testing. We describe two techniques: our first technique gives sample-optimal testers, while our second technique gives matching sample lower bounds. As a consequence, we…

Data Structures and Algorithms · Computer Science 2016-05-10 Ilias Diakonikolas , Daniel M. Kane

Algorithmic robust statistics has traditionally focused on the contamination model where a small fraction of the samples are arbitrarily corrupted. We consider a recent contamination model that combines two kinds of corruptions: (i) small…

Data Structures and Algorithms · Computer Science 2024-10-23 Thanasis Pittas , Ankit Pensia

Distribution shifts are ubiquitous in real-world machine learning applications, posing a challenge to the generalization of models trained on one data distribution to another. We focus on scenarios where data distributions vary across…

Machine Learning · Statistics 2024-06-05 Steven Wilkins-Reeves , Xu Chen , Qi Ma , Christine Agarwal , Aude Hofleitner

We study simple binary hypothesis testing under both local differential privacy (LDP) and communication constraints. We qualify our results as either minimax optimal or instance optimal: the former hold for the set of distribution pairs…

Statistics Theory · Mathematics 2023-12-19 Ankit Pensia , Amir R. Asadi , Varun Jog , Po-Ling Loh

This paper addresses the problem of distributed resilient state estimation and control for linear time-invariant systems in the presence of malicious false data injection sensor attacks and bounded noise. We consider a system operator…

Systems and Control · Electrical Eng. & Systems 2025-07-17 Takumi Shinohara , Karl H. Johansson , Henrik Sandberg

This paper concerns the central issues of model robustness and sample efficiency in offline reinforcement learning (RL), which aims to learn to perform decision making from history data without active exploration. Due to uncertainties and…

Machine Learning · Computer Science 2024-01-01 Laixi Shi , Yuejie Chi

Cooperative geolocation has attracted significant research interests in recent years. A large number of localization algorithms rely on the availability of statistical knowledge of measurement errors, which is often difficult to obtain in…

Applications · Statistics 2017-01-05 Xiufang Shi , Guoqiang Mao , Brian. D. O. Anderson , Zaiyue Yang , Jiming Chen

We find separation rates for testing multinomial or more general discrete distributions under the constraint of local differential privacy. We construct efficient randomized algorithms and test procedures, in both the case where only…

Statistics Theory · Mathematics 2020-05-27 Thomas B. Berrett , Cristina Butucea

As the use of machine learning in high impact domains becomes widespread, the importance of evaluating safety has increased. An important aspect of this is evaluating how robust a model is to changes in setting or population, which…

Machine Learning · Computer Science 2021-03-16 Adarsh Subbaswamy , Roy Adams , Suchi Saria
‹ Prev 1 2 3 10 Next ›