English
Related papers

Related papers: Linear-Complexity Exponentially-Consistent Tests f…

200 papers

Outlier hypothesis testing is studied in a universal setting. Multiple sequences of observations are collected, a small subset of which are outliers. A sequence is considered an outlier if the observations in that sequence are distributed…

Information Theory · Computer Science 2014-04-02 Yun Li , Sirin Nitinawarat , Venugopal V. Veeravalli

Universal outlier hypothesis testing is studied in a sequential setting. Multiple observation sequences are collected, a small subset of which are outliers. A sequence is considered an outlier if the observations in that sequence are…

Statistics Theory · Mathematics 2014-11-27 Yun Li , Sirin Nitinawarat , Venugopal V. Veeravalli

The following detection problem is studied, in which there are $M$ sequences of samples out of which one outlier sequence needs to be detected. Each typical sequence contains $n$ independent and identically distributed (i.i.d.) continuous…

Information Theory · Computer Science 2015-10-08 Yuheng Bu , Shaofeng Zou , Yingbin Liang , Venugopal V. Veeravalli

In this work, we revisit outlier hypothesis testing and propose exponentially consistent, low-complexity fixed-length tests that achieve a better tradeoff between detection performance and computational complexity than existing…

Signal Processing · Electrical Eng. & Systems 2026-01-28 Lina Zhu , Lin Zhou

In outlier hypothesis testing, one aims to detect outlying sequences among a given set of sequences, where most sequences are generated i.i.d. from a nominal distribution while outlying sequences (outliers) are generated i.i.d. from a…

Signal Processing · Electrical Eng. & Systems 2024-09-10 Lina Zhu , Lin Zhou

We revisit outlier hypothesis testing, propose exponentially consistent low complexity fixed-length and sequential tests and show that our tests achieve better tradeoff between detection performance and computational complexity than…

Information Theory · Computer Science 2026-01-09 Jun Diao , Jingjing Wang , Lin Zhou

Universal outlier hypothesis testing refers to a hypothesis testing problem where one observes a large number of length-$n$ sequences -- the majority of which are distributed according to the typical distribution $\pi$ and a small number…

Information Theory · Computer Science 2026-01-05 Bernhard C. Geiger , Tobias Koch , Josipa Mihaljević , Maximilian Toller

Outliers are ubiquitous in modern data sets. Distance-based techniques are a popular non-parametric approach to outlier detection as they require no prior assumptions on the data generating distribution and are simple to implement. Scaling…

Machine Learning · Statistics 2016-05-04 Mario Lucic , Olivier Bachem , Andreas Krause

This paper develops a flexible distribution-free method for collective outlier detection and enumeration, designed for situations in which the presence of outliers can be detected powerfully even though their precise identification may be…

Methodology · Statistics 2026-05-19 Chiara G. Magnani , Matteo Sesia , Aldo Solari

In multiple classification, one aims to determine whether a testing sequence is generated from the same distribution as one of the M training sequences or not. Unlike most of existing studies that focus on discrete-valued sequences with…

Machine Learning · Statistics 2024-10-30 Lina Zhu , Lin Zhou

Clustering has many important applications in computer science, but real-world datasets often contain outliers. Moreover, the presence of outliers can make the clustering problems to be much more challenging. To reduce the complexities,…

Data Structures and Algorithms · Computer Science 2020-05-04 Hu Ding , Jiawei Huang , Haikuo Yu

Outlier detection is an important problem occurring in a wide range of areas. Outliers are the outcome of fraudulent behaviour, mechanical faults, human error, or simply natural deviations. Many data mining applications perform outlier…

Machine Learning · Computer Science 2025-10-28 Juan A. Lara , David Lizcano , Víctor Rampérez , Javier Soriano

We study the classic $k$-means/median clustering, which are fundamental problems in unsupervised learning, in the setting where data are partitioned across multiple sites, and where we are allowed to discard a small portion of the data by…

Distributed, Parallel, and Cluster Computing · Computer Science 2018-10-12 Jiecao Chen , Erfan Sadeqi Azer , Qin Zhang

A sensor network is considered where at each sensor a sequence of random variables is observed. At each time step, a processed version of the observations is transmitted from the sensors to a common node called the fusion center. At some…

Statistics Theory · Mathematics 2023-07-19 Taposh Banerjee , Venugopal V. Veeravalli

Often the challenge associated with tasks like fraud and spam detection[1] is the lack of all likely patterns needed to train suitable supervised learning models. In order to overcome this limitation, such tasks are attempted as outlier or…

Machine Learning · Computer Science 2018-08-22 Utkarsh Porwal , Smruthi Mukund

We revisit the outlier hypothesis testing framework of Li \emph{et al.} (TIT 2014) and derive fundamental limits for the optimal test under the generalized Neyman-Pearson criterion. In outlier hypothesis testing, one is given multiple…

Information Theory · Computer Science 2022-02-15 Lin Zhou , Yun Wei , Alfred Hero

We revisit sequential outlier hypothesis testing and derive bounds on achievable exponents when both the nominal and anomalous distributions are unknown. The task of outlier hypothesis testing is to identify the set of outliers that are…

Information Theory · Computer Science 2025-04-24 Jun Diao , Lin Zhou

In the binary hypothesis testing problem, it is well known that sequentiality in taking samples eradicates the trade-off between two error exponents, yet implementing the optimal test requires the knowledge of the underlying distributions,…

Information Theory · Computer Science 2025-01-07 Ching-Fang Li , I-Hsiang Wang

There is an increasing interest in algorithms to learn invariant correlations across training environments. A big share of the current proposals find theoretical support in the causality literature but, how useful are they in practice? The…

Machine Learning · Computer Science 2021-02-23 Benjamin Aubin , Agnieszka Słowik , Martin Arjovsky , Leon Bottou , David Lopez-Paz

Real-world datasets often contain outliers, and the presence of outliers can make the clustering problems to be much more challenging. In this paper, we propose a simple uniform sampling framework for solving three representative…

Machine Learning · Computer Science 2023-10-04 Jiawei Huang , Wenjie Liu , Hu Ding
‹ Prev 1 2 3 10 Next ›