Related papers: Mitigating dimensionality effects with robust grap…

A Robust Framework for Graph-based Two-Sample Tests Using Weights

Graph-based tests are a class of non-parametric two-sample tests useful for analyzing high-dimensional data. The test statistics are constructed from similarity graphs (such as K-minimum spanning tree), and consequently, their performance…

Methodology · Statistics 2025-06-23 Yichuan Bai, Lynna Chu

Limiting distributions of graph-based test statistics on sparse and dense graphs

Two-sample tests utilizing a similarity graph on observations are useful for high-dimensional and non-Euclidean data due to their flexibility and good performance under a wide range of alternatives. Existing works mainly focused on sparse…

Statistics Theory · Mathematics 2023-11-14 Yejiong Zhu, Hao Chen

On high-dimensional modifications of some graph-based two-sample tests

Testing for the equality of two high-dimensional distributions is a challenging problem, and this becomes even more challenging when the sample size is small. Over the last few decades, several graph-based two-sample tests have been…

Methodology · Statistics 2019-11-22 Soham Sarkar, Rahul Biswas, Anil K. Ghosh

More power via graph-structured tests for differential expression of gene networks

We consider multivariate two-sample tests of means, where the location shift between the two populations is expected to be related to a known graph structure. An important application of such tests is the detection of differentially…

Applications · Statistics 2012-07-02 Laurent Jacob, Pierre Neuvial, Sandrine Dudoit

Graph-Based Two-Sample Tests for Data with Repeated Observations

In the regime of two-sample comparison, tests based on a graph constructed on observations by utilizing similarity information among them are gaining attention due to their flexibility and good performance for high-dimensional/non-Euclidean…

Methodology · Statistics 2019-02-13 Jingru Zhang, Hao Chen

Robust Graph Structure Learning via Multiple Statistical Tests

Graph structure learning aims to learn connectivity in a graph from data. It is particularly important for many computer vision related tasks since no explicit graph structure is available for images for most cases. A natural way to…

Computer Vision and Pattern Recognition · Computer Science 2022-12-26 Yaohua Wang, FangYi Zhang, Ming Lin, Senzhang Wang, Xiuyu Sun, Rong Jin

A Brief Survey on Representation Learning based Graph Dimensionality Reduction Techniques

Dimensionality reduction techniques map data represented on higher dimensions onto lower dimensions with varying degrees of information loss. Graph dimensionality reduction techniques adopt the same principle of providing latent…

Machine Learning · Computer Science 2022-11-11 Akhil Pandey Akella

New graph-based multi-sample tests for high-dimensional and non-Euclidean data

Testing the equality in distributions of multiple samples is a common task in many fields. However, this problem for high-dimensional or non-Euclidean data has not been well explored. In this paper, we propose new nonparametric tests based…

Methodology · Statistics 2022-05-30 Hoseung Song, Hao Chen

Gains in Power from Structured Two-Sample Tests of Means on Graphs

We consider multivariate two-sample tests of means, where the location shift between the two populations is expected to be related to a known graph structure. An important application of such tests is the detection of differentially…

Quantitative Methods · Quantitative Biology 2014-05-16 Laurent Jacob, Pierre Neuvial, Sandrine Dudoit

Graph Vertex Embeddings: Distance, Regularization and Community Detection

Graph embeddings have emerged as a powerful tool for representing complex network structures in a low-dimensional space, enabling the use of efficient methods that employ the metric structure in the embedding space as a proxy for the…

Social and Information Networks · Computer Science 2024-04-18 Radosław Nowak, Adam Małkowski, Daniel Cieślak, Piotr Sokół, Paweł Wawrzyński

Evaluating Robustness and Uncertainty of Graph Models Under Structural Distributional Shifts

In reliable decision-making systems based on machine learning, models have to be robust to distributional shifts or provide the uncertainty of their predictions. In node-level problems of graph learning, distributional shifts can be…

Machine Learning · Computer Science 2023-11-02 Gleb Bazhenov, Denis Kuznedelev, Andrey Malinin, Artem Babenko, Liudmila Prokhorenkova

A new ranking scheme for modern data and its application to two-sample hypothesis testing

Rank-based approaches are among the most popular nonparametric methods for univariate data in tackling statistical problems such as hypothesis testing due to their robustness and effectiveness. However, they are unsatisfactory for more…

Methodology · Statistics 2023-07-04 Doudou Zhou, Hao Chen

Feature Selection in High-dimensional Spaces Using Graph-Based Methods

High-dimensional feature selection is a central problem in a variety of application domains such as machine learning, image analysis, and genomics. In this paper, we propose graph-based tests as a useful basis for feature selection. We…

Methodology · Statistics 2024-08-13 Swarnadip Ghosh, Somabha Mukherjee, Divyansh Agarwal, Yichen He, Mingzhi Song, Xuejiao Pei

Concise Fuzzy Planar Embedding of Graphs: a Dimensionality Reduction Approach

The enormous amount of data to be represented using large graphs exceeds in some cases the resources of a conventional computer. Edges in particular can take up a considerable amount of memory as compared to the number of nodes. However,…

Artificial Intelligence · Computer Science 2023-12-18 Faisal N. Abu-Khzam, Rana H. Mouawi, Amer Hajj Ahmad, Sergio Thoumi

Unbiased Graph Embedding with Biased Graph Observations

Graph embedding techniques are pivotal in real-world machine learning tasks that operate on graph-structured data, such as social recommendation and protein structure modeling. Embeddings are mostly performed on the node level for learning…

Machine Learning · Computer Science 2022-04-26 Nan Wang, Lu Lin, Jundong Li, Hongning Wang

Practical methods for graph two-sample testing

Hypothesis testing for graphs has been an important tool in applied research fields for more than two decades, and still remains a challenging problem as one often needs to draw inference from few replicates of large graphs. Recent studies…

Machine Learning · Statistics 2018-12-03 Debarghya Ghoshdastidar, Ulrike von Luxburg

Graph-based Change Point Detection for Functional Data

Modeling functions that are sequentially observed as functional time series is becoming increasingly common. In such models, it is often crucial to ensure data homogeneity. We investigate the sensitivity of graph-based change point…

Methodology · Statistics 2025-03-25 Jeremy VanderDoes, Shojaeddin Chenouri

A new graph-based two-sample test for multivariate and object data

Two-sample tests for multivariate data and especially for non-Euclidean data are not well explored. This paper presents a novel test statistic based on a similarity graph constructed on the pooled observations from the two samples. It can…

Methodology · Statistics 2024-08-12 Hao Chen, Jerome H. Friedman

Towards Robust Graph Structural Learning Beyond Homophily via Preserving Neighbor Similarity

Despite the tremendous success of graph-based learning systems in handling structural data, it has been widely investigated that they are fragile to adversarial attacks on homophilic graph data, where adversaries maliciously modify the…

Machine Learning · Computer Science 2025-09-05 Yulin Zhu, Yuni Lai, Xing Ai, Wai Lun LO, Gaolei Li, Jianhua Li, Di Tang, Xingxing Zhang, Mengpei Yang, Kai Zhou

When Dimensionality Reduction Meets Graph (Drawing) Theory: Introducing a Common Framework, Challenges and Opportunities

In the vast landscape of visualization research, Dimensionality Reduction (DR) and graph analysis are two popular subfields, often essential to most visual data analytics setups. DR aims to create representations to support neighborhood and…

Machine Learning · Computer Science 2024-12-10 Fernando Paulovich, Alessio Arleo, Stef van den Elzen