cardinalR: Generating Interesting High-Dimensional Data Structures

Jayani P. Gamage; Dianne Cook; Paul Harrison; Michael Lydeamore; Thiyanga S. Talagala

cardinalR: Generating Interesting High-Dimensional Data Structures

Methodology 2025-12-23 v1 Applications

Authors: Jayani P. Gamage , Dianne Cook , Paul Harrison , Michael Lydeamore , Thiyanga S. Talagala

Abstract

Simulated high-dimensional data is useful for testing, validating, and improving algorithms used in dimension reduction, supervised and unsupervised learning. High-dimensional data is characterized by multiple variables that are dependent or associated in some way, such as linear, nonlinear, clustering or anomalies. Here we provide new methods for generating a variety of high-dimensional structures using mathematical functions and statistical distributions organized into the R package cardinalR. Several example data sets are also provided. These will be useful for researchers to better understand how different analytical methods work and can be improved, with a special focus on nonlinear dimension reduction methods. This package enriches the existing toolset of benchmark datasets for evaluating algorithms.

Keywords

sufficient dimension reduction statistical data analysis statistical software and packages

Cite

@article{arxiv.2512.18172,
  title  = {cardinalR: Generating Interesting High-Dimensional Data Structures},
  author = {Jayani P. Gamage and Dianne Cook and Paul Harrison and Michael Lydeamore and Thiyanga S. Talagala},
  journal= {arXiv preprint arXiv:2512.18172},
  year   = {2025}
}

cardinalR: Generating Interesting High-Dimensional Data Structures

Abstract

Keywords

Cite

Related papers