English
Related papers

Related papers: Simulating High-Dimensional Multivariate Data usin…

200 papers

Simulated high-dimensional data is useful for testing, validating, and improving algorithms used in dimension reduction, supervised and unsupervised learning. High-dimensional data is characterized by multiple variables that are dependent…

High dimensional correlated binary data arise in many areas, such as observed genetic variations in biomedical research. Data simulation can help researchers evaluate efficiency and explore properties of different computational and…

Methodology · Statistics 2020-07-29 Wei Jiang , Shuang Song , Lin Hou , Hongyu Zhao

Nonlinear dimension reduction methods provide a low-dimensional representation of high-dimensional data by applying a Nonlinear transformation. However, the complexity of the transformations and data structures can create wildly different…

Repeated-measure designs allow comparisons within a group as well as between groups, and are commonly referred to as split-plot designs. While originating in agricultural experiments, they are now widely used in medical research,…

Computation · Statistics 2025-12-22 Paavo Sattler , Nils Hichert

Correlation among the observations in high-dimensional regression modeling can be a major source of confounding. We present a new open-source package, plmmr, to implement penalized linear mixed models in R. This R package estimates…

Computation · Statistics 2026-05-13 Tabitha K. Peter , Anna C. Reisetter , Yujing Lu , Oscar A. Rysavy , Patrick J. Breheny

The R package BigVAR allows for the simultaneous estimation of high-dimensional time series by applying structured penalties to the conventional vector autoregression (VAR) and vector autoregression with exogenous variables (VARX)…

Computation · Statistics 2017-02-24 William Nicholson , David Matteson , Jacob Bien

The past decade has witnessed a dramatic increase in the size and scope of biological and behavioral experiments. These experiments are providing an unprecedented level of detail and depth of data. However, this increase in data presents…

Quantitative Methods · Quantitative Biology 2014-04-03 Samuel V. Scarpino , Ross Gillette , David Crews

Generating artificial data is a crucial step when performing Monte-Carlo simulation studies. Depending on the planned study, complex data generation processes (DGP) containing multiple, possibly time-varying, variables with various forms of…

Methodology · Statistics 2025-06-03 Robin Denz , Nina Timmesfeld

In molecular biology, advances in high-throughput technologies have made it possible to study complex multivariate phenotypes and their simultaneous associations with high-dimensional genomic and other omics data, a problem that can be…

Methodology · Statistics 2021-12-02 Zhi Zhao , Marco Banterle , Leonardo Bottolo , Sylvia Richardson , Alex Lewin , Manuela Zucknick

The numerical availability of statistical inference methods for a modern and robust analysis of longitudinal- and multivariate data in factorial experiments is an essential element in research and education. While existing approaches that…

Computation · Statistics 2018-01-25 Sarah Friedrich , Frank Konietschke , Markus Pauly

BHAM is a freely avaible R pakcage that implments Bayesian hierarchical additive models for high-dimensional clinical and genomic data. The package includes functions that generalized additive model, and Cox additive model with the…

Computation · Statistics 2022-07-07 Boyi Guo , Nengjun Yi

The package High-dimensional Metrics (\Rpackage{hdm}) is an evolving collection of statistical methods for estimation and quantification of uncertainty in high-dimensional approximately sparse models. It focuses on providing confidence…

Machine Learning · Statistics 2017-09-28 Victor Chernozhukov , Chris Hansen , Martin Spindler

It is shown how to set up, conduct, and analyze large simulation studies with the new R package simsalapar = simulations simplified and launched parallel. A simulation study typically starts with determining a collection of input variables…

Computation · Statistics 2013-09-18 Marius Hofert , Martin Mächler

Recent developments in data science and big data research have produced an abundance of large data sets that are too big to be analyzed in their entirety, due to limits on either computer memory or storage capacity. Here, we introduce our R…

Applications · Statistics 2015-04-27 Alexey Miroshnikov , Evgeny Savel'ev , Erin M. Conlon

Estimating sample size and statistical power is an essential part of a good study design. This R package allows users to conduct power analysis based on Monte Carlo simulations in settings in which consideration of the correlations between…

Methodology · Statistics 2024-04-16 Phuc H. Nguyen , Stephanie M. Engel , Amy H. Herring

Due to the increasing availability of high-dimensional empirical applications in many research disciplines, valid simultaneous inference becomes more and more important. For instance, high-dimensional settings might arise in economic…

Econometrics · Economics 2018-09-14 Philipp Bach , Victor Chernozhukov , Martin Spindler

The paper considers variable selection in linear regression models where the number of covariates is possibly much larger than the number of observations. High dimensionality of the data brings in many complications, such as (possibly…

Methodology · Statistics 2016-11-29 Haeran Cho , Piotr Fryzlewicz

Hyperspectral remote sensing is a promising tool for a variety of applications including ecology, geology, analytical chemistry and medical research. This article presents the new \hsdar package for R statistical software, which performs a…

Other Statistics · Statistics 2019-05-28 Lukas W. Lehnert , Hanna Meyer , Wolfgang A. Obermeier , Brenner Silva , Bianca Regeling , Jörg Bendix

We describe an R package named huge which provides easy-to-use functions for estimating high dimensional undirected graphs from data. This package implements recent results in the literature, including Friedman et al. (2007), Liu et al.…

Machine Learning · Statistics 2020-06-29 Tuo Zhao , Han Liu , Kathryn Roeder , John Lafferty , Larry Wasserman

Recent advances in big data and analytics research have provided a wealth of large data sets that are too big to be analyzed in their entirety, due to restrictions on computer memory or storage size. New Bayesian methods have been developed…

Applications · Statistics 2014-09-30 Alexey Miroshnikov , Erin Conlon
‹ Prev 1 2 3 10 Next ›