Related papers: Bayesian Multi-study Factor Analysis for High-thro…

200 papers

Most previous works and applications of Bayesian factor models have assumed a normal likelihood regardless of its validity. We propose a Bayesian factor model for heavy-tailed high-dimensional data based on multivariate Student-$t$…

Methodology · Statistics 2020-12-10 Jaejoon Lee , Jaeyong Lee

We present an applied study in cancer genomics for integrating data and inferences from laboratory experiments on cancer cell lines with observational data obtained from human breast cancer studies. The biological focus is on improving…

Applications · Statistics 2010-10-07 Daniel Merl , Julia Ling-Yu Chen , Jen-Tsan Chi , Mike West

We introduce a novel class of factor analysis methodologies for the joint analysis of multiple studies. The goal is to separately identify and estimate 1) common factors shared across multiple studies, and 2) study-specific factors. We…

Applications · Statistics 2018-06-27 Roberta De Vito , Ruggero Bellio , Lorenzo Trippa , Giovanni Parmigiani

Motivation: Modelling methods that find structure in data are necessary with the current large volumes of genomic data, and there have been various efforts to find subsets of genes exhibiting consistent patterns over subsets of treatments.…

Machine Learning · Computer Science 2016-09-15 Kerstin Bunte , Eemeli Leppäaho , Inka Saarinen , Samuel Kaski

The task of clustering a set of objects based on multiple sources of data arises in several modern applications. We propose an integrative statistical model that permits a separate clustering of the objects for each data source. These…

Machine Learning · Statistics 2015-12-01 Eric F. Lock , David B. Dunson

Analyzing multiple studies allows leveraging data from a range of sources and populations, but until recently, there have been limited methodologies to approach the joint unsupervised analysis of multiple high-dimensional studies. A recent…

Methodology · Statistics 2020-07-27 Isabella N. Grabski , Roberta De Vito , Lorenzo Trippa , Giovanni Parmigiani

High-dimensional data are crucial in biomedical research. Integrating such data from multiple studies is a critical process that relies on the choice of advanced statistical models, enhancing statistical power, reproducibility, and…

Applications · Statistics 2025-06-24 Mavis Liang , Blake Hansen , Alejandra Avalos-Pacheco , Roberta De Vito

Recent advances in engineering technologies have enabled the collection of a large number of longitudinal features. This wealth of information presents unique opportunities for researchers to investigate the complex nature of diseases and…

Methodology · Statistics 2023-11-27 Zihang Lu , Noirrit Kiran Chandra

Factor models are routinely used to analyze high-dimensional data in both single-study and multi-study settings. Bayesian inference for such models relies on Markov Chain Monte Carlo (MCMC) methods which scale poorly as the number of…

Methodology · Statistics 2025-04-29 Blake Hansen , Alejandra Avalos-Pacheco , Massimiliano Russo , Roberta De Vito

Recent work on overfitting Bayesian mixtures of distributions offers a powerful framework for clustering multivariate data using a latent Gaussian model which resembles the factor analysis model. The flexibility provided by overfitting…

Methodology · Statistics 2019-08-29 Panagiotis Papastamoulis

In computational biology, gene expression datasets are characterized by very few individual samples compared to a large number of measurements per sample. Thus, it is appealing to merge these datasets in order to increase the number of…

Methodology · Statistics 2011-08-18 Meili Baragatti

Bi-clustering is a useful approach in analyzing biological data when observations come from heterogeneous groups and have a large number of features. We outline a general Bayesian approach in tackling bi-clustering problems in moderate to…

Applications · Statistics 2021-02-11 Han Yan , Jiexing Wu , Yang Li , Jun S. Liu

Bayesian sparse factor models have proven useful for characterizing dependence in multivariate data, but scaling computation to large numbers of samples and dimensions is problematic. We propose expandable factor analysis for scalable…

Methodology · Statistics 2018-06-21 Sanvesh Srivastava , Barbara E. Engelhardt , David B. Dunson

High-throughput scientific studies involving no clear a priori hypothesis are common. For example, a large-scale genomic study of a disease may examine thousands of genes without hypothesizing that any specific gene is responsible for the…

Methodology · Statistics 2012-03-02 Babak Shahbaba

Variable selection is crucial in high-dimensional omics-based analyses, since it is biologically reasonable to assume only a subset of non-noisy features contributes to the data structures. However, the task is particularly hard in an…

Methodology · Statistics 2022-03-22 Emilie Eliseussen , Thomas Fleischer , Valeria Vitelli

In molecular biology, advances in high-throughput technologies have made it possible to study complex multivariate phenotypes and their simultaneous associations with high-dimensional genomic and other omics data, a problem that can be…

Methodology · Statistics 2021-12-02 Zhi Zhao , Marco Banterle , Leonardo Bottolo , Sylvia Richardson , Alex Lewin , Manuela Zucknick

Large-scale longitudinal molecular profiling is now firmly established in biomedical research, prompted by the need to uncover coordinated biomarker trajectories reflecting the dynamics of underlying biological mechanisms and characterise…

Methodology · Statistics 2026-03-24 Salima Jaoua , Daniel Temko , Hélène Ruffieux

This paper proposes a hierarchical Bayesian multitask learning model that is applicable to the general multi-task binary classification learning problem where the model assumes a shared sparsity structure across different tasks. We derive a…

Background: Many mathematical models have now been employed across every area of systems biology. These models increasingly involve large numbers of unknown parameters, have complex structure which can result in substantial evaluation time…

Molecular Networks · Quantitative Biology 2018-01-15 Ian Vernon , Junli Liu , Michael Goldstein , James Rowe , Jen Topping , Keith Lindsey

The features in high dimensional biomedical prediction problems are often well described with lower dimensional manifolds. An example is genes that are organised in smaller functional networks. The outcome can then be described with the…
