Related papers: Deep Learning for Efficient GWAS Feature Selection

Multi-Disease Deep Learning Framework for GWAS: Beyond Feature Selection Constraints

Traditional GWAS has advanced our understanding of complex diseases but often misses nonlinear genetic interactions. Deep learning offers new opportunities to capture complex genomic patterns, yet existing methods mostly depend on feature…

Machine Learning · Computer Science 2025-07-08 Iqra Farooq , Sara Atito , Ayse Demirkan , Inga Prokopenko , Muhammad Rana

Training Flexible Models of Genetic Variant Effects from Functional Annotations using Accelerated Linear Algebra

To understand how genetic variants in human genomes manifest in phenotypes -- traits like height or diseases like asthma -- geneticists have sequenced and measured hundreds of thousands of individuals. Geneticists use this data to build…

Machine Learning · Computer Science 2025-07-01 Alan N. Amin , Andres Potapczynski , Andrew Gordon Wilson

Genetic heterogeneity analysis using genetic algorithm and network science

Through genome-wide association studies (GWAS), disease susceptible genetic variables can be identified by comparing the genetic data of individuals with and without a specific disease. However, the discovery of these associations poses a…

Machine Learning · Computer Science 2023-08-15 Zhendong Sha , Yuanzhu Chen , Ting Hu

Iterative Hard Thresholding for Model Selection in Genome-Wide Association Studies

A genome-wide association study (GWAS) correlates marker variation with trait variation in a sample of individuals. Each study subject is genotyped at a multitude of SNPs (single nucleotide polymorphisms) spanning the genome. Here we assume…

Machine Learning · Statistics 2019-01-14 Kevin L. Keys , Gary K. Chen , Kenneth Lange

Utilising Deep Learning and Genome Wide Association Studies for Epistatic-Driven Preterm Birth Classification in African-American Women

Genome Wide Association Studies (GWAS) are used to identify statistically significant genetic variants in case-control studies. GWAS typically use a p-value threshold of 5 x 10-8 to identify highly ranked single nucleotide polymorphisms…

Computational Engineering, Finance, and Science · Computer Science 2018-01-10 Paul Fergus , Casimiro Curbelo Montanez , Basma Abdulaimma , Paulo Lisboa , Carl Chalmers

Deep Learning Classification of Polygenic Obesity using Genome Wide Association Study SNPs

In this paper, association results from genome-wide association studies (GWAS) are combined with a deep learning framework to test the predictive capacity of statistically significant single nucleotide polymorphism (SNPs) associated with…

Computers and Society · Computer Science 2018-08-27 Casimiro Adays Curbelo Montañez , Paul Fergus , Almudena Curbelo Montañez , Carl Chalmers

Gene and Gene-Set Analysis for Genome-Wide Association Studies

Genome-wide association studies (GWAS) have identified hundreds of loci at very stringent levels of statistical significance across many different human traits. However, it is now clear that very large samples (n~10^4-10^5) are needed to…

Genomics · Quantitative Biology 2013-08-20 Inti Pedroso

A Hybrid Two-layer Feature Selection Method Using GeneticAlgorithm and Elastic Net

Feature selection, as a critical pre-processing step for machine learning, aims at determining representative predictors from a high-dimensional feature space dataset to improve the prediction accuracy. However, the increase in feature…

Machine Learning · Statistics 2020-11-16 Fatemeh Amini , Guiping Hu

Recent Advances in Network-based Methods for Disease Gene Prediction

Disease-gene association through Genome-wide association study (GWAS) is an arduous task for researchers. Investigating single nucleotide polymorphisms (SNPs) that correlate with specific diseases needs statistical analysis of associations.…

Quantitative Methods · Quantitative Biology 2020-12-21 Sezin Kircali Ata , Min Wu , Yuan Fang , Le Ou-Yang , Chee Keong Kwoh , Xiao-Li Li

A model selection approach to genome wide association studies

For the vast majority of genome wide association studies (GWAS) published so far, statistical analysis was performed by testing markers individually. In this article we present some elementary statistical considerations which clearly show…

Applications · Statistics 2010-10-04 Florian Frommlet , Felix Ruhaltinger , Piotr Twarog , Malgorzata Bogdan

Deep Feature Screening: Feature Selection for Ultra High-Dimensional Data via Deep Neural Networks

The applications of traditional statistical feature selection methods to high-dimension, low sample-size data often struggle and encounter challenging problems, such as overfitting, curse of dimensionality, computational infeasibility, and…

Machine Learning · Statistics 2023-12-19 Kexuan Li , Fangfang Wang , Lingli Yang , Ruiqi Liu

Bayesian group Lasso for nonparametric varying-coefficient models with application to functional genome-wide association studies

Although genome-wide association studies (GWAS) have proven powerful for comprehending the genetic architecture of complex traits, they are challenged by a high dimension of single-nucleotide polymorphisms (SNPs) as predictors, the presence…

Applications · Statistics 2015-09-15 Jiahan Li , Zhong Wang , Runze Li , Rongling Wu

Analysis of Extremely Obese Individuals Using Deep Learning Stacked Autoencoders and Genome-Wide Genetic Data

The aetiology of polygenic obesity is multifactorial, which indicates that life-style and environmental factors may influence multiples genes to aggravate this disorder. Several low-risk single nucleotide polymorphisms (SNPs) have been…

Genomics · Quantitative Biology 2018-08-27 Casimiro A. Curbelo Montañez , Paul Fergus , Carl Chalmers , Jade Hind

A graph-embedded deep feedforward network for disease outcome classification and feature selection using gene expression data

Gene expression data represents a unique challenge in predictive model building, because of the small number of samples $(n)$ compared to the huge amount of features $(p)$. This "$n<<p$" property has hampered application of deep learning…

Machine Learning · Statistics 2018-02-13 Yunchuan Kong , Tianwei Yu

Large-scale Feature Selection of Risk Genetic Factors for Alzheimer's Disease via Distributed Group Lasso Regression

Genome-wide association studies (GWAS) have achieved great success in the genetic study of Alzheimer's disease (AD). Collaborative imaging genetics studies across different research institutions show the effectiveness of detecting genetic…

Machine Learning · Computer Science 2017-04-28 Qingyang Li , Dajiang Zhu , Jie Zhang , Derrek Paul Hibar , Neda Jahanshad , Yalin Wang , Jieping Ye , Paul M. Thompson , Jie Wang

Learning the optimal scale for GWAS through hierarchical SNP aggregation

Motivation: Genome-Wide Association Studies (GWAS) seek to identify causal genomic variants associated with rare human diseases. The classical statistical approach for detecting these variants is based on univariate hypothesis testing, with…

Methodology · Statistics 2018-10-22 Florent Guinot , Marie Szafranski , Christophe Ambroise , Franck Samson

Genetic studies through the lens of gene networks

Understanding the genetic basis of complex traits is a longstanding challenge in the field of genomics. Genome-wide association studies (GWAS) have identified thousands of variant-trait associations, but most of these variants are located…

Molecular Networks · Quantitative Biology 2024-11-01 Marc Subirana-Granés , Jill Hoffman , Haoyu Zhang , Christina Akirtava , Sutanu Nandi , Kevin Fotso , Milton Pividori

FsNet: Feature Selection Network on High-dimensional Biological Data

Biological data including gene expression data are generally high-dimensional and require efficient, generalizable, and scalable machine-learning methods to discover their complex nonlinear patterns. The recent advances in machine learning…

Machine Learning · Computer Science 2020-12-21 Dinesh Singh , Héctor Climente-González , Mathis Petrovich , Eiryo Kawakami , Makoto Yamada

Deep Feature Selection using a Teacher-Student Network

High-dimensional data in many machine learning applications leads to computational and analytical complexities. Feature selection provides an effective way for solving these problems by removing irrelevant and redundant features, thus…

Machine Learning · Computer Science 2019-03-19 Ali Mirzaei , Vahid Pourahmadi , Mehran Soltani , Hamid Sheikhzadeh

Deep Learning and genetic algorithms for cosmological Bayesian inference speed-up

In this paper, we present a novel approach to accelerate the Bayesian inference process, focusing specifically on the nested sampling algorithms. Bayesian inference plays a crucial role in cosmological parameter estimation, providing a…

Instrumentation and Methods for Astrophysics · Physics 2024-10-17 Isidro Gómez-Vargas , J. Alberto Vázquez