Muhammad Muneeb — Scifaro

AnnotateMissense: a genome-wide annotation and benchmarking framework for missense pathogenicity prediction

Missense variant interpretation remains challenging because pathogenicity depends on heterogeneous evidence from population frequency, evolutionary conservation, transcript context, amino acid substitution severity, prior pathogenicity…

Genomics · Quantitative Biology 2026-05-26 Muhammad Muneeb, David B. Ascher

EFGPP: Exploratory framework for genotype-phenotype prediction

Predicting complex human traits from genetic data is challenging because different genetic, clinical, and molecular data sources often contain different parts of the signal. Here, we present EFGPP, a reproducible framework for generating,…

Genomics · Quantitative Biology 2026-05-06 Muhammad Muneeb, David B. Ascher

PhenotypeToGeneDownloaderR: automated multi-source retrieval and validation of phenotype-associated genes

Identifying phenotype-associated genes is a common first step in polygenic risk score construction, enrichment testing, target prioritisation and variant interpretation, but relevant evidence is distributed across heterogeneous databases…

Genomics · Quantitative Biology 2026-05-05 Muhammad Muneeb, David B. Ascher

Benchmarking end-to-end genotype-to-phenotype prediction workflows across 80 openSNP phenotypes

Genotype-to-phenotype prediction is a central goal of statistical genetics, yet practical comparisons of prediction workflows remain limited in small, heterogeneous, participant-shared genomic datasets. Here, we benchmarked end-to-end…

Genomics · Quantitative Biology 2026-05-05 Muhammad Muneeb, David B. Ascher, YooChan Myung, Samuel F. Feng, Andreas Henschel

Benchmarking Heritability Estimation Strategies Across 86 Configurations and Their Downstream Effect on Polygenic Risk Score Performance

Objective: SNP heritability estimates vary substantially across estimation strategies, yet the downstream consequences for polygenic risk score (PRS) construction remain poorly characterised. We systematically benchmarked heritability…

Genomics · Quantitative Biology 2026-04-06 Muhammad Muneeb, David B. Ascher

A harmonized benchmarking framework for implementation-aware evaluation of 46 polygenic risk score tools across binary and continuous phenotypes

Polygenic risk score (PRS) tools differ substantially in statistical assumptions, input requirements, and implementation complexity, making direct comparison difficult. We developed a harmonized, implementation-aware benchmarking framework…

Genomics · Quantitative Biology 2026-03-24 Muhammad Muneeb, David B. Ascher

G2DR: A Genotype-First Framework for Genetics-Informed Target Prioritization and Drug Repurposing

Human genetics offers a promising route to therapeutic discovery, yet practical frameworks translating genotype-derived signal into ranked target and drug hypotheses remain limited, particularly when matched disease transcriptomics are…

Genomics · Quantitative Biology 2026-03-24 Muhammad Muneeb, David B. Ascher

Identifying genes associated with phenotypes using machine and deep learning

Identifying disease-associated genes enables the development of precision medicine and the understanding of biological processes. Genome-wide association studies (GWAS), gene expression data, biological pathway analysis, and protein network…

Genomics · Quantitative Biology 2026-03-10 Muhammad Muneeb, David B. Ascher, YooChan Myung

GWAS Summary Statistic Tool: A Meta-Analysis and Parsing Tool for Polygenic Risk Score Calculation

Motivation: GWAS (genome-wide association study) summary statistic files are essential inputs for polygenic risk score (PRS) calculation. However, identifying suitable files across thousands of catalog entries typically requires downloading…

Quantitative Methods · Quantitative Biology 2026-03-10 Muhammad Muneeb, David B. Ascher

An Empirical Analysis of Fine-Tuning Large Language Models on Bioinformatics Literature: PRSGPT and BioStarsGPT

Large language models (LLMs) often lack specialized knowledge for complex bioinformatics applications. We present a reproducible pipeline for fine-tuning LLMs on specialized bioinformatics data, demonstrated through two use cases: PRSGPT,…

Computation and Language · Computer Science 2026-01-21 Muhammad Muneeb, David B. Ascher

Comparative Analysis of 47 Context-Based Question Answer Models Across 8 Diverse Datasets

Context-based question answering (CBQA) models provide more accurate and relevant answers by considering the contextual information. They effectively extract specific information given a context, making them functional in various…

Computation and Language · Computer Science 2025-12-02 Muhammad Muneeb, David B. Ascher, Ahsan Baidar Bakht

Deep learning pipeline for image classification on mobile phones

This article proposes and documents a machine-learning framework and tutorial for classifying images using mobile phones. Compared to computers, the performance of deep learning models degrades when deployed on a mobile phone and…

Image and Video Processing · Electrical Eng. & Systems 2022-06-02 Muhammad Muneeb, Samuel F. Feng, Andreas Henschel

Integration of Single Photon Emitters in 2D Layered Materials with a Silicon Nitride Photonic Chip

Photonic integrated circuits (PICs) enable miniaturization of optical quantum circuits because several optic and electronic functionalities can be added on the same chip. Single photon emitters (SPEs) are central building blocks for such…

Optics · Physics 2019-10-09 Frédéric Peyskens, Chitraleema Chakraborty, Muhammad Muneeb, Dries Van Thourhout, Dirk Englund