Related papers: Celer: an Efficient Program for Genotype Eliminati…

Improving sequence-based genotype calls with linkage disequilibrium and pedigree information

Whole and targeted sequencing of human genomes is a promising, increasingly feasible tool for discovering genetic contributions to risk of complex diseases. A key step is calling an individual's genotype from the multiple aligned short read…

Applications · Statistics 2012-06-29 Baiyu Zhou , Alice S. Whittemore

Haplotype Inference for Pedigrees with Few Recombinations

Pedigrees, or family trees, are graphs of family relationships that are used to study inheritance. A fundamental problem in computational biology is to find, for a pedigree with $n$ individuals genotyped at every site, a set of…

Data Structures and Algorithms · Computer Science 2016-02-16 Bonnie Kirkpatrick

Synthesis of Parametric Programs using Genetic Programming and Model Checking

Formal methods apply algorithms based on mathematical principles to enhance the reliability of systems. It would only be natural to try to progress from verification, model checking or testing a system against its formal specification into…

Software Engineering · Computer Science 2014-02-28 Gal Katz , Doron Peled

mendelFix: a Perl script for checking Mendelian errors in high density SNP data of trio designs

Here we present mendelFix, a Perl script for checking Mendelian errors in genome-wide SNP data of trio designs. The program takes 12-recoded PLINK PED and MAP files as input to calculate a series of summary statistics for Mendelian errors,…

Genomics · Quantitative Biology 2013-06-11 Yuri Tani Utsunomiya , Rodrigo Vitorio Alonso , Adriana Santana do Carmo , Francine Campagnari , José Antonio Vinsintin , José Fernando Garcia

Inferring genotyping error rates from genotyped trios

Genotyping errors are known to influence the power of both family-based and case-control studies in the genetics of complex disease. Estimating genotyping error rate in a given dataset can be complex, but when family information is…

Quantitative Methods · Quantitative Biology 2011-09-08 Luke Jostins

Efficient Reconstruction of Stochastic Pedigrees: Some Steps From Theory to Practice

In an extant population, how much information do extant individuals provide on the pedigree of their ancestors? Recent work by Kim, Mossel, Ramnarayan and Turner (2020) studied this question under a number of simplifying assumptions,…

Populations and Evolution · Quantitative Biology 2022-11-29 Elchanan Mossel , David Vulakh

Fast Genome-Wide QTL Analysis Using Mendel

Pedigree GWAS (Option 29) in the current version of the Mendel software is an optimized subroutine for performing large scale genome-wide QTL analysis. This analysis (a) works for random sample data, pedigree data, or a mix of both, (b) is…

Applications · Statistics 2014-08-01 Hua Zhou , Jin Zhou , Tao Hu , Eric M Sobel , Kenneth Lange

Efficient Reconstruction of Stochastic Pedigrees

We introduce a new algorithm called {\sc Rec-Gen} for reconstructing the genealogy or \textit{pedigree} of an extant population purely from its genetic data. We justify our approach by giving a mathematical proof of the effectiveness of…

Data Structures and Algorithms · Computer Science 2020-05-11 Younhun Kim , Elchanan Mossel , Govind Ramnarayan , Paxton Turner

An Efficient Genetic Algorithm for Discovering Diverse-Frequent Patterns

Working with exhaustive search on large dataset is infeasible for several reasons. Recently, developed techniques that made pattern set mining feasible by a general solver with long execution time that supports heuristic search and are…

Artificial Intelligence · Computer Science 2015-07-21 Shanjida Khatun , Hasib Ul Alam , Swakkhar Shatabda

Reconciling Multiple Genes Trees via Segmental Duplications and Losses

Reconciling gene trees with a species tree is a fundamental problem to understand the evolution of gene families. Many existing approaches reconcile each gene tree independently. However, it is well-known that the evolution of gene families…

Populations and Evolution · Quantitative Biology 2018-06-12 Riccardo Dondi , Manuel Lafond , Celine Scornavacca

From RNA sequencing measurements to the final results: a practical guide to navigating the choices and uncertainties of gene set analysis

Gene set analysis, a popular approach for analyzing high-throughput gene expression data, aims to identify sets of related genes that show significantly enriched or depleted expression patterns between different conditions. In the last…

Applications · Statistics 2026-01-08 Milena Wünsch , Christina Sauer , Patrick Callahan , Ludwig Christian Hinske , Anne-Laure Boulesteix

GEDI: Scalable Algorithms for Genotype Error Detection and Imputation

Genome-wide association studies generate very large datasets that require scalable analysis algorithms. In this report we describe the GEDI software package, which implements efficient algorithms for performing several common tasks in the…

Data Structures and Algorithms · Computer Science 2016-09-08 Justin Kennedy , Ion I. Mandoiu , Bogdan Pasaniuc

Prediction of Hereditary Cancers Using Neural Networks

Family history is a major risk factor for many types of cancer. Mendelian risk prediction models translate family histories into cancer risk predictions based on knowledge of cancer susceptibility genes. These models are widely used in…

Machine Learning · Statistics 2021-06-28 Zoe Guan , Giovanni Parmigiani , Danielle Braun , Lorenzo Trippa

Simple robust genomic prediction and outlier detection for a multi-environmental field trial

The aim of plant breeding trials is often to identify germplasms that are well adapt to target environments. These germplasms are identified through genomic prediction from the analysis of multi-environmental field trial (MET) using linear…

Quantitative Methods · Quantitative Biology 2018-07-20 Emi Tanaka

Fast Algorithms for Reconciliation under Hybridization and Incomplete Lineage Sorting

Reconciling a gene tree with a species tree is an important task that reveals much about the evolution of genes, genomes, and species, as well as about the molecular function of genes. A wide array of computational tools have been devised…

Populations and Evolution · Quantitative Biology 2012-12-11 Yun Yu , Luay Nakhleh

Deep Neural Network: An Efficient and Optimized Machine Learning Paradigm for Reducing Genome Sequencing Error

Genomic data I used in many fields but, it has become known that most of the platforms used in the sequencing process produce significant errors. This means that the analysis and inferences generated from these data may have some errors…

Genomics · Quantitative Biology 2024-09-05 Ferdinand Kartriku , Robert Sowah , Charles Saah

mlscorecheck: Testing the consistency of reported performance scores and experiments in machine learning

Addressing the reproducibility crisis in artificial intelligence through the validation of reported experimental results is a challenging task. It necessitates either the reimplementation of techniques or a meticulous assessment of papers…

Machine Learning · Computer Science 2023-11-14 György Kovács , Attila Fazekas

The genetic code Via Godel encoding

The genetic code structure into distinct multiplet-classes as well as the numeric degeneracies of the latter are revealed by a two-step process. First, an empirical inventory of the degeneracies (of the shuffled multiplets) in two specific…

Other Quantitative Biology · Quantitative Biology 2009-11-13 Tidjani Negadi

Computational aspects of DNA mixture analysis

Statistical analysis of DNA mixtures is known to pose computational challenges due to the enormous state space of possible DNA profiles. We propose a Bayesian network representation for genotypes, allowing computations to be performed…

Methodology · Statistics 2014-02-21 Therese Graversen , Steffen Lauritzen

Learning Expressive Linkage Rules using Genetic Programming

A central problem in data integration and data cleansing is to find entities in different data sources that describe the same real-world object. Many existing methods for identifying such entities rely on explicit linkage rules which…

Databases · Computer Science 2012-08-02 Robert Isele , Christian Bizer