Related papers: A successive sub-grouping method for multiple sequ…

Covariance in protein multiple sequence alignments using groups of columns

Algorithms that detect covariance between pairs of columns in multiple sequence alignments are commonly employed to predict functionally important residues and structural contacts. However, the assumption that co-variance only occurs…

Quantitative Methods · Quantitative Biology 2014-01-07 Kyle E. Kreth , Anthony A. Fodor

On the Natural Structure of Amino Acid Patterns in Families of Protein Sequences

All known terrestrial proteins are coded as continuous strings of ~20 amino acids. The patterns formed by the repetitions of elements in groups of finite sequences describes the natural architectures of protein families. We present a method…

Biomolecules · Quantitative Biology 2018-07-30 Pablo Turjanski , Diego U. Ferreiro

Microarrays for Amino Acid Analysis and Protein Sequencing

A method is described where the aminoacyl-tRNA synthetase system is used to create very small devices for quantitative analysis of the amino acids that occur in proteins. The basis of the method is that each of the 20 synthetases and/or a…

Biological Physics · Physics 2007-05-23 Edward Shipwash

Local sequence-structure relationships in proteins

We seek to understand the interplay between amino acid sequence and local structure in proteins. Are some amino acids unique in their ability to fit harmoniously into certain local structures? What is the role of sequence in sculpting the…

Biomolecules · Quantitative Biology 2021-01-29 Tatjana Škrbić , Amos Maritan , Achille Giacometti , Jayanth R. Banavar

A statistical physics perspective on alignment-independent protein sequence comparison

Within bioinformatics, the textual alignment of amino acid sequences has long dominated the determination of similarity between proteins, with all that implies for shared structure, function and evolutionary descent. Despite the relative…

Quantitative Methods · Quantitative Biology 2016-02-10 Amit K Chattopadhyay , Diar Nasiev , Darren R Flower

Design of Sequences with Good Folding Properties in Coarse-Grained Protein Models

Background: Designing amino acid sequences that are stable in a given target structure amounts to maximizing a conditional probability. A straightforward approach to accomplish this is a nested Monte Carlo where the conformation space is…

Soft Condensed Matter · Physics 2016-08-31 Anders Irbäck , Carsten Peterson , Frank Potthast , Erik Sandelin

Binary classification of proteins by a Machine Learning approach

In this work we present a system based on a Deep Learning approach, by using a Convolutional Neural Network, capable of classifying protein chains of amino acids based on the protein description contained in the Protein Data Bank. Each…

Machine Learning · Computer Science 2021-11-04 Damiano Perri , Marco Simonetti , Andrea Lombardi , Noelia Faginas-Lago , Osvaldo Gervasi

Aligning Multiple Protein Structures using Biochemical and Biophysical Properties

Aligning multiple protein structures can yield valuable information about structural similarities among related proteins, as well as provide insight into evolutionary relationships between proteins in a family. We have developed an…

Biomolecules · Quantitative Biology 2019-11-07 Paul Shealy , Homayoun Valafar

The Protein Family Classification in Protein Databases via Entropy Measures

In the present work, we review the fundamental methods which have been developed in the last few years for classifying into families and clans the distribution of amino acids in protein databases. This is done through functions of random…

Biomolecules · Quantitative Biology 2018-06-15 R. P. Mondaini , S. C. de Albuquerque Neto

piSAAC: Extended notion of SAAC feature selection novel method for discrimination of Enzymes model using different machine learning algorithm

Enzymes and proteins are live driven biochemicals, which has a dramatic impact over the environment, in which it is active. So, therefore, it is highly looked-for to build such a robust and highly accurate automatic and computational model…

Biomolecules · Quantitative Biology 2021-01-11 Zaheer Ullah Khan , Dechang Pi , Izhar Ahmed Khan , Asif Nawaz , Jamil Ahmad , Mushtaq Hussain

Bayesian Protein Sequence and Structure Alignment

The structure of a protein is crucial in determining its functionality, and is much more conserved than sequence during evolution. A key task in structural biology is to compare protein structures in order to determine evolutionary…

Methodology · Statistics 2019-11-06 Christopher Fallaize , Peter Green , Kanti Mardia , Stuart Barber

Amino acid substitution matrices for protein conformation identification

Methods for alignment of protein sequences typically measure similarity by using substitution matrix with scores for all possible exchanges of one amino acid with another. Although widely used, the matrices derived from homologous sequence…

Biomolecules · Quantitative Biology 2007-05-23 Xin Liu , Wei-Mou Zheng

Algorithms for normalized multiple sequence alignments

Sequence alignment supports numerous tasks in bioinformatics, natural language processing, pattern recognition, social sciences, and others fields. While the alignment of two sequences may be performed swiftly in many applications, the…

Data Structures and Algorithms · Computer Science 2021-12-06 Eloi Araujo , Luiz Rozante , Diego P. Rubert , Fabio V. Martinez

Pairing interacting protein sequences using masked language modeling

Predicting which proteins interact together from amino-acid sequences is an important task. We develop a method to pair interacting protein sequences which leverages the power of protein language models trained on multiple sequence…

Biomolecules · Quantitative Biology 2024-12-30 Umberto Lupo , Damiano Sgarbossa , Anne-Florence Bitbol

Fourier-based classification of protein secondary structures

The correct prediction of protein secondary structures is one of the key issues in predicting the correct protein folded shape, which is used for determining gene function. Existing methods make use of amino acids properties as indices to…

Quantitative Methods · Quantitative Biology 2017-05-01 Jian-Jun Shu , Kian-Yan Yong

Single-Sequence-Based Protein Secondary Structure Prediction using One-Hot and Chemical Encodings of Amino Acids

In protein secondary structure prediction, each amino acid in sequence is typically treated as a distinct category and represented by a one-hot vector. In this study, we developed two novel chemical representations for amino acids utilizing…

Biomolecules · Quantitative Biology 2024-07-09 Hoa Trinh , Satish Kumar Thittamaranahalli

Deep Robust Framework for Protein Function Prediction using Variable-Length Protein Sequences

Amino acid sequence portrays most intrinsic form of a protein and expresses primary structure of protein. The order of amino acids in a sequence enables a protein to acquire a particular stable conformation that is responsible for the…

Machine Learning · Computer Science 2022-08-29 Ashish Ranjan , Md Shah Fahad , David Fernandez-Baca , Akshay Deepak , Sudhakar Tripathi

Alignment Metric Accuracy

We propose a metric for the space of multiple sequence alignments that can be used to compare two alignments to each other. In the case where one of the alignments is a reference alignment, the resulting accuracy measure improves upon…

Quantitative Methods · Quantitative Biology 2011-11-09 Ariel S. Schwartz , Eugene W. Myers , Lior Pachter

Combination of digital signal processing and assembled predictive models facilitates the rational design of proteins

Predicting the effect of mutations in proteins is one of the most critical challenges in protein engineering; by knowing the effect a substitution of one (or several) residues in the protein's sequence has on its overall properties, could…

Computational Engineering, Finance, and Science · Computer Science 2020-10-08 David Medina-Ortiz , Sebastian Contreras , Juan Amado-Hinojosa , Jorge Torres-Almonacid , Juan A. Asenjo , Marcelo Navarrete , Álvaro Olivera-Nappa

Who Watches the Watchmen? An Appraisal of Benchmarks for Multiple Sequence Alignment

Multiple sequence alignment (MSA) is a fundamental and ubiquitous technique in bioinformatics used to infer related residues among biological sequences. Thus alignment accuracy is crucial to a vast range of analyses, often in ways difficult…

Quantitative Methods · Quantitative Biology 2015-01-09 Stefano Iantorno , Kevin Gori , Nick Goldman , Manuel Gil , Christophe Dessimoz