English
Related papers

Related papers: Protein sequence classification using natural lang…

200 papers

Composed of amino acid chains that influence how they fold and thus dictating their function and features, proteins are a class of macromolecules that play a central role in major biological processes and are required for the structure,…

Quantitative Methods · Quantitative Biology 2022-07-15 Aaron Wang

Deep neural networks (DNN) have achieved remarkable success in various fields, including computer vision and natural language processing. However, training an effective DNN model still poses challenges. This paper aims to propose a method…

Machine Learning · Computer Science 2024-07-03 Hejie Ying , Mengmeng Song , Yaohong Tang , Shungen Xiao , Zimin Xiao

Objective: We investigate whether deep learning techniques for natural language processing (NLP) can be used efficiently for patient phenotyping. Patient phenotyping is a classification task for determining whether a patient has a medical…

We present paired learning and inference algorithms for significantly reducing computation and increasing speed of the vector dot products in the classifiers that are at the heart of many NLP components. This is accomplished by partitioning…

Computation and Language · Computer Science 2015-05-25 Emma Strubell , Luke Vilnis , Kate Silverstein , Andrew McCallum

We introduce a protein language model for determining the complete sequence of a peptide based on measurement of a limited set of amino acids. To date, protein sequencing relies on mass spectrometry, with some novel edman degregation based…

Automated document classification is a trending topic in Natural Language Processing (NLP) due to the extensive growth in digital databases. However, a model that fits well for a specific classification task might perform weakly for another…

Machine Learning · Computer Science 2025-10-03 Uvini Ranaweera , Bawun Mawitagama , Sanduni Liyanage , Sandupa Keshan , Tiloka de Silva , Supun Hewawalpita

Natural Language Processing (NLP) is widely used in fields like machine translation and sentiment analysis. However, traditional NLP models struggle with accuracy and efficiency. This paper introduces Deep Convolutional Neural Networks…

Computation and Language · Computer Science 2024-12-23 Chang Weng , Scott Rood , Mehdi Ali Ramezani , Amir Aslani , Reza Zarrab , Wang Zwuo , Sanjeev Salimans , Tim Satheesh

In recent era prediction of enzyme class from an unknown protein is one of the challenging tasks in bioinformatics. Day to day the number of proteins is increases as result the prediction of enzyme class gives a new opportunity to…

Machine Learning · Computer Science 2019-01-21 Chhote Lal Prasad Gupta , Anand Bihari , Sudhakar Tripathi

Computational biology and bioinformatics provide vast data gold-mines from protein sequences, ideal for Language Models taken from NLP. These LMs reach for new prediction frontiers at low inference costs. Here, we trained two…

Data-efficient image classification is a challenging task that aims to solve image classification using small training data. Neural network-based deep learning methods are effective for image classification, but they typically require…

Neural and Evolutionary Computing · Computer Science 2022-12-05 Ying Bi , Bing Xue , Mengjie Zhang

For protein sequence datasets, unlabeled data has greatly outpaced labeled data due to the high cost of wet-lab characterization. Recent deep-learning approaches to protein prediction have shown that pre-training on unlabeled data can yield…

Machine Learning · Computer Science 2020-12-02 Pascal Sturmfels , Jesse Vig , Ali Madani , Nazneen Fatema Rajani

Representation learning and \emph{de novo} generation of proteins are pivotal computational biology tasks. Whilst natural language processing (NLP) techniques have proven highly effective for protein sequence modelling, structure modelling…

Quantitative Methods · Quantitative Biology 2025-01-08 Benoit Gaujac , Jérémie Donà , Liviu Copoiu , Timothy Atkinson , Thomas Pierrot , Thomas D. Barrett

State-of-the-art natural language processing (NLP) models are trained on massive training corpora, and report a superlative performance on evaluation datasets. This survey delves into an important attribute of these datasets: the dialect of…

Computation and Language · Computer Science 2024-12-10 Aditya Joshi , Raj Dabre , Diptesh Kanojia , Zhuang Li , Haolan Zhan , Gholamreza Haffari , Doris Dippold

Natural Language Processing (NLP) is an important branch of artificial intelligence that studies how to enable computers to understand, process, and generate human language. Text classification is a fundamental task in NLP, which aims to…

Computation and Language · Computer Science 2024-03-18 Xiaonan Xu , Zheng Xu , Zhipeng Ling , Zhengyu Jin , ShuQian Du

This article reviews recent advances in applying natural language processing (NLP) to Electronic Health Records (EHRs) for computational phenotyping. NLP-based computational phenotyping has numerous applications including diagnosis…

Computation and Language · Computer Science 2018-06-18 Zexian Zeng , Yu Deng , Xiaoyu Li , Tristan Naumann , Yuan Luo

Protein language models (pLMs) have recently gained significant attention for their ability to uncover relationships between sequence, structure, and function from evolutionary statistics, thereby accelerating therapeutic drug discovery.…

Machine Learning · Computer Science 2026-03-04 Darshan Patil , Pranshu Malviya , Mathieu Reymond , Quentin Fournier , Sarath Chandar

Accurate localization of proteins from fluorescence microscopy images is challenging due to the inter-class similarities and intra-class disparities introducing grave concerns in addressing multi-class classification problems. Conventional…

Computer Vision and Pattern Recognition · Computer Science 2021-10-11 Muhammad Tahir , Saeed Anwar , Ajmal Mian , Abdul Wahab Muzaffar

Deep learning is playing a vital role in every field which involves data. It has emerged as a strong and efficient framework that can be applied to a broad spectrum of complex learning problems which were difficult to solve using…

Biomolecules · Quantitative Biology 2022-09-23 Faisal Ghaffar , Sarwar Khan , Gaddisa O. , Chen Yu-jhen

Advanced automated AI techniques allow us to classify protein sequences and discern their biological families and functions. Conventional approaches for classifying these protein families often focus on extracting N-Gram features from the…

Quantitative Methods · Quantitative Biology 2024-10-24 Bahar Ali , Anwar Shah , Malik Niaz , Musadaq Mansoord , Sami Ullah , Muhammad Adnan

The classification of DNA sequences is a key research area in bioinformatics as it enables researchers to conduct genomic analysis and detect possible diseases. In this paper, three state-of-the-art algorithms, namely Convolutional Neural…

Other Quantitative Biology · Quantitative Biology 2020-11-03 Xiangxie Zhang , Ben Beinke , Berlian Al Kindhi , Marco Wiering
‹ Prev 1 2 3 10 Next ›