Related papers: Protein Representation Learning by Capturing Prote…

200 papers

Inferring the structural properties of a protein from its amino acid sequence is a challenging yet important problem in biology. Structures are not known for the vast majority of protein sequences, but structure is critical for…

Machine Learning · Computer Science 2019-10-17 Tristan Bepler, Bonnie Berger

Learning effective protein representations is critical in a variety of tasks in biology such as predicting protein functions. Recent sequence representation learning methods based on Protein Language Models (PLMs) excel in sequence-based…

Quantitative Methods · Quantitative Biology 2023-10-19 Zuobai Zhang, Chuanrui Wang, Minghao Xu, Vijil Chenthamarakshan, Aurélie Lozano, Payel Das, Jian Tang

Protein representation learning is critical for numerous biological tasks. Recently, large transformer-based protein language models (pLMs) pretrained on large scale protein sequences have demonstrated significant success in sequence-based…

Machine Learning · Computer Science 2025-08-12 Xuefeng Liu, Songhao Jiang, Chih-chan Tien, Jinbo Xu, Rick Stevens

Learning effective protein representations is critical in a variety of tasks in biology such as predicting protein function or structure. Existing approaches usually pretrain protein language models on a large number of unlabeled amino acid…

Machine Learning · Computer Science 2023-01-31 Zuobai Zhang, Minghao Xu, Arian Jamasb, Vijil Chenthamarakshan, Aurelie Lozano, Payel Das, Jian Tang

Proteins are fundamental biological entities that play a key role in life activities. The amino acid sequences of proteins can be folded into stable 3D structures in the real physicochemical world, forming a special kind of…

Machine Learning · Computer Science 2023-01-04 Lirong Wu, Yufei Huang, Haitao Lin, Stan Z. Li

Understanding protein sequences is vital and urgent for biology, healthcare, and medicine. Labeling approaches are expensive yet time-consuming, while the amount of unlabeled data is increasing quite faster than that of the labeled data due…

Computation and Language · Computer Science 2021-11-01 Liang He, Shizhuo Zhang, Lijun Wu, Huanhuan Xia, Fusong Ju, He Zhang, Siyuan Liu, Yingce Xia, Jianwei Zhu, Pan Deng, Bin Shao, Tao Qin, Tie-Yan Liu

Deep learning has become a crucial tool in studying proteins. While the significance of modeling protein structure has been discussed extensively in the literature, amino acid types are typically included in the input as a default operation…

Quantitative Methods · Quantitative Biology 2024-07-01 Yang Tan, Lirong Zheng, Bozitao Zhong, Liang Hong, Bingxin Zhou

Proteins are complex biomolecules that play a central role in various biological processes, making them critical targets for breakthroughs in molecular biology, medical research, and drug discovery. Deciphering their intricate, hierarchical…

Machine Learning · Computer Science 2025-05-09 Viet Thanh Duy Nguyen, Truong-Son Hy

Proteins, essential to biological systems, perform functions intricately linked to their three-dimensional structures. Understanding the relationship between protein structures and their amino acid sequences remains a core challenge in…

Quantitative Methods · Quantitative Biology 2024-11-04 Liang He, Peiran Jin, Yaosen Min, Shufang Xie, Lijun Wu, Tao Qin, Xiaozhuan Liang, Kaiyuan Gao, Yuliang Jiang, Tie-Yan Liu

Protein representation learning is a challenging task that aims to capture the structure and function of proteins from their amino acid sequences. Previous methods largely ignored the fact that not all amino acids are equally important for…

Machine Learning · Computer Science 2024-04-02 Ruijie Quan, Wenguan Wang, Fan Ma, Hehe Fan, Yi Yang

Protein representation learning has primarily benefited from the remarkable development of language models (LMs). Accordingly, pre-trained protein models also suffer from a problem in LMs: a lack of factual knowledge. The recent solution…

Machine Learning · Computer Science 2023-02-16 Hong-Yu Zhou, Yunxiang Fu, Zhicheng Zhang, Cheng Bian, Yizhou Yu

Current protein language models (PLMs) learn protein representations mainly based on their sequences, thereby well capturing co-evolutionary information, but they are unable to explicitly acquire protein functions, which is the end goal of…

Biomolecules · Quantitative Biology 2023-07-06 Minghao Xu, Xinyu Yuan, Santiago Miret, Jian Tang

Protein language models have excelled in a variety of tasks, ranging from structure prediction to protein engineering. However, proteins are highly diverse in functions and structures, and current state-of-the-art models including the…

Biomolecules · Quantitative Biology 2023-02-27 Chang Ma, Haiteng Zhao, Lin Zheng, Jiayi Xin, Qintong Li, Lijun Wu, Zhihong Deng, Yang Lu, Qi Liu, Lingpeng Kong

In sequence-based predictions, conventionally an input sequence is represented by a multiple sequence alignment (MSA) or a representation derived from MSA, such as a position-specific scoring matrix. Recently, inspired by the development in…

Quantitative Methods · Quantitative Biology 2021-10-18 Nabil Ibtehaz, Daisuke Kihara

Amino acid sequence portrays most intrinsic form of a protein and expresses primary structure of protein. The order of amino acids in a sequence enables a protein to acquire a particular stable conformation that is responsible for the…

Machine Learning · Computer Science 2022-08-29 Ashish Ranjan, Md Shah Fahad, David Fernandez-Baca, Akshay Deepak, Sudhakar Tripathi

Protein language models often take into consideration the alignment between a protein sequence and its textual description. However, they do not take structural information into consideration. Traditional methods treat sequence and…

Machine Learning · Computer Science 2026-03-10 Aditya Ranganath, Hasin Us Sami, Kowshik Thopalli, Bhavya Kailkhura, Wesam Sakla

Protein function is inherently linked to its localization within the cell, and fluorescent microscopy data is an indispensable resource for learning representations of proteins. Despite major developments in molecular representation…

Quantitative Methods · Quantitative Biology 2022-05-25 Anastasia Razdaibiedina, Alexander Brechalov

Protein representation learning plays a crucial role in understanding the structure and function of proteins, which are essential biomolecules involved in various biological processes. In recent years, deep learning has emerged as a…

Biomolecules · Quantitative Biology 2024-03-11 Bozhen Hu, Cheng Tan, Lirong Wu, Jiangbin Zheng, Jun Xia, Zhangyang Gao, Zicheng Liu, Fandi Wu, Guijun Zhang, Stan Z. Li

In recent years, there has been a surge in the development of 3D structure-based pre-trained protein models, representing a significant advancement over pre-trained protein language models in various downstream tasks. However, most existing…

Machine Learning · Computer Science 2024-06-04 Jiale Zhao, Wanru Zhuang, Jia Song, Yaqi Li, Shuqi Lu

Learning from 3D protein structures has gained wide interest in protein modeling and structural bioinformatics. Unfortunately, the number of available structures is orders of magnitude lower than the training data sizes commonly used in…

Biomolecules · Quantitative Biology 2022-06-01 Pedro Hermosilla, Timo Ropinski