English
Related papers

Related papers: MoleCode unlocks structural intelligence in large …

200 papers

Large Language Models (LLMs) with their strong task-handling capabilities have shown remarkable advancements across a spectrum of fields, moving beyond natural language understanding. However, their proficiency within the chemistry domain…

Computer Vision and Pattern Recognition · Computer Science 2026-02-02 Khiem Le , Zhichun Guo , Kaiwen Dong , Xiaobao Huang , Bozhao Nan , Roshni Iyer , Xiangliang Zhang , Olaf Wiest , Wei Wang , Ting Hua , Nitesh V. Chawla

We present MolLingo, a multi-agent system that emulates the reasoning process of a chemist to automate molecular design. Existing LLM-based approaches either operate as standalone generative models without access to external tools or lack…

Artificial Intelligence · Computer Science 2026-05-28 Thao Nguyen , Heng Ji

Molecular optimization is a central task in drug discovery that requires precise structural reasoning and domain knowledge. While large language models (LLMs) have shown promise in generating high-level editing intentions in natural…

Machine Learning · Computer Science 2025-10-17 Wenyu Zhu , Chengzhu Li , Xiaohe Tian , Yifan Wang , Yinjun Jia , Jianhui Wang , Bowen Gao , Ya-Qin Zhang , Wei-Ying Ma , Yanyan Lan

Recent advancements in computational chemistry have leveraged the power of trans-former-based language models, such as MoLFormer, pre-trained using a vast amount of simplified molecular-input line-entry system (SMILES) sequences, to…

Biomolecules · Quantitative Biology 2024-11-05 Tianhao Peng , Yuchen Li , Xuhong Li , Jiang Bian , Zeke Xie , Ning Sui , Shahid Mumtaz , Yanwu Xu , Linghe Kong , Haoyi Xiong

Molecular property prediction has gained significant attention due to its transformative potential in multiple scientific disciplines. Conventionally, a molecule graph can be represented either as a graph-structured data or a SMILES text.…

Machine Learning · Computer Science 2023-07-17 Chen Qian , Huayi Tang , Zhirui Yang , Hong Liang , Yong Liu

Despite their ability to understand chemical knowledge, large language models (LLMs) remain limited in their capacity to propose novel molecules with desired functions (e.g., drug-like properties). In addition, the molecules that LLMs…

A molecule's properties are fundamentally determined by its composition and structure encoded in its molecular graph. Thus, reasoning about molecular properties requires the ability to parse and understand the molecular graph. Large…

Machine Learning · Computer Science 2026-01-22 Christoph Bartmann , Johannes Schimunek , Mykyta Ielanskyi , Philipp Seidl , Günter Klambauer , Sohvi Luukkonen

Recently, large language models (LLMs) have shown significant progress, approaching human perception levels. In this work, we demonstrate that despite these advances, LLMs still struggle to reason using molecular structural information.…

Machine Learning · Computer Science 2025-05-26 Yunhui Jang , Jaehyung Kim , Sungsoo Ahn

Large Language Models (LLMs) are central to the one-for-all intelligent paradigm, but they face a fundamental challenge when dealing with heterogeneous scientific data such as molecules: the inherent gap between discrete linguistic symbols…

Artificial Intelligence · Computer Science 2026-05-22 Yuxuan Chen , Changwei Lv , Yunduo Xiao , Zhongjing Du , Daquan Zhou , Yukun Yan , Zheni Zeng , Zhiyuan Liu

Large Language Models (LLMs) have shown impressive performance across various domains, but their ability to perform molecular reasoning remains underexplored. Existing methods mostly rely on general-purpose prompting, which lacks…

Despite deep learning's success in chemistry, its impact is hindered by a lack of interpretability and an inability to resolve activity cliffs, where minor structural nuances trigger drastic property shifts. Current representation learning,…

Machine Learning · Computer Science 2026-03-26 Xiangsen Chen , Ruilong Wu , Yanyan Lan , Ting Ma , Yang Liu

Understanding and continuously refining multimodal molecular knowledge is crucial for advancing biomedicine, chemistry, and materials science. Molecule language models (MoLMs) have become powerful tools in these domains, integrating…

Machine Learning · Computer Science 2025-12-01 Zhenyu Lei , Patrick Soga , Yaochen Zhu , Yinhan He , Yushun Dong , Jundong Li

Large language models (LLMs) are increasingly recognized as powerful tools for scientific discovery, particularly in molecular science. A fundamental requirement for these models is the ability to accurately understand molecular structures,…

Machine Learning · Computer Science 2025-05-23 Yunhui Jang , Jaehyung Kim , Sungsoo Ahn

Molecule and text representation learning has gained increasing interest due to its potential for enhancing the understanding of chemical information. However, existing models often struggle to capture subtle differences between molecules…

Machine Learning · Computer Science 2025-10-31 Hyuntae Park , Yeachan Kim , SangKeun Lee

Recent advances in molecular science have been propelled significantly by large language models (LLMs). However, their effectiveness is limited when relying solely on molecular sequences, which fail to capture the complex structures of…

Quantitative Methods · Quantitative Biology 2025-08-12 Jianting Tang , Yubo Wang , Haoyu Cao , Linli Xu

Molecule discovery is a pivotal research field, impacting everything from medicine to materials. Recently, Large Language Models (LLMs) have been widely adopted in molecular understanding and generation, serving as a bridge between the…

Computation and Language · Computer Science 2026-04-29 Jiatong Li , Yunqing Liu , Wei Liu , Jingdi Le , Di Zhang , Wenqi Fan , Dongzhan Zhou , Yuqiang Li , Qing Li

Large language models (LLMs) have demonstrated broad utility across molecular domains, spanning drug discovery and materials design. Analyzing LLMs' latent representations is crucial for elucidating their underlying mechanisms, improving…

Machine Learning · Computer Science 2026-02-03 Zhuoran Li , Xu Sun , Wanyu Lin , Jiannong Cao

While Large Language Models (LLMs) have shown exceptional generalization capabilities, their ability to process graph data, such as molecular structures, remains limited. To bridge this gap, this paper proposes Graph2Token, an efficient…

Machine Learning · Computer Science 2025-03-11 Runze Wang , Mingqi Yang , Yanming Shen

Recent advances in large language models (LLMs) have led to models that tackle diverse molecular tasks, such as chemical reaction prediction and molecular property prediction. Large-scale molecular instruction-tuning datasets have enabled…

Machine Learning · Computer Science 2025-05-27 Chanhui Lee , Hanbum Ko , Yuheon Song , YongJun Jeong , Rodrigo Hormazabal , Sehui Han , Kyunghoon Bae , Sungbin Lim , Sungwoong Kim

Large language models (LLMs) hold considerable potential for advancing scientific discovery, yet systematic assessment of their dynamic reasoning in real-world research remains limited. Current scientific evaluation benchmarks predominantly…

Computation and Language · Computer Science 2026-03-27 Taolin Han , Shuang Wu , Jinghang Wang , Yuhao Zhou , Renquan Lv , Bing Zhao , Wei Hu
‹ Prev 1 2 3 10 Next ›