English
Related papers

Related papers: Efficient Strategy for Improving Large Language Mo…

200 papers

The rapid growth of Large Language Models (LLMs) has been a driving force in transforming various domains, reshaping the artificial general intelligence landscape. However, the increasing computational and memory demands of these models…

Computation and Language · Computer Science 2024-04-22 Tianyu Ding , Tianyi Chen , Haidong Zhu , Jiachen Jiang , Yiqi Zhong , Jinxin Zhou , Guangzhi Wang , Zhihui Zhu , Ilya Zharkov , Luming Liang

Large Language Models (LLMs) have achieved remarkable success across a wide range of natural language tasks, and recent efforts have sought to extend their capabilities to multimodal domains and resource-constrained environments. However,…

Machine Learning · Computer Science 2025-05-26 Yun-Da Tsai

In recent years, large language models (LLMs) have achieved remarkable success in natural language processing (NLP). LLMs require an extreme amount of parameters to attain high performance. As models grow into the trillion-parameter range,…

Computation and Language · Computer Science 2024-09-10 Zhyar Rzgar K Rostam , Sándor Szénási , Gábor Kertész

Large Language Models (LLMs) have shown remarkable performance in various natural language processing tasks but face challenges in mathematical reasoning, where complex problem-solving requires both linguistic understanding and mathematical…

Computation and Language · Computer Science 2025-03-20 Shuguang Chen , Guang Lin

Large Language Models (LLMs) have emerged as powerful tools in various research domains. This article examines their potential through a literature review and firsthand experimentation. While LLMs offer benefits like cost-effectiveness and…

Human-Computer Interaction · Computer Science 2024-04-10 M. Namvarpour , A. Razi

Large Language Models (LLMs) have become extremely potent instruments with exceptional capacities for comprehending and producing human-like text in a wide range of applications. However, the increasing size and complexity of LLMs present…

Machine Learning · Computer Science 2024-06-18 Yingbing Huang , Lily Jiaxin Wan , Hanchen Ye , Manvi Jha , Jinghua Wang , Yuhong Li , Xiaofan Zhang , Deming Chen

The planning ability of Large Language Models (LLMs) has garnered increasing attention in recent years due to their remarkable capacity for multi-step reasoning and their ability to generalize across a wide range of domains. While some…

Artificial Intelligence · Computer Science 2025-02-19 Mohamed Aghzal , Erion Plaku , Gregory J. Stein , Ziyu Yao

Planning represents a fundamental capability of intelligent agents, requiring comprehensive environmental understanding, rigorous logical reasoning, and effective sequential decision-making. While Large Language Models (LLMs) have…

Artificial Intelligence · Computer Science 2025-05-27 Pengfei Cao , Tianyi Men , Wencan Liu , Jingwen Zhang , Xuzhao Li , Xixun Lin , Dianbo Sui , Yanan Cao , Kang Liu , Jun Zhao

The exponential growth of text-based data in domains such as healthcare, education, and social sciences has outpaced the capacity of traditional qualitative analysis methods, which are time-intensive and prone to subjectivity. Large…

Post-training of Large Language Models (LLMs) is crucial for unlocking their task generalization potential and domain-specific capabilities. However, the current LLM post-training paradigm faces significant data challenges, including the…

Computation and Language · Computer Science 2025-10-31 Junyu Luo , Bohan Wu , Xiao Luo , Zhiping Xiao , Yiqiao Jin , Rong-Cheng Tu , Nan Yin , Yifan Wang , Jingyang Yuan , Wei Ju , Ming Zhang

Building effective machine learning (ML) workflows to address complex tasks is a primary focus of the Automatic ML (AutoML) community and a critical step toward achieving artificial general intelligence (AGI). Recently, the integration of…

Machine Learning · Computer Science 2024-12-30 Yang Gu , Hengyu You , Jian Cao , Muran Yu , Haoran Fan , Shiyou Qian

Large Language Models (LLMs) have recently demonstrated remarkable capabilities in natural language processing tasks and beyond. This success of LLMs has led to a large influx of research contributions in this direction. These works…

Computation and Language · Computer Science 2024-10-18 Humza Naveed , Asad Ullah Khan , Shi Qiu , Muhammad Saqib , Saeed Anwar , Muhammad Usman , Naveed Akhtar , Nick Barnes , Ajmal Mian

Fine-tuning large language models (LLMs) with limited data poses a practical challenge in low-resource languages, specialized domains, and constrained deployment settings. While pre-trained LLMs provide strong foundations, effective…

Computation and Language · Computer Science 2025-10-29 Marton Szep , Daniel Rueckert , Rüdiger von Eisenhart-Rothe , Florian Hinterwimmer

Large language models (LLM) have revolutionized the processing of natural language. Although first benchmarks of the process modeling abilities of LLM are promising, it is currently under debate to what extent an LLM can generate good…

Computation and Language · Computer Science 2025-03-19 Peter Fettke , Constantin Houy

Large Language Models (LLMs) possess substantial reasoning capabilities and are increasingly applied to optimization tasks, particularly in synergy with evolutionary computation. However, while recent surveys have explored specific aspects…

Neural and Evolutionary Computing · Computer Science 2026-01-08 Yisong Zhang , Ran Cheng , Guoxing Yi , Kay Chen Tan

Classical and natural language planning tasks remain a difficult domain for modern large language models (LLMs). In this work, we lay the foundations for improving planning capabilities of LLMs. First, we construct a comprehensive benchmark…

Computation and Language · Computer Science 2024-11-05 Bernd Bohnet , Azade Nova , Aaron T Parisi , Kevin Swersky , Katayoon Goshvadi , Hanjun Dai , Dale Schuurmans , Noah Fiedel , Hanie Sedghi

Large Language Models (LLMs) have garnered significant attention due to their remarkable ability to process information across various languages. Despite their capabilities, they exhibit inconsistencies in handling identical queries in…

Computation and Language · Computer Science 2024-06-24 Yue Huang , Chenrui Fan , Yuan Li , Siyuan Wu , Tianyi Zhou , Xiangliang Zhang , Lichao Sun

This study analyzes the multiple functions of Large Language Models (LLMs) in transforming research and development (R&D) processes. By automating knowledge discovery, boosting hypothesis creation, integrating transdisciplinary insights,…

Computation and Language · Computer Science 2025-11-25 Raha Aghaei , Ali A. Kiaei , Mahnaz Boush , Mahan Rofoosheh , Mohammad Zavvar

Large Language Models (LLMs) have demonstrated remarkable capabilities in important tasks such as natural language understanding and language generation, and thus have the potential to make a substantial impact on our society. Such…

Computation and Language · Computer Science 2024-05-24 Zhongwei Wan , Xin Wang , Che Liu , Samiul Alam , Yu Zheng , Jiachen Liu , Zhongnan Qu , Shen Yan , Yi Zhu , Quanlu Zhang , Mosharaf Chowdhury , Mi Zhang

This critical review provides an in-depth analysis of Large Language Models (LLMs), encompassing their foundational principles, diverse applications, and advanced training methodologies. We critically examine the evolution from Recurrent…

Artificial Intelligence · Computer Science 2025-09-29 Milad Moradi , Ke Yan , David Colwell , Matthias Samwald , Rhona Asgari
‹ Prev 1 2 3 10 Next ›