Related papers: Large Language Models and Algorithm Execution: App…

Can a Large Language Model Learn Matrix Functions In Context?

Large Language Models (LLMs) have demonstrated the ability to solve complex tasks through In-Context Learning (ICL), where models learn from a few input-output pairs without explicit fine-tuning. In this paper, we explore the capacity of…

Machine Learning · Computer Science 2024-11-26 Paimon Goulart , Evangelos E. Papalexakis

LLM Reasoning Engine: Specialized Training for Enhanced Mathematical Reasoning

Large Language Models (LLMs) have shown remarkable performance in various natural language processing tasks but face challenges in mathematical reasoning, where complex problem-solving requires both linguistic understanding and mathematical…

Computation and Language · Computer Science 2025-03-20 Shuguang Chen , Guang Lin

Survey of different Large Language Model Architectures: Trends, Benchmarks, and Challenges

Large Language Models (LLMs) represent a class of deep learning models adept at understanding natural language and generating coherent responses to various prompts or queries. These models far exceed the complexity of conventional neural…

Machine Learning · Computer Science 2024-12-05 Minghao Shao , Abdul Basit , Ramesh Karri , Muhammad Shafique

An Explorative Study on Distributed Computing Techniques in Training and Inference of Large Language Models

Large language models (LLM) are advanced AI systems trained on extensive textual data, leveraging deep learning techniques to understand and generate human-like language. Today's LLMs with billions of parameters are so huge that hardly any…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-10-14 Sheikh Azizul Hakim , Saem Hasan

A Survey on Large Language Models for Automated Planning

The planning ability of Large Language Models (LLMs) has garnered increasing attention in recent years due to their remarkable capacity for multi-step reasoning and their ability to generalize across a wide range of domains. While some…

Artificial Intelligence · Computer Science 2025-02-19 Mohamed Aghzal , Erion Plaku , Gregory J. Stein , Ziyu Yao

Efficient Strategy for Improving Large Language Model (LLM) Capabilities

Large Language Models (LLMs) have become a milestone in the field of artificial intelligence and natural language processing. However, their large-scale deployment remains constrained by the need for significant computational resources.…

Computation and Language · Computer Science 2025-08-07 Julián Camilo Velandia Gutiérrez

Large Language Models for Constructing and Optimizing Machine Learning Workflows: A Survey

Building effective machine learning (ML) workflows to address complex tasks is a primary focus of the Automatic ML (AutoML) community and a critical step toward achieving artificial general intelligence (AGI). Recently, the integration of…

Machine Learning · Computer Science 2024-12-30 Yang Gu , Hengyu You , Jian Cao , Muran Yu , Haoran Fan , Shiyou Qian

When Large Language Model Meets Optimization

Optimization algorithms and large language models (LLMs) enhance decision-making in dynamic environments by integrating artificial intelligence with traditional techniques. LLMs, with extensive domain knowledge, facilitate intelligent…

Neural and Evolutionary Computing · Computer Science 2024-05-17 Sen Huang , Kaixiang Yang , Sheng Qi , Rui Wang

Case Study: Testing Model Capabilities in Some Reasoning Tasks

Large Language Models (LLMs) excel in generating personalized content and facilitating interactive dialogues, showcasing their remarkable aptitude for a myriad of applications. However, their capabilities in reasoning and providing…

Computation and Language · Computer Science 2024-02-16 Min Zhang , Sato Takumi , Jack Zhang , Jun Wang

Distilling LLMs' Decomposition Abilities into Compact Language Models

Large Language Models (LLMs) have demonstrated proficiency in their reasoning abilities, yet their large size presents scalability challenges and limits any further customization. In contrast, compact models offer customized training but…

Computation and Language · Computer Science 2024-02-06 Denis Tarasov , Kumar Shridhar

Large Language Models for Code Analysis: Do LLMs Really Do Their Job?

Large language models (LLMs) have demonstrated significant potential in the realm of natural language understanding and programming code processing tasks. Their capacity to comprehend and generate human-like code has spurred research into…

Software Engineering · Computer Science 2024-03-07 Chongzhou Fang , Ning Miao , Shaurya Srivastav , Jialin Liu , Ruoyu Zhang , Ruijie Fang , Asmita , Ryan Tsang , Najmeh Nazari , Han Wang , Houman Homayoun

Large Language Model Enhanced Machine Learning Estimators for Classification

Pre-trained large language models (LLM) have emerged as a powerful tool for simulating various scenarios and generating output given specific instructions and multimodal input. In this work, we analyze the specific use of LLM to enhance a…

Machine Learning · Computer Science 2024-05-10 Yuhang Wu , Yingfei Wang , Chu Wang , Zeyu Zheng

MM-LLMs: Recent Advances in MultiModal Large Language Models

In the past year, MultiModal Large Language Models (MM-LLMs) have undergone substantial advancements, augmenting off-the-shelf LLMs to support MM inputs or outputs via cost-effective training strategies. The resulting models not only…

Computation and Language · Computer Science 2024-05-29 Duzhen Zhang , Yahan Yu , Jiahua Dong , Chenxing Li , Dan Su , Chenhui Chu , Dong Yu

An Overview and Discussion on Using Large Language Models for Implementation Generation of Solutions to Open-Ended Problems

Large Language Models offer new opportunities to devise automated implementation generation methods that can tackle problem solving activities beyond traditional methods, which require algorithmic specifications and can use only static…

Computation and Language · Computer Science 2025-01-06 Hashmath Shaik , Alex Doboli

Executing Natural Language-Described Algorithms with Large Language Models: An Investigation

Executing computer programs described in natural language has long been a pursuit of computer science. With the advent of enhanced natural language understanding capabilities exhibited by large language models (LLMs), the path toward this…

Computation and Language · Computer Science 2024-03-15 Xin Zheng , Qiming Zhu , Hongyu Lin , Yaojie Lu , Xianpei Han , Le Sun

Distributed LLMs and Multimodal Large Language Models: A Survey on Advances, Challenges, and Future Directions

Language models (LMs) are machine learning models designed to predict linguistic patterns by estimating the probability of word sequences based on large-scale datasets, such as text. LMs have a wide range of applications in natural language…

Computation and Language · Computer Science 2025-03-24 Hadi Amini , Md Jueal Mia , Yasaman Saadati , Ahmed Imteaj , Seyedsina Nabavirazavi , Urmish Thakker , Md Zarif Hossain , Awal Ahmed Fime , S. S. Iyengar

Large Language Models for Robotics: A Survey

The human ability to learn, generalize, and control complex manipulation tasks through multi-modality feedback suggests a unique capability, which we refer to as dexterity intelligence. Understanding and assessing this intelligence is a…

Robotics · Computer Science 2025-12-03 Fanlong Zeng , Wensheng Gan , Zezheng Huai , Lichao Sun , Hechang Chen , Yongheng Wang , Ning Liu , Philip S. Yu

Evaluating the Generalization Capabilities of Large Language Models on Code Reasoning

We assess how the code reasoning abilities of large language models (LLMs) generalize to different kinds of programs. We present techniques for obtaining in- and out-of-distribution programs with different characteristics: code sampled from…

Software Engineering · Computer Science 2025-04-09 Rem Yang , Julian Dai , Nikos Vasilakis , Martin Rinard

Improving Small-Scale Large Language Models Function Calling for Reasoning Tasks

Recent advancements in Large Language Models (LLMs) have demonstrated exceptional capabilities in natural language understanding and generation. While these models excel in general complex reasoning tasks, they still face challenges in…

Artificial Intelligence · Computer Science 2024-10-25 Graziano A. Manduzio , Federico A. Galatolo , Mario G. C. A. Cimino , Enzo Pasquale Scilingo , Lorenzo Cominelli

Evaluating the Deductive Competence of Large Language Models

The development of highly fluent large language models (LLMs) has prompted increased interest in assessing their reasoning and problem-solving capabilities. We investigate whether several LLMs can solve a classic type of deductive reasoning…

Computation and Language · Computer Science 2024-04-16 Spencer M. Seals , Valerie L. Shalin