Related papers: Large Language Models: A Mathematical Formulation

Large Language Models: An Applied Econometric Framework

Large language models (LLMs) enable researchers to analyze text at unprecedented scale and minimal cost. Researchers can now revisit old questions and tackle novel ones with rich data. We provide an econometric framework for realizing this…

Econometrics · Economics 2025-12-08 Jens Ludwig, Sendhil Mullainathan, Ashesh Rambachan

Beyond Next Word Prediction: Developing Comprehensive Evaluation Frameworks for measuring LLM performance on real world applications

While Large Language Models (LLMs) are fundamentally next-token prediction systems, their practical applications extend far beyond this basic function. From natural language processing and text generation to conversational assistants and…

Computation and Language · Computer Science 2025-03-10 Vishakha Agrawal, Archie Chaudhury, Shreya Agrawal

A Law of Next-Token Prediction in Large Language Models

Large language models (LLMs) have been widely employed across various application domains, yet their black-box nature poses significant challenges to understanding how these models process input data internally to make predictions. In this…

Machine Learning · Computer Science 2025-09-03 Hangfeng He, Weijie J. Su

A Principled Framework for Knowledge-enhanced Large Language Model

Large Language Models (LLMs) are versatile, yet they often falter in tasks requiring deep and reliable reasoning due to issues like hallucinations, limiting their applicability in critical scenarios. This paper introduces a rigorously…

Computation and Language · Computer Science 2023-11-21 Saizhuo Wang, Zhihan Liu, Zhaoran Wang, Jian Guo

Survey of different Large Language Model Architectures: Trends, Benchmarks, and Challenges

Large Language Models (LLMs) represent a class of deep learning models adept at understanding natural language and generating coherent responses to various prompts or queries. These models far exceed the complexity of conventional neural…

Machine Learning · Computer Science 2024-12-05 Minghao Shao, Abdul Basit, Ramesh Karri, Muhammad Shafique

Optimal Decision Making Through Scenario Simulations Using Large Language Models

The rapid evolution of Large Language Models (LLMs) has markedly expanded their application across diverse domains, transforming how complex problems are approached and solved. Initially conceived to predict subsequent words in texts, these…

Artificial Intelligence · Computer Science 2024-07-11 Sumedh Rasal, E. J. Hauer

Benchmarking Large Language Models for Molecule Prediction Tasks

Large Language Models (LLMs) stand at the forefront of a number of Natural Language Processing (NLP) tasks. Despite the widespread adoption of LLMs in NLP, much of their potential in broader fields remains largely unexplored, and…

Machine Learning · Computer Science 2024-03-11 Zhiqiang Zhong, Kuangyu Zhou, Davide Mottin

Teaching LLMs to Think Mathematically: A Critical Study of Decision-Making via Optimization

This paper investigates the capabilities of large language models (LLMs) in formulating and solving decision-making problems using mathematical programming. We first conduct a systematic review and meta-analysis of recent literature to…

Artificial Intelligence · Computer Science 2025-08-26 Mohammad J. Abdel-Rahman, Yasmeen Alslman, Dania Refai, Amro Saleh, Malik A. Abu Loha, Mohammad Yahya Hamed

What do Large Language Models know about materials?

Large Language Models (LLMs) are increasingly applied in the fields of mechanical engineering and materials science. As models that establish connections through the interface of language, LLMs can be applied for step-wise reasoning through…

Applied Physics · Physics 2025-07-22 Adrian Ehrenhofer, Thomas Wallmersperger, Gianaurelio Cuniberti

Understanding the Capabilities of Large Language Models for Automated Planning

Automated planning is concerned with developing efficient algorithms to generate plans or sequences of actions to achieve a specific goal in a given environment. Emerging Large Language Models (LLMs) can answer questions, write high-quality…

Artificial Intelligence · Computer Science 2023-05-26 Vishal Pallagani, Bharath Muppasani, Keerthiram Murugesan, Francesca Rossi, Biplav Srivastava, Lior Horesh, Francesco Fabiano, Andrea Loreggia

Algorithmic Thinking Theory

Large language models (LLMs) have proven to be highly effective for solving complex reasoning tasks. Surprisingly, their capabilities can often be improved by iterating on previously generated solutions. In this context, a reasoning plan…

Artificial Intelligence · Computer Science 2025-12-05 MohammadHossein Bateni, Vincent Cohen-Addad, Yuzhou Gu, Silvio Lattanzi, Simon Meierhans, Christopher Mohri

Explaining Large Language Model-Based Neural Semantic Parsers (Student Abstract)

While large language models (LLMs) have demonstrated strong capability in structured prediction tasks such as semantic parsing, few amounts of research have explored the underlying mechanisms of their success. Our work studies different…

Computation and Language · Computer Science 2023-02-01 Daking Rai, Yilun Zhou, Bailin Wang, Ziyu Yao

Advancing bioinformatics with large language models: components, applications and perspectives

Large language models (LLMs) are a class of artificial intelligence models based on deep learning, which have great performance in various tasks, especially in natural language processing (NLP). Large language models typically consist of…

Quantitative Methods · Quantitative Biology 2025-02-04 Jiajia Liu, Mengyuan Yang, Yankai Yu, Haixia Xu, Tiangang Wang, Kang Li, Xiaobo Zhou

LLM Reasoning Engine: Specialized Training for Enhanced Mathematical Reasoning

Large Language Models (LLMs) have shown remarkable performance in various natural language processing tasks but face challenges in mathematical reasoning, where complex problem-solving requires both linguistic understanding and mathematical…

Computation and Language · Computer Science 2025-03-20 Shuguang Chen, Guang Lin

Do Large Language Models (Really) Need Statistical Foundations?

Large language models (LLMs) represent a new paradigm for processing unstructured data, with applications across an unprecedented range of domains. In this paper, we address, through two arguments, whether the development and application of…

Methodology · Statistics 2026-02-03 Weijie Su

Designing Algorithms Empowered by Language Models: An Analytical Framework, Case Studies, and Insights

This work presents an analytical framework for the design and analysis of LLM-based algorithms, i.e., algorithms that contain one or multiple calls of large language models (LLMs) as sub-routines and critically rely on the capabilities of…

Machine Learning · Computer Science 2025-10-14 Yanxi Chen, Yaliang Li, Bolin Ding, Jingren Zhou

A Primer on Large Language Models and their Limitations

This paper provides a primer on Large Language Models (LLMs) and identifies their strengths, limitations, applications and research directions. It is intended to be useful to those in academia and industry who are interested in gaining an…

Computation and Language · Computer Science 2024-12-09 Sandra Johnson, David Hyland-Wood

Large Language Models for Mathematical Reasoning: Progresses and Challenges

Mathematical reasoning serves as a cornerstone for assessing the fundamental cognitive capabilities of human intelligence. In recent times, there has been a notable surge in the development of Large Language Models (LLMs) geared towards the…

Computation and Language · Computer Science 2024-09-18 Janice Ahn, Rishu Verma, Renze Lou, Di Liu, Rui Zhang, Wenpeng Yin

Multilingual Large Language Models: A Systematic Survey

This paper provides a comprehensive survey of the latest research on multilingual large language models (MLLMs). MLLMs not only are able to understand and generate language across linguistic boundaries, but also represent an important…

Computation and Language · Computer Science 2024-11-20 Shaolin Zhu, Supryadi, Shaoyang Xu, Haoran Sun, Leiyu Pan, Menglong Cui, Jiangcun Du, Renren Jin, António Branco, Deyi Xiong

Can a Large Language Model Learn Matrix Functions In Context?

Large Language Models (LLMs) have demonstrated the ability to solve complex tasks through In-Context Learning (ICL), where models learn from a few input-output pairs without explicit fine-tuning. In this paper, we explore the capacity of…

Machine Learning · Computer Science 2024-11-26 Paimon Goulart, Evangelos E. Papalexakis