Related papers: ComplexityNet: Increasing LLM Inference Efficiency…

AdaptiveLLM: A Framework for Selecting Optimal Cost-Efficient LLM for Code-Generation Based on CoT Length

While Large Language Models (LLMs) have significantly advanced code generation efficiency, they face inherent challenges in balancing performance and inference costs across diverse programming tasks. Dynamically selecting the optimal LLM…

Software Engineering · Computer Science 2025-06-13 Junhang Cheng , Fang Liu , Chengru Wu , Li Zhang

Compressing Context to Enhance Inference Efficiency of Large Language Models

Large language models (LLMs) achieved remarkable performance across various tasks. However, they face challenges in managing long documents and extended conversations, due to significantly increased computational requirements, both in…

Computation and Language · Computer Science 2023-10-11 Yucheng Li , Bo Dong , Chenghua Lin , Frank Guerin

Compressing Large Language Models with Automated Sub-Network Search

Large Language Models (LLMs) demonstrate exceptional reasoning abilities, enabling strong generalization across diverse tasks such as commonsense reasoning and instruction following. However, as LLMs scale, inference costs become…

Computation and Language · Computer Science 2025-02-06 Rhea Sanjay Sukthanker , Benedikt Staffler , Frank Hutter , Aaron Klein

Efficiently Estimating Data Efficiency for Language Model Fine-tuning

While large language models (LLMs) demonstrate reasonable zero-shot capability across many downstream tasks, fine-tuning is a common practice to improve their performance. However, a task's data efficiency--i.e., the number of fine-tuning…

Machine Learning · Computer Science 2026-01-01 Gyung Hyun Je , Colin Raffel

Towards Better Multi-task Learning: A Framework for Optimizing Dataset Combinations in Large Language Models

To efficiently select optimal dataset combinations for enhancing multi-task learning (MTL) performance in large language models, we proposed a novel framework that leverages a neural network to predict the best dataset combinations. The…

Computation and Language · Computer Science 2025-05-06 Zaifu Zhan , Rui Zhang

LinkGPT: Teaching Large Language Models To Predict Missing Links

Large Language Models (LLMs) have shown promising results on various language and vision tasks. Recently, there has been growing interest in applying LLMs to graph-based tasks, particularly on Text-Attributed Graphs (TAGs). However, most…

Machine Learning · Computer Science 2024-06-10 Zhongmou He , Jing Zhu , Shengyi Qian , Joyce Chai , Danai Koutra

ThriftLLM: On Cost-Effective Selection of Large Language Models for Classification Queries

In recent years, large language models (LLMs) have demonstrated remarkable capabilities in comprehending and generating natural language content, attracting widespread attention in both industry and academia. An increasing number of…

Databases · Computer Science 2026-01-08 Keke Huang , Yimin Shi , Dujian Ding , Yifei Li , Yang Fei , Laks Lakshmanan , Xiaokui Xiao

CodeComplex: Dataset for Worst-Case Time Complexity Prediction

Reasoning ability of Large Language Models (LLMs) is a crucial ability, especially in complex decision-making tasks. One significant task to show LLMs' reasoning capability is code time complexity prediction, which involves various…

Software Engineering · Computer Science 2024-12-25 Seung-Yeop Baik , Joonghyuk Hahn , Jungin Kim , Mingi Jeon , Aditi , Yo-Sub Han , Sang-Ki Ko

Complexity-aware fine-tuning

General-purpose Large Language Models (LLMs) are frequently fine-tuned through supervised fine-tuning (SFT) to enhance performance in specific domains. Better results can be achieved by distilling the chain-of-thought of a larger model at…

Machine Learning · Computer Science 2026-03-24 Andrey Goncharov , Daniil Vyazhev , Petr Sychev , Edvard Khalafyan , Alexey Zaytsev

Text Alignment Is An Efficient Unified Model for Massive NLP Tasks

Large language models (LLMs), typically designed as a function of next-word prediction, have excelled across extensive NLP tasks. Despite the generality, next-word prediction is often not an efficient formulation for many of the tasks,…

Computation and Language · Computer Science 2023-11-03 Yuheng Zha , Yichi Yang , Ruichen Li , Zhiting Hu

Beyond Accuracy: A Cognitive Load Framework for Mapping the Capability Boundaries of Tool-use Agents

The ability of Large Language Models (LLMs) to use external tools unlocks powerful real-world interactions, making rigorous evaluation essential. However, current benchmarks primarily report final accuracy, revealing what models can do but…

Computation and Language · Computer Science 2026-01-29 Qihao Wang , Yue Hu , Mingzhe Lu , Jiayue Wu , Yanbing Liu , Yuanmin Tang

On the use of LLMs to generate a dataset of Neural Networks

Neural networks are increasingly used to support decision-making. To verify their reliability and adaptability, researchers and practitioners have proposed a variety of tools and methods for tasks such as NN code verification, refactoring,…

Machine Learning · Computer Science 2026-02-05 Nadia Daoudi , Jordi Cabot

A Survey on Efficient Inference for Large Language Models

Large Language Models (LLMs) have attracted extensive attention due to their remarkable performance across various tasks. However, the substantial computational and memory requirements of LLM inference pose challenges for deployment in…

Computation and Language · Computer Science 2024-07-22 Zixuan Zhou , Xuefei Ning , Ke Hong , Tianyu Fu , Jiaming Xu , Shiyao Li , Yuming Lou , Luning Wang , Zhihang Yuan , Xiuhong Li , Shengen Yan , Guohao Dai , Xiao-Ping Zhang , Yuhan Dong , Yu Wang

Optimising Language Models for Downstream Tasks: A Post-Training Perspective

Language models (LMs) have demonstrated remarkable capabilities in NLP, yet adapting them efficiently and robustly to specific tasks remains challenging. As their scale and complexity grow, fine-tuning LMs on labelled data often…

Computation and Language · Computer Science 2025-06-27 Zhengyan Shi

ToolNet: Connecting Large Language Models with Massive Tools via Tool Graph

While achieving remarkable progress in a broad range of tasks, large language models (LLMs) remain significantly limited in properly using massive external tools. Existing in-context learning approaches simply format tools into a list of…

Artificial Intelligence · Computer Science 2024-03-05 Xukun Liu , Zhiyuan Peng , Xiaoyuan Yi , Xing Xie , Lirong Xiang , Yuchen Liu , Dongkuan Xu

Information Capacity: Evaluating the Efficiency of Large Language Models via Text Compression

Recent years have witnessed the rapid advancements of large language models (LLMs) and their expanding applications, leading to soaring demands for computational resources. The widespread adoption of test-time scaling further intensifies…

Artificial Intelligence · Computer Science 2026-03-11 Cheng Yuan , Jiawei Shao , Xuelong Li

When Prompts Go Wrong: Evaluating Code Model Robustness to Ambiguous, Contradictory, and Incomplete Task Descriptions

Large Language Models (LLMs) have demonstrated impressive performance in code generation tasks under idealized conditions, where task descriptions are clear and precise. However, in practice, task descriptions frequently exhibit ambiguity,…

Software Engineering · Computer Science 2025-07-29 Maya Larbi , Amal Akli , Mike Papadakis , Rihab Bouyousfi , Maxime Cordy , Federica Sarro , Yves Le Traon

Task Scheduling for Efficient Inference of Large Language Models on Single Moderate GPU Systems

Large language models~(LLMs) are known for their high demand on computing resources and memory due to their substantial model size, which leads to inefficient inference on moderate GPU systems. Techniques like quantization or pruning can…

Computational Engineering, Finance, and Science · Computer Science 2024-11-26 Wenxiang Lin , Xinglin Pan , Shaohuai Shi , Xuan Wang , Xiaowen Chu

HARDMath: A Benchmark Dataset for Challenging Problems in Applied Mathematics

Advanced applied mathematics problems are underrepresented in existing Large Language Model (LLM) benchmark datasets. To address this, we introduce HARDMath, a dataset inspired by a graduate course on asymptotic methods, featuring…

Machine Learning · Computer Science 2024-12-17 Jingxuan Fan , Sarah Martinson , Erik Y. Wang , Kaylie Hausknecht , Jonah Brenner , Danxian Liu , Nianli Peng , Corey Wang , Michael P. Brenner

Utilizing Large Language Models for Machine Learning Explainability

This study explores the explainability capabilities of large language models (LLMs), when employed to autonomously generate machine learning (ML) solutions. We examine two classification tasks: (i) a binary classification problem focused on…

Machine Learning · Computer Science 2025-10-09 Alexandros Vassiliades , Nikolaos Polatidis , Stamatios Samaras , Sotiris Diplaris , Ignacio Cabrera Martin , Yannis Manolopoulos , Stefanos Vrochidis , Ioannis Kompatsiaris