Related papers: Collaborative Performance Prediction for Large Lan…

200 papers

Large language models (LLMs) frequently achieve impressive scores on standardized benchmarks, yet accuracy alone offers a limited view of their capabilities. Evaluating open-source LLMs through leaderboards faces persistent issues like data…

Organizations increasingly rely on predictive models to decide who should be targeted for interventions, such as marketing campaigns, customer retention offers, or medical treatments. Yet these models are usually built to predict outcomes…

Machine Learning · Statistics 2025-10-24 Carlos Fernández-Loría, Yanfang Hou, Foster Provost, Jennifer Hill

We propose conformal predictive programming (CPP), a framework to solve chance constrained optimization problems, i.e., optimization problems with constraints that are functions of random variables. CPP utilizes samples from these random…

Systems and Control · Electrical Eng. & Systems 2025-05-06 Yiqi Zhao, Xinyi Yu, Matteo Sesia, Jyotirmoy V. Deshmukh, Lars Lindemann

With the rapid development of large language models (LLMs) and the growing demand for personalized content, recommendation systems have become critical in enhancing user experience and driving engagement. Collaborative filtering algorithms,…

Artificial Intelligence · Computer Science 2024-12-30 Haowei Yang, Longfei Yun, Jinghan Cao, Qingyi Lu, Yuming Tu

Precise estimation of downstream performance in large language models (LLMs) prior to training is essential for guiding their development process. Scaling laws analysis utilizes the statistics of a series of significantly smaller sampling…

Computation and Language · Computer Science 2025-04-09 Yangyi Chen, Binxuan Huang, Yifan Gao, Zhengyang Wang, Jingfeng Yang, Heng Ji

The escalating scale and cost of Large Language Models (LLMs) training necessitate accurate pre-training prediction of downstream task performance for comprehensive understanding of scaling properties. This is challenged by: 1) the…

Computation and Language · Computer Science 2026-03-10 Chengyin Xu, Kaiyuan Chen, Xiao Li, Ke Shen, Chenggang Li

Scaling laws have transformed our understanding of large language models by linking upstream metrics like cross-entropy loss to design factors such as model size, training data, and compute. However, these conventional laws fail to capture…

Computation and Language · Computer Science 2025-10-17 Kyle Montgomery, David Park, Jianhong Tu, Michael Bendersky, Beliz Gunel, Dawn Song, Chenguang Wang

Large-scale code generation models such as Codex and CodeT5 have achieved impressive performance. However, libraries are upgraded or deprecated very frequently and re-training large-scale language models is computationally expensive.…

Recently, large language models (LLMs) have demonstrated strong performance, ranging from simple to complex tasks. However, while large models achieve remarkable results across diverse tasks, they often incur substantial monetary inference…

Artificial Intelligence · Computer Science 2026-05-12 Byeongchan Lee, Jonghoon Lee, Dongyoung Kim, Jaehyung Kim, Kyungjoon Park, Dongjun Lee, Jinwoo Shin

Chemical language models (CLMs) have emerged as promising competitors to popular classical machine learning models for molecular property prediction (MPP) tasks. However, an increasing number of studies have reported inconsistent and…

Machine Learning · Computer Science 2026-03-17 Mohammad Mostafanejad, Paul Saxe, T. Daniel Crawford

In this work, we utilize Large Language Models (LLMs) for a novel use case: constructing Performance Predictors (PP) that estimate the performance of specific deep neural network architectures on downstream tasks. We create PP prompts for…

Computation and Language · Computer Science 2024-08-09 Ganesh Jawahar, Muhammad Abdul-Mageed, Laks V. S. Lakshmanan, Dujian Ding

As LLMs are increasingly integrated into human-in-the-loop content moderation systems, a central challenge is deciding when their outputs can be trusted versus when escalation for human review is preferable. We propose a novel framework for…

Artificial Intelligence · Computer Science 2026-01-13 Or Bachar, Or Levi, Sardhendu Mishra, Adi Levi, Manpreet Singh Minhas, Justin Miller, Omer Ben-Porat, Eilon Sheetrit, Jonathan Morra

In recent years, Large Language Models (LLMs) have made significant strides towards Artificial General Intelligence. However, training these models from scratch requires substantial computational resources and vast amounts of text data. In…

Computation and Language · Computer Science 2024-10-03 Wenzhen Zheng, Wenbo Pan, Xu Xu, Libo Qin, Li Yue, Ming Zhou

Large Language Models (LLMs) have emerged as powerful tools for software development tasks such as code completion, translation, and optimization. However, their ability to generate efficient and correct code, particularly in complex…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-03-19 Bowen Cui, Tejas Ramesh, Oscar Hernandez, Keren Zhou

Neural scaling laws define a predictable relationship between a model's parameter count and its performance after training in the form of a power law. However, most research to date has not explicitly investigated whether scaling laws can…

Computation and Language · Computer Science 2022-10-19 Maor Ivgi, Yair Carmon, Jonathan Berant

Large language models (LLMs) have made significant progress in natural language understanding and generation, driven by scalable pretraining and advanced finetuning. However, enhancing reasoning abilities in LLMs, particularly via…

Artificial Intelligence · Computer Science 2025-05-30 Huimu Yu, Xing Wu, Haotian Xu, Debing Zhang, Songlin Hu

With the development of Large Language Models (LLMs), numerous benchmarks have been proposed to measure and compare the capabilities of different LLMs. However, evaluating LLMs is costly due to the large number of test instances and their…

Computation and Language · Computer Science 2025-04-15 Xu-Xiang Zhong, Chao Yi, Han-Jia Ye

Large Language Models (LLMs) have recently emerged as planners for language-instructed agents, generating sequences of actions to accomplish natural language tasks. However, their reliability remains a challenge, especially in long-horizon…

Robotics · Computer Science 2025-11-11 Jun Wang, Yevgeniy Vorobeychik, Yiannis Kantaros

The performance of pre-trained Large Language Models (LLMs) is often sensitive to nuances in prompt templates, requiring careful prompt engineering, adding costs in terms of computing and human effort. In this study, we present experiments…

Computation and Language · Computer Science 2025-05-27 Liang Cheng, Tianyi LI, Zhaowei Wang, Mark Steedman

Chemical Language Models (CLMs) pre-trained on large scale molecular data are widely used for molecular property prediction. However, the common belief that increasing training resources such as model size, dataset size, and training…

Machine Learning · Computer Science 2026-05-14 Tatsuya Sagawa, Ryosuke Kojima