Related papers: Collaborative Performance Prediction for Large Lan…

Latent Performance Profiling of Large Language Models

Large language models (LLMs) frequently achieve impressive scores on standardized benchmarks, yet accuracy alone offers a limited view of their capabilities. Evaluating open-source LLMs through leaderboards faces persistent issues like data…

Computation and Language · Computer Science 2026-05-29 Tanmoy Chakraborty , Ayan Sengupta , Suparna Bhattacharya , Partha Pratim Chakrabarti , Amlan Chakrabarti , Supratik Chakraborty , Partha Pratim Das , Lipika Dey , Richa Singh , Mayank Vatsa

Causal Post-Processing of Predictive Models

Organizations increasingly rely on predictive models to decide who should be targeted for interventions, such as marketing campaigns, customer retention offers, or medical treatments. Yet these models are usually built to predict outcomes…

Machine Learning · Statistics 2025-10-24 Carlos Fernández-Loría , Yanfang Hou , Foster Provost , Jennifer Hill

Conformal Predictive Programming for Chance Constrained Optimization

We propose conformal predictive programming (CPP), a framework to solve chance constrained optimization problems, i.e., optimization problems with constraints that are functions of random variables. CPP utilizes samples from these random…

Systems and Control · Electrical Eng. & Systems 2025-05-06 Yiqi Zhao , Xinyi Yu , Matteo Sesia , Jyotirmoy V. Deshmukh , Lars Lindemann

Optimization and Scalability of Collaborative Filtering Algorithms in Large Language Models

With the rapid development of large language models (LLMs) and the growing demand for personalized content, recommendation systems have become critical in enhancing user experience and driving engagement. Collaborative filtering algorithms,…

Artificial Intelligence · Computer Science 2024-12-30 Haowei Yang , Longfei Yun , Jinghan Cao , Qingyi Lu , Yuming Tu

Scaling Laws for Predicting Downstream Performance in LLMs

Precise estimation of downstream performance in large language models (LLMs) prior to training is essential for guiding their development process. Scaling laws analysis utilizes the statistics of a series of significantly smaller sampling…

Computation and Language · Computer Science 2025-04-09 Yangyi Chen , Binxuan Huang , Yifan Gao , Zhengyang Wang , Jingfeng Yang , Heng Ji

Unveiling Downstream Performance Scaling of LLMs: A Clustering-Based Perspective

The escalating scale and cost of Large Language Models (LLMs) training necessitate accurate pre-training prediction of downstream task performance for comprehensive understanding of scaling properties. This is challenged by: 1) the…

Computation and Language · Computer Science 2026-03-10 Chengyin Xu , Kaiyuan Chen , Xiao Li , Ke Shen , Chenggang Li

Predicting Task Performance with Context-aware Scaling Laws

Scaling laws have transformed our understanding of large language models by linking upstream metrics like cross-entropy loss to design factors such as model size, training data, and compute. However, these conventional laws fail to capture…

Computation and Language · Computer Science 2025-10-17 Kyle Montgomery , David Park , Jianhong Tu , Michael Bendersky , Beliz Gunel , Dawn Song , Chenguang Wang

Exploring Continual Learning for Code Generation Models

Large-scale code generation models such as Codex and CodeT5 have achieved impressive performance. However, libraries are upgraded or deprecated very frequently and re-training large-scale language models is computationally expensive.…

Machine Learning · Computer Science 2023-07-06 Prateek Yadav , Qing Sun , Hantian Ding , Xiaopeng Li , Dejiao Zhang , Ming Tan , Xiaofei Ma , Parminder Bhatia , Ramesh Nallapati , Murali Krishna Ramanathan , Mohit Bansal , Bing Xiang

Efficient LLM Collaboration via Planning

Recently, large language models (LLMs) have demonstrated strong performance, ranging from simple to complex tasks. However, while large models achieve remarkable results across diverse tasks, they often incur substantial monetary inference…

Artificial Intelligence · Computer Science 2026-05-12 Byeongchan Lee , Jonghoon Lee , Dongyoung Kim , Jaehyung Kim , Kyungjoon Park , Dongjun Lee , Jinwoo Shin

BERTology of Molecular Property Prediction

Chemical language models (CLMs) have emerged as promising competitors to popular classical machine learning models for molecular property prediction (MPP) tasks. However, an increasing number of studies have reported inconsistent and…

Machine Learning · Computer Science 2026-03-17 Mohammad Mostafanejad , Paul Saxe , T. Daniel Crawford

LLM Performance Predictors are good initializers for Architecture Search

In this work, we utilize Large Language Models (LLMs) for a novel use case: constructing Performance Predictors (PP) that estimate the performance of specific deep neural network architectures on downstream tasks. We create PP prompts for…

Computation and Language · Computer Science 2024-08-09 Ganesh Jawahar , Muhammad Abdul-Mageed , Laks V. S. Lakshmanan , Dujian Ding

LLM Performance Predictors: Learning When to Escalate in Hybrid Human-AI Moderation Systems

As LLMs are increasingly integrated into human-in-the-loop content moderation systems, a central challenge is deciding when their outputs can be trusted versus when escalation for human review is preferable. We propose a novel framework for…

Artificial Intelligence · Computer Science 2026-01-13 Or Bachar , Or Levi , Sardhendu Mishra , Adi Levi , Manpreet Singh Minhas , Justin Miller , Omer Ben-Porat , Eilon Sheetrit , Jonathan Morra

Breaking Language Barriers: Cross-Lingual Continual Pre-Training at Scale

In recent years, Large Language Models (LLMs) have made significant strides towards Artificial General Intelligence. However, training these models from scratch requires substantial computational resources and vast amounts of text data. In…

Computation and Language · Computer Science 2024-10-03 Wenzhen Zheng , Wenbo Pan , Xu Xu , Libo Qin , Li Yue , Ming Zhou

Do Large Language Models Understand Performance Optimization?

Large Language Models (LLMs) have emerged as powerful tools for software development tasks such as code completion, translation, and optimization. However, their ability to generate efficient and correct code, particularly in complex…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-03-19 Bowen Cui , Tejas Ramesh , Oscar Hernandez , Keren Zhou

Scaling Laws Under the Microscope: Predicting Transformer Performance from Small Scale Experiments

Neural scaling laws define a predictable relationship between a model's parameter count and its performance after training in the form of a power law. However, most research to date has not explicitly investigated whether scaling laws can…

Computation and Language · Computer Science 2022-10-19 Maor Ivgi , Yair Carmon , Jonathan Berant

CodePMP: Scalable Preference Model Pretraining for Large Language Model Reasoning

Large language models (LLMs) have made significant progress in natural language understanding and generation, driven by scalable pretraining and advanced finetuning. However, enhancing reasoning abilities in LLMs, particularly via…

Artificial Intelligence · Computer Science 2025-05-30 Huimu Yu , Xing Wu , Haotian Xu , Debing Zhang , Songlin Hu

Efficient Evaluation of Large Language Models via Collaborative Filtering

With the development of Large Language Models (LLMs), numerous benchmarks have been proposed to measure and compare the capabilities of different LLMs. However, evaluating LLMs is costly due to the large number of test instances and their…

Computation and Language · Computer Science 2025-04-15 Xu-Xiang Zhong , Chao Yi , Han-Jia Ye

CoFineLLM: Conformal Finetuning of LLMs for Language-Instructed Robot Planning

Large Language Models (LLMs) have recently emerged as planners for language-instructed agents, generating sequences of actions to accomplish natural language tasks. However, their reliability remains a challenge, especially in long-horizon…

Robotics · Computer Science 2025-11-11 Jun Wang , Yevgeniy Vorobeychik , Yiannis Kantaros

S2LPP: Small-to-Large Prompt Prediction across LLMs

The performance of pre-trained Large Language Models (LLMs) is often sensitive to nuances in prompt templates, requiring careful prompt engineering, adding costs in terms of computing and human effort. In this study, we present experiments…

Computation and Language · Computer Science 2025-05-27 Liang Cheng , Tianyi LI , Zhaowei Wang , Mark Steedman

How Well Do Large-Scale Chemical Language Models Transfer to Downstream Tasks?

Chemical Language Models (CLMs) pre-trained on large scale molecular data are widely used for molecular property prediction. However, the common belief that increasing training resources such as model size, dataset size, and training…

Machine Learning · Computer Science 2026-05-14 Tatsuya Sagawa , Ryosuke Kojima