English
Related papers

Related papers: Quantifying Generalization Complexity for Large La…

200 papers

The impressive capabilities of large language models (LLMs) have sparked debate over whether these models genuinely generalize to unseen tasks or predominantly rely on memorizing vast amounts of pretraining data. To explore this issue, we…

Computation and Language · Computer Science 2025-03-04 Xinyi Wang , Antonis Antoniades , Yanai Elazar , Alfonso Amayuelas , Alon Albalak , Kexun Zhang , William Yang Wang

We investigate how well large language models (LLMs) generalize across different task difficulties, a key question for effective data curation and evaluation. Existing research is mixed regarding whether training on easier or harder data…

Computation and Language · Computer Science 2025-11-27 Yeganeh Kordi , Nihal V. Nayak , Max Zuo , Ilana Nguyen , Stephen H. Bach

Why do larger language models generalize better? To investigate this question, we develop generalization bounds on the pretraining objective of large language models (LLMs) in the compute-optimal regime, as described by the Chinchilla…

Machine Learning · Computer Science 2025-04-22 Marc Finzi , Sanyam Kapoor , Diego Granziol , Anming Gu , Christopher De Sa , J. Zico Kolter , Andrew Gordon Wilson

The unprecedented performance of large language models (LLMs) necessitates improvements in evaluations. Rather than merely exploring the breadth of LLM abilities, we believe meticulous and thoughtful designs are essential to thorough,…

Large language models (LLMs) demonstrate remarkable performance on knowledge-intensive tasks, suggesting that real-world knowledge is encoded in their model parameters. However, besides explorations on a few probing tasks in limited…

Computation and Language · Computer Science 2024-03-26 Yuyang Bai , Shangbin Feng , Vidhisha Balachandran , Zhaoxuan Tan , Shiqi Lou , Tianxing He , Yulia Tsvetkov

Large language models (LLMs) have demonstrated remarkable performance on various medical benchmarks, but their capabilities across different cognitive levels remain underexplored. Inspired by Bloom's Taxonomy, we propose a…

Computation and Language · Computer Science 2025-06-11 Yuxuan Zhou , Xien Liu , Chenwei Yan , Chen Ning , Xiao Zhang , Boxun Li , Xiangling Fu , Shijin Wang , Guoping Hu , Yu Wang , Ji Wu

Advancement in Large Language Models (LLMs) reasoning capabilities enables them to solve scientific problems with enhanced efficacy. Thereby, a high-quality benchmark for comprehensive and appropriate assessment holds significance, while…

Large Language Models (LLMs) have revolutionized both general natural language processing and domain-specific applications such as code synthesis, legal reasoning, and finance. However, while prior studies have explored individual model…

Software Engineering · Computer Science 2025-12-05 Gunjan Das , Paheli Bhattacharya , Rishabh Gupta

While Large language models (LLMs) have become excellent writing assistants, they still struggle with quotation generation. This is because they either hallucinate when providing factual quotations or fail to provide quotes that exceed…

Computation and Language · Computer Science 2025-02-21 Jin Xiao , Bowei Zhang , Qianyu He , Jiaqing Liang , Feng Wei , Jinglei Chen , Zujie Liang , Deqing Yang , Yanghua Xiao

We assess how the code reasoning abilities of large language models (LLMs) generalize to different kinds of programs. We present techniques for obtaining in- and out-of-distribution programs with different characteristics: code sampled from…

Software Engineering · Computer Science 2025-04-09 Rem Yang , Julian Dai , Nikos Vasilakis , Martin Rinard

In-context learning (ICL) is now a common method for teaching large language models (LLMs) new tasks: given labeled examples in the input context, the LLM learns to perform the task without weight updates. Do models guided via ICL infer the…

Computation and Language · Computer Science 2024-04-11 Aaron Mueller , Albert Webson , Jackson Petty , Tal Linzen

Increasing the number of parameters in large language models (LLMs) usually improves performance in downstream tasks but raises compute and memory costs, making deployment difficult in resource-limited settings. Quantization techniques,…

Computation and Language · Computer Science 2024-06-07 Renren Jin , Jiangcun Du , Wuwei Huang , Wei Liu , Jian Luan , Bin Wang , Deyi Xiong

The rapid progress of Large Language Models (LLMs) has transformed natural language processing and broadened its impact across research and society. Yet, systematic evaluation of these models, especially for languages beyond English,…

Large Language Models (LLMs) display strikingly different generalization behaviors: supervised fine-tuning (SFT) often narrows capability, whereas reinforcement-learning (RL) tuning tends to preserve it. The reasons behind this divergence…

Machine Learning · Computer Science 2026-01-01 Haoyue Bai , Yiyou Sun , Wenjie Hu , Shi Qiu , Maggie Ziyu Huan , Peiyang Song , Robert Nowak , Dawn Song

We introduce KoLasSimpleQA, the first benchmark evaluating the multilingual factual ability of Large Language Models (LLMs). Inspired by existing research, we created the question set with features such as single knowledge point coverage,…

Computation and Language · Computer Science 2025-05-23 Bowen Jiang , Runchuan Zhu , Jiang Wu , Zinco Jiang , Yifan He , Junyuan Gao , Jia Yu , Rui Min , Yinfan Wang , Haote Yang , Songyang Zhang , Dahua Lin , Lijun Wu , Conghui He

Large Language Models (LLMs) have demonstrated impressive capabilities across a range of scientific tasks including mathematics, physics, and chemistry. Despite their successes, the effectiveness of LLMs in handling complex statistical…

Computation and Language · Computer Science 2024-10-11 Yizhang Zhu , Shiyin Du , Boyan Li , Yuyu Luo , Nan Tang

Large Language Models (LLMs) are advancing at an amazing speed and have become indispensable across academia, industry, and daily applications. To keep pace with the status quo, this survey probes the core challenges that the rise of LLMs…

The Multimodal Large Language Model (MLLM) is currently experiencing rapid growth, driven by the advanced capabilities of LLMs. Unlike earlier specialists, existing MLLMs are evolving towards a Multimodal Generalist paradigm. Initially…

Large language models (LLMs) are incredible and versatile tools for text-based tasks that have enabled countless, previously unimaginable, applications. Retrieval models, in contrast, have not yet seen such capable general-purpose models…

Information Retrieval · Computer Science 2025-09-10 Julian Killingback , Hamed Zamani

Recent advancements in large language models (LLMs) are propelling us toward artificial general intelligence with their remarkable emergent abilities and reasoning capabilities. However, the substantial computational and memory requirements…

Machine Learning · Computer Science 2024-10-10 Ruihao Gong , Yang Yong , Shiqiao Gu , Yushi Huang , Chengtao Lv , Yunchen Zhang , Xianglong Liu , Dacheng Tao
‹ Prev 1 2 3 10 Next ›