English
Related papers

Related papers: Compression Represents Intelligence Linearly

200 papers

Understanding the relationship between data compression and the capabilities of Large Language Models (LLMs) is crucial, especially in specialized domains like code intelligence. Prior work posited a linear relationship between compression…

Computation and Language · Computer Science 2026-03-27 Shijie Xuyang , Xianzhen Luo , Zheng Chu , Houyi Li , Siming Huang , Qiufeng Wang , Wanxiang Che , Qingfu Zhu , Shuigeng Zhou

We conceptualize the process of understanding as information compression, and propose a method for ranking large language models (LLMs) based on lossless data compression. We demonstrate the equivalence of compression length under…

Artificial Intelligence · Computer Science 2024-06-21 Peijia Guo , Ziguang Li , Haibo Hu , Chao Huang , Ming Li , Rui Zhang

Recent years have witnessed the rapid advancements of large language models (LLMs) and their expanding applications, leading to soaring demands for computational resources. The widespread adoption of test-time scaling further intensifies…

Artificial Intelligence · Computer Science 2026-03-11 Cheng Yuan , Jiawei Shao , Xuelong Li

Despite the increasing prevalence of large language models (LLMs), we still have a limited understanding of how their representational spaces are structured. This limits our ability to interpret how and what they learn or relate them to…

We have recently witnessed that ``Intelligence" and `` Compression" are the two sides of the same coin, where the language large model (LLM) with unprecedented intelligence is a general-purpose lossless compressor for various data…

Computer Vision and Pattern Recognition · Computer Science 2024-11-25 Kecheng Chen , Pingping Zhang , Hui Liu , Jie Liu , Yibing Liu , Jiaxin Huang , Shiqi Wang , Hong Yan , Haoliang Li

Compressing Large Language Models (LLMs) often leads to reduced performance, especially for knowledge-intensive tasks. In this work, we dive into how compression damages LLMs' inherent knowledge and the possible remedies. We start by…

Computation and Language · Computer Science 2024-02-19 Duc N. M Hoang , Minsik Cho , Thomas Merth , Mohammad Rastegari , Zhangyang Wang

Recently, the concept of ``compression as intelligence'' has provided a novel informatics metric perspective for language models (LMs), emphasizing that highly structured representations signify the intelligence level of LMs. However, from…

Computation and Language · Computer Science 2025-11-07 Jianxiang Zang , Meiling Ning , Yongda Wei , Shihan Dou , Jiazheng Zhang , Nijia Mo , Binhong Li , Tao Gui , Qi Zhang , Xuanjing Huang

Compression methods, including quantization, distillation, and pruning, improve the computational efficiency of large reasoning models (LRMs). However, existing studies either fail to sufficiently compare all three compression methods on…

Machine Learning · Computer Science 2026-03-03 Nan Zhang , Eugene Kwek , Yusen Zhang , Ngoc-Hieu Nguyen , Prasenjit Mitra , Rui Zhang

Compressing large language models (LLMs), often consisting of billions of parameters, provides faster inference, smaller memory footprints, and enables local deployment. Two standard compression techniques are pruning and quantization, with…

Computation and Language · Computer Science 2023-12-05 Satya Sai Srinath Namburi , Makesh Sreedhar , Srinath Srinivasan , Frederic Sala

Recent advances in large language models (LLMs) highlight a strong connection between intelligence and compression. Learned image compression, a fundamental task in modern data compression, has made significant progress in recent years.…

Computer Vision and Pattern Recognition · Computer Science 2025-08-13 Yuqi Li , Haotian Zhang , Li Li , Dong Liu , Feng Wu

Prompt engineering enables Large Language Models (LLMs) to perform a variety of tasks. However, lengthy prompts significantly increase computational complexity and economic costs. To address this issue, we study six prompt compression…

Computation and Language · Computer Science 2025-05-02 Zheng Zhang , Jinyi Li , Yihuai Lan , Xiang Wang , Hao Wang

Recent work has shown that scaling large language models (LLMs) improves their alignment with human brain activity, yet it remains unclear what drives these gains and which representational properties are responsible. Although larger models…

Chain-of-thought prompting has emerged as a powerful technique for enabling large language models (LLMs) to solve complex reasoning tasks. However, these reasoning chains can be verbose, raising concerns about efficiency. In response,…

Computation and Language · Computer Science 2025-04-02 Ayeong Lee , Ethan Che , Tianyi Peng

We introduce compression laws for language language models (LLMs). While recent scaling laws have sought to understand how LLMs scale with respect to model size, pre-training data, and computational resources, we focus on understanding how…

Computation and Language · Computer Science 2025-04-08 Ayan Sengupta , Siddhant Chaudhary , Tanmoy Chakraborty

Working memory, or the ability to hold and manipulate information in the mind, is a critical component of human intelligence and executive functioning. It is correlated with performance on various cognitive tasks, including measures of…

Computation and Language · Computer Science 2025-12-01 Karin de Langis , Jong Inn Park , Bin Hu , Khanh Chi Le , Andreas Schramm , Michael C. Mensink , Andrew Elfenbein , Dongyeop Kang

"Compression Tells Intelligence", is supported by research in artificial intelligence, particularly concerning (multimodal) large language models (LLMs/MLLMs), where compression efficiency often correlates with improved model performance…

Computer Vision and Pattern Recognition · Computer Science 2026-01-29 Xin Jin , Jinming Liu , Yuntao Wei , Junyan Lin , Zhicheng Wang , Jianguo Huang , Xudong Yang , Yanxiao Liu , Wenjun Zeng

Large Language Models (LLMs) demonstrate exceptional reasoning abilities, enabling strong generalization across diverse tasks such as commonsense reasoning and instruction following. However, as LLMs scale, inference costs become…

Computation and Language · Computer Science 2025-02-06 Rhea Sanjay Sukthanker , Benedikt Staffler , Frank Hutter , Aaron Klein

Although large language models (LLMs) have demonstrated their strong intelligence ability, the high demand for computation and storage hinders their practical application. To this end, many model compression techniques are proposed to…

Computation and Language · Computer Science 2024-11-01 Ge Yang , Changyi He , Jinyang Guo , Jianyu Wu , Yifu Ding , Aishan Liu , Haotong Qin , Pengliang Ji , Xianglong Liu

Humans organize knowledge into compact conceptual categories that balance compression with semantic richness. Large Language Models (LLMs) exhibit impressive linguistic abilities, but whether they navigate this same compression-meaning…

Computation and Language · Computer Science 2025-12-03 Chen Shani , Liron Soffer , Dan Jurafsky , Yann LeCun , Ravid Shwartz-Ziv

Large language models (LLMs) have been applied in various applications due to their astonishing capabilities. With advancements in technologies such as chain-of-thought (CoT) prompting and in-context learning (ICL), the prompts fed to LLMs…

Computation and Language · Computer Science 2023-12-07 Huiqiang Jiang , Qianhui Wu , Chin-Yew Lin , Yuqing Yang , Lili Qiu
‹ Prev 1 2 3 10 Next ›