Related papers: Compression Represents Intelligence Linearly

Is Compression Really Linear with Code Intelligence?

Understanding the relationship between data compression and the capabilities of Large Language Models (LLMs) is crucial, especially in specialized domains like code intelligence. Prior work posited a linear relationship between compression…

Computation and Language · Computer Science 2026-03-27 Shijie Xuyang , Xianzhen Luo , Zheng Chu , Houyi Li , Siming Huang , Qiufeng Wang , Wanxiang Che , Qingfu Zhu , Shuigeng Zhou

Ranking LLMs by compression

We conceptualize the process of understanding as information compression, and propose a method for ranking large language models (LLMs) based on lossless data compression. We demonstrate the equivalence of compression length under…

Artificial Intelligence · Computer Science 2024-06-21 Peijia Guo , Ziguang Li , Haibo Hu , Chao Huang , Ming Li , Rui Zhang

Information Capacity: Evaluating the Efficiency of Large Language Models via Text Compression

Recent years have witnessed the rapid advancements of large language models (LLMs) and their expanding applications, leading to soaring demands for computational resources. The widespread adoption of test-time scaling further intensifies…

Artificial Intelligence · Computer Science 2026-03-11 Cheng Yuan , Jiawei Shao , Xuelong Li

Learning is Forgetting: LLM Training As Lossy Compression

Despite the increasing prevalence of large language models (LLMs), we still have a limited understanding of how their representational spaces are structured. This limits our ability to interpret how and what they learn or relate them to…

Machine Learning · Computer Science 2026-04-10 Henry C. Conklin , Tom Hosking , Tan Yi-Chern , Julian Gold , Jonathan D. Cohen , Thomas L. Griffiths , Max Bartolo , Seraphina Goldfarb-Tarrant

Large Language Models for Lossless Image Compression: Next-Pixel Prediction in Language Space is All You Need

We have recently witnessed that ``Intelligence" and `` Compression" are the two sides of the same coin, where the language large model (LLM) with unprecedented intelligence is a general-purpose lossless compressor for various data…

Computer Vision and Pattern Recognition · Computer Science 2024-11-25 Kecheng Chen , Pingping Zhang , Hui Liu , Jie Liu , Yibing Liu , Jiaxin Huang , Shiqi Wang , Hong Yan , Haoliang Li

Do Compressed LLMs Forget Knowledge? An Experimental Study with Practical Implications

Compressing Large Language Models (LLMs) often leads to reduced performance, especially for knowledge-intensive tasks. In this work, we dive into how compression damages LLMs' inherent knowledge and the possible remedies. We start by…

Computation and Language · Computer Science 2024-02-19 Duc N. M Hoang , Minsik Cho , Thomas Merth , Mohammad Rastegari , Zhangyang Wang

Compression Hacking: A Supplementary Perspective on Informatics Properties of Language Models from Geometric Distortion

Recently, the concept of ``compression as intelligence'' has provided a novel informatics metric perspective for language models (LMs), emphasizing that highly structured representations signify the intelligence level of LMs. However, from…

Computation and Language · Computer Science 2025-11-07 Jianxiang Zang , Meiling Ning , Yongda Wei , Shihan Dou , Jiazheng Zhang , Nijia Mo , Binhong Li , Tao Gui , Qi Zhang , Xuanjing Huang

When Reasoning Meets Compression: Understanding the Effects of LLMs Compression on Large Reasoning Models

Compression methods, including quantization, distillation, and pruning, improve the computational efficiency of large reasoning models (LRMs). However, existing studies either fail to sufficiently compare all three compression methods on…

Machine Learning · Computer Science 2026-03-03 Nan Zhang , Eugene Kwek , Yusen Zhang , Ngoc-Hieu Nguyen , Prasenjit Mitra , Rui Zhang

The Cost of Compression: Investigating the Impact of Compression on Parametric Knowledge in Language Models

Compressing large language models (LLMs), often consisting of billions of parameters, provides faster inference, smaller memory footprints, and enables local deployment. Two standard compression techniques are pruning and quantization, with…

Computation and Language · Computer Science 2023-12-05 Satya Sai Srinath Namburi , Makesh Sreedhar , Srinath Srinivasan , Frederic Sala

Scaling Learned Image Compression Models up to 1 Billion

Recent advances in large language models (LLMs) highlight a strong connection between intelligence and compression. Learned image compression, a fundamental task in modern data compression, has made significant progress in recent years.…

Computer Vision and Pattern Recognition · Computer Science 2025-08-13 Yuqi Li , Haotian Zhang , Li Li , Dong Liu , Feng Wu

An Empirical Study on Prompt Compression for Large Language Models

Prompt engineering enables Large Language Models (LLMs) to perform a variety of tasks. However, lengthy prompts significantly increase computational complexity and economic costs. To address this issue, we study six prompt compression…

Computation and Language · Computer Science 2025-05-02 Zheng Zhang , Jinyi Li , Yihuai Lan , Xiang Wang , Hao Wang

Linguistic properties and model scale in brain encoding: from small to compressed language models

Recent work has shown that scaling large language models (LLMs) improves their alignment with human brain activity, yet it remains unclear what drives these gains and which representational properties are responsible. Although larger models…

Neurons and Cognition · Quantitative Biology 2026-02-10 Subba Reddy Oota , Vijay Rowtula , Satya Sai Srinath Namburi , Khushbu Pahwa , Anant Khandelwal , Manish Gupta , Tanmoy Chakraborty , Bapi S. Raju

How Well do LLMs Compress Their Own Chain-of-Thought? A Token Complexity Approach

Chain-of-thought prompting has emerged as a powerful technique for enabling large language models (LLMs) to solve complex reasoning tasks. However, these reasoning chains can be verbose, raising concerns about efficiency. In response,…

Computation and Language · Computer Science 2025-04-02 Ayeong Lee , Ethan Che , Tianyi Peng

Compression Laws for Large Language Models

We introduce compression laws for language language models (LLMs). While recent scaling laws have sought to understand how LLMs scale with respect to model size, pre-training data, and computational resources, we focus on understanding how…

Computation and Language · Computer Science 2025-04-08 Ayan Sengupta , Siddhant Chaudhary , Tanmoy Chakraborty

Strong Memory, Weak Control: An Empirical Study of Executive Functioning in LLMs

Working memory, or the ability to hold and manipulate information in the mind, is a critical component of human intelligence and executive functioning. It is correlated with performance on various cognitive tasks, including measures of…

Computation and Language · Computer Science 2025-12-01 Karin de Langis , Jong Inn Park , Bin Hu , Khanh Chi Le , Andreas Schramm , Michael C. Mensink , Andrew Elfenbein , Dongyeop Kang

Compression Tells Intelligence: Visual Coding, Visual Token Technology, and the Unification

"Compression Tells Intelligence", is supported by research in artificial intelligence, particularly concerning (multimodal) large language models (LLMs/MLLMs), where compression efficiency often correlates with improved model performance…

Computer Vision and Pattern Recognition · Computer Science 2026-01-29 Xin Jin , Jinming Liu , Yuntao Wei , Junyan Lin , Zhicheng Wang , Jianguo Huang , Xudong Yang , Yanxiao Liu , Wenjun Zeng

Compressing Large Language Models with Automated Sub-Network Search

Large Language Models (LLMs) demonstrate exceptional reasoning abilities, enabling strong generalization across diverse tasks such as commonsense reasoning and instruction following. However, as LLMs scale, inference costs become…

Computation and Language · Computer Science 2025-02-06 Rhea Sanjay Sukthanker , Benedikt Staffler , Frank Hutter , Aaron Klein

LLMCBench: Benchmarking Large Language Model Compression for Efficient Deployment

Although large language models (LLMs) have demonstrated their strong intelligence ability, the high demand for computation and storage hinders their practical application. To this end, many model compression techniques are proposed to…

Computation and Language · Computer Science 2024-11-01 Ge Yang , Changyi He , Jinyang Guo , Jianyu Wu , Yifu Ding , Aishan Liu , Haotong Qin , Pengliang Ji , Xianglong Liu

From Tokens to Thoughts: How LLMs and Humans Trade Compression for Meaning

Humans organize knowledge into compact conceptual categories that balance compression with semantic richness. Large Language Models (LLMs) exhibit impressive linguistic abilities, but whether they navigate this same compression-meaning…

Computation and Language · Computer Science 2025-12-03 Chen Shani , Liron Soffer , Dan Jurafsky , Yann LeCun , Ravid Shwartz-Ziv

LLMLingua: Compressing Prompts for Accelerated Inference of Large Language Models

Large language models (LLMs) have been applied in various applications due to their astonishing capabilities. With advancements in technologies such as chain-of-thought (CoT) prompting and in-context learning (ICL), the prompts fed to LLMs…

Computation and Language · Computer Science 2023-12-07 Huiqiang Jiang , Qianhui Wu , Chin-Yew Lin , Yuqing Yang , Lili Qiu