English
Related papers

Related papers: The KoLMogorov Test: Compression by Code Generatio…

200 papers

The Kolmogorov complexity of x, denoted C(x), is the length of the shortest program that generates x. For such a simple definition, Kolmogorov complexity has a rich and deep theory, as well as applications to a wide variety of topics…

Computational Complexity · Computer Science 2017-02-17 Stephen Fenner , Lance Fortnow

The widespread use of Large Language Models (LLMs) in software engineering has intensified the need for improved model and resource efficiency. In particular, for neural code generation, LLMs are used to translate function/method signature…

Software Engineering · Computer Science 2025-06-12 Guang Yang , Yu Zhou , Wei Cheng , Xiangyu Zhang , Xiang Chen , Terry Yue Zhuo , Ke Liu , Xin Zhou , David Lo , Taolue Chen

The classical coding theorem in Kolmogorov complexity states that if an $n$-bit string $x$ is sampled with probability $\delta$ by an algorithm with prefix-free domain then K$(x) \leq \log(1/\delta) + O(1)$. In a recent work, Lu and…

Computational Complexity · Computer Science 2022-04-19 Zhenjian Lu , Igor C. Oliveira , Marius Zimand

In this work, we conceptualize the learning process as information compression. We seek to equip generative pre-trained models with human-like learning capabilities that enable data compression during inference. We present a novel approach…

Artificial Intelligence · Computer Science 2023-08-15 Cynthia Huang , Yuqing Xie , Zhiying Jiang , Jimmy Lin , Ming Li

Prompt engineering enables Large Language Models (LLMs) to perform a variety of tasks. However, lengthy prompts significantly increase computational complexity and economic costs. To address this issue, we study six prompt compression…

Computation and Language · Computer Science 2025-05-02 Zheng Zhang , Jinyi Li , Yihuai Lan , Xiang Wang , Hao Wang

Large Language Models (LLMs) have demonstrated remarkable capabilities across numerous tasks, yet principled explanations for their underlying mechanisms and several phenomena, such as scaling laws, hallucinations, and related behaviors,…

Artificial Intelligence · Computer Science 2025-11-11 Zhixuan Pan , Shaowen Wang , Jian Li

Knowledge tracing (KT), wherein students' problem-solving histories are used to estimate their current levels of knowledge, has attracted significant interest from researchers. However, most existing KT models were developed with an…

Computation and Language · Computer Science 2024-06-19 Heeseok Jung , Jaesang Yoo , Yohaan Yoon , Yeonju Jang

The rise of large language models (LLMs) is revolutionizing information retrieval, question answering, summarization, and code generation tasks. However, in addition to confidently presenting factually inaccurate information at times (known…

Artificial Intelligence · Computer Science 2023-04-26 Henry Gilbert , Michael Sandborn , Douglas C. Schmidt , Jesse Spencer-Smith , Jules White

Understanding the relationship between data compression and the capabilities of Large Language Models (LLMs) is crucial, especially in specialized domains like code intelligence. Prior work posited a linear relationship between compression…

Computation and Language · Computer Science 2026-03-27 Shijie Xuyang , Xianzhen Luo , Zheng Chu , Houyi Li , Siming Huang , Qiufeng Wang , Wanxiang Che , Qingfu Zhu , Shuigeng Zhou

There is a belief that learning to compress well will lead to intelligence. Recently, language modeling has been shown to be equivalent to compression, which offers a compelling rationale for the success of large language models (LLMs): the…

Computation and Language · Computer Science 2024-08-20 Yuzhen Huang , Jinghan Zhang , Zifei Shan , Junxian He

Generating Chain-of-Thought (CoT) before deriving the answer can effectively improve the reasoning capabilities of large language models (LLMs) and significantly improve the accuracy of the generated answer. However, in most cases, the…

Computation and Language · Computer Science 2024-12-17 Yu Kang , Xianghui Sun , Liangyu Chen , Wei Zou

Large Language Models (LLMs) have demonstrated remarkable potential in code generation. The integration of Chain of Thought (CoT) reasoning can further boost their performance. However, current CoT methods often require manual writing or…

Software Engineering · Computer Science 2024-08-06 Guang Yang , Yu Zhou , Xiang Chen , Xiangyu Zhang , Terry Yue Zhuo , Taolue Chen

Key-Value cache (\texttt{KV} \texttt{cache}) compression has emerged as a promising technique to optimize Large Language Model (LLM) serving. It primarily decreases the memory consumption of \texttt{KV} \texttt{cache} to reduce the…

Machine Learning · Computer Science 2025-04-01 Wei Gao , Xinyu Zhou , Peng Sun , Tianwei Zhang , Yonggang Wen

Large Language Models (LLMs) need to adapt to the continuous changes in data, tasks, and user preferences. Due to their massive size and the high costs associated with training, LLMs are not suitable for frequent retraining. However,…

Computation and Language · Computer Science 2024-12-11 Dongfang Li , Zetian Sun , Xinshuo Hu , Baotian Hu , Min Zhang

The coding theorem for Kolmogorov complexity states that any string sampled from a computable distribution has a description length close to its information content. A coding theorem for resource-bounded Kolmogorov complexity is the key to…

Computational Complexity · Computer Science 2024-09-20 Shuichi Hirahara , Zhenjian Lu , Mikito Nanashima

Explicit chain-of-thought (CoT) reasoning substantially improves the reasoning ability of large language models (LLMs), but incurs high inference cost due to lengthy autoregressive traces. Existing latent reasoning methods offer a promising…

Computation and Language · Computer Science 2026-05-26 Hui Xie , Jie Liu , Ziyue Qiao , Joaquin Vanschore

Large language models (LLMs) have been applied in various applications due to their astonishing capabilities. With advancements in technologies such as chain-of-thought (CoT) prompting and in-context learning (ICL), the prompts fed to LLMs…

Computation and Language · Computer Science 2023-12-07 Huiqiang Jiang , Qianhui Wu , Chin-Yew Lin , Yuqing Yang , Lili Qiu

Motivated by reducing the computational and storage costs of LLMs, model compression and KV cache compression have attracted much attention from researchers. However, current methods predominantly emphasize maintaining the performance of…

Machine Learning · Computer Science 2025-02-26 Zhenheng Tang , Xiang Liu , Qian Wang , Peijie Dong , Bingsheng He , Xiaowen Chu , Bo Li

Large Language Models (LLMs) have achieved remarkable success in source code understanding, yet as software systems grow in scale, computational efficiency has become a critical bottleneck. Currently, these models rely on a text-based…

Computation and Language · Computer Science 2026-04-29 Yuling Shi , Chaoxiang Xie , Zhensu Sun , Yeheng Chen , Chenxu Zhang , Longfei Yun , Chengcheng Wan , Hongyu Zhang , David Lo , Xiaodong Gu

In this paper, we present a theoretical effort to connect the theory of program size to psychology by implementing a concrete language of thought with Turing-computable Kolmogorov complexity (LT^2C^2) satisfying the following requirements:…

Neurons and Cognition · Quantitative Biology 2013-03-06 Sergio Romano , Mariano Sigman , Santiago Figueira
‹ Prev 1 2 3 10 Next ›