Related papers: Cross-Modal Memory Compression for Efficient Multi…

AgentOCR: Reimagining Agent History via Optical Self-Compression

Recent advances in large language models (LLMs) enable agentic systems trained with reinforcement learning (RL) over multi-turn interaction trajectories, but practical deployment is bottlenecked by rapidly growing textual histories that…

Machine Learning · Computer Science 2026-03-03 Lang Feng, Fuchao Yang, Feng Chen, Xin Cheng, Haiyang Xu, Zhenglin Wan, Ming Yan, Bo An

MemOCR: Layout-Aware Visual Memory for Efficient Long-Horizon Reasoning

Long-horizon agentic reasoning necessitates effectively compressing growing interaction histories into a limited context window. Most existing memory systems serialize history as text, where token-level cost is uniform and scales linearly…

Artificial Intelligence · Computer Science 2026-05-19 Yaorui Shi, Shugui Liu, Yu Yang, Wenyu Mao, Yuxin Chen, Qi GU, Hui Su, Xunliang Cai, Xiang Wang, An Zhang

Hear Both Sides: Efficient Multi-Agent Debate via Diversity-Aware Message Retention

Multi-Agent Debate has emerged as a promising framework for improving the reasoning quality of large language models through iterative inter-agent communication. However, broadcasting all agent messages at every round introduces noise and…

Computation and Language · Computer Science 2026-04-15 Manh Nguyen, Anh Nguyen, Dung Nguyen, Svetha Venkatesh, Hung Le

OCR-Memory: Optical Context Retrieval for Long-Horizon Agent Memory

Autonomous LLM agents increasingly operate in long-horizon, interactive settings where success depends on reusing experience accumulated over extended histories. However, existing agent memory systems are fundamentally constrained by…

Computation and Language · Computer Science 2026-04-30 Jinze Li, Yang Zhang, Xin Yang, Jiayi Qu, Jinfeng Xu, Shuo Yang, Junhua Ding, Edith Cheuk-Han Ngai

Multi-Agent Debate with Memory Masking

Large language models (LLMs) have recently demonstrated impressive capabilities in reasoning tasks. Currently, mainstream LLM reasoning frameworks predominantly focus on scaling up inference-time sampling to enhance performance. In…

Computation and Language · Computer Science 2026-03-24 Hongduan Tian, Xiao Feng, Ziyuan Zhao, Xiangyu Zhu, Rolan Yan, Bo Han

Improving Multi-Agent Debate with Sparse Communication Topology

Multi-agent debate has proven effective in improving large language models quality for reasoning and factuality tasks. While various role-playing strategies in multi-agent debates have been explored, in terms of the communication among…

Computation and Language · Computer Science 2024-06-18 Yunxuan Li, Yibing Du, Jiageng Zhang, Le Hou, Peter Grabowski, Yeqing Li, Eugene Ie

GroupDebate: Enhancing the Efficiency of Multi-Agent Debate Using Group Discussion

In recent years, Large Language Models (LLMs) have demonstrated remarkable capabilities across diverse NLP tasks. Extensive research has explored how to enhance the logical reasoning abilities such as Chain-of-Thought, Chain-of-Thought with…

Computation and Language · Computer Science 2025-12-29 Tongxuan Liu, Xingyu Wang, Weizhe Huang, Wenjiang Xu, Yuting Zeng, Lei Jiang, Hailong Yang, Jing Li

Less Is More: Elevating RAG via Performance-Driven Context Compression

Retrieval-Augmented Generation (RAG) has emerged as a promising paradigm for improving the timeliness of knowledge updates and the factual accuracy of large language models. However, incorporating a large volume of retrieved documents…

Computation and Language · Computer Science 2026-05-29 Ziqiang Cui, Yunpeng Weng, Xing Tang, Peiyang Liu, Shiwei Li, Bowei He, Jiamin Chen, Yansen Zhang, Xiuqiang He, Chen Ma

Context Compression for Auto-regressive Transformers with Sentinel Tokens

The quadratic complexity of the attention module makes it gradually become the bulk of compute in Transformer-based LLMs during generation. Moreover, the excessive key-value cache that arises when dealing with long inputs also brings severe…

Computation and Language · Computer Science 2023-10-17 Siyu Ren, Qi Jia, Kenny Q. Zhu

Free-MAD: Consensus-Free Multi-Agent Debate

Multi-agent debate (MAD) is an emerging approach to improving the reasoning capabilities of large language models (LLMs). Existing MAD methods rely on multiple rounds of interaction among agents to reach consensus, and the final output is…

Artificial Intelligence · Computer Science 2025-09-16 Yu Cui, Hang Fu, Haibin Zhang, Licheng Wang, Cong Zuo

From Soliloquy to Agora: Memory-Enhanced LLM Agents with Decentralized Debate for Optimization Modeling

Optimization modeling underpins real-world decision-making in logistics, manufacturing, energy, and public services, but reliably solving such problems from natural-language requirements remains challenging for current large language models…

Optimization and Control · Mathematics 2026-04-29 Jianghao Lin, Zi Ling, Chenyu Zhou, Tianyi Xu, Ruoqing Jiang, Zizhuo Wang, Dongdong Ge

Dynamic Long Context Reasoning over Compressed Memory via End-to-End Reinforcement Learning

Large Language Models (LLMs) face significant challenges in long-context processing, including quadratic computational costs, information forgetting, and the context fragmentation inherent in retrieval-augmented generation (RAG). We propose…

Computation and Language · Computer Science 2026-02-10 Zhuoen Chen, Dongfang Li, Meishan Zhang, Baotian Hu, Min Zhang

A Picture Is Worth a Graph: A Blueprint Debate Paradigm for Multimodal Reasoning

This paper presents a pilot study aimed at introducing multi-agent debate into multimodal reasoning. The study addresses two key challenges: the trivialization of opinions resulting from excessive summarization and the diversion of focus…

Artificial Intelligence · Computer Science 2024-08-07 Changmeng Zheng, Dayong Liang, Wengyu Zhang, Xiao-Yong Wei, Tat-Seng Chua, Qing Li

Towards Preserving Semantic Structure in Argumentative Multi-Agent via Abstract Interpretation

Over the recent twenty years, argumentation has received considerable attention in the fields of knowledge representation, reasoning, and multi-agent systems. However, argumentation in dynamic multi-agent systems encounters the problem of…

Artificial Intelligence · Computer Science 2022-11-30 Minal Suresh Patil

OPTAGENT: Optimizing Multi-Agent LLM Interactions Through Verbal Reinforcement Learning for Enhanced Reasoning

Large Language Models (LLMs) have shown remarkable reasoning capabilities in mathematical and scientific tasks. To enhance complex reasoning, multi-agent systems have been proposed to harness the collective intelligence of LLM agents.…

Artificial Intelligence · Computer Science 2025-10-22 Zhenyu Bi, Meng Lu, Yang Li, Swastik Roy, Weijie Guan, Morteza Ziyadi, Xuan Wang

UniCompress: Token Compression for Unified Vision-Language Understanding and Generation

Unified models aim to support both understanding and generation by encoding images into discrete tokens and processing them alongside text within a single autoregressive framework. This unified design offers architectural simplicity and…

Computer Vision and Pattern Recognition · Computer Science 2026-03-13 Ziyao Wang, Chen Chen, Jingtao Li, Weiming Zhuang, Jiabo Huang, Ang Li, Lingjuan Lyu

Heterogeneous Consensus-Progressive Reasoning for Efficient Multi-Agent Debate

Multi-Agent Debate (MAD) is a collaborative framework in which multiple agents iteratively refine solutions through the generation of reasoning and alternating critique cycles. Current work primarily optimizes intra-round topologies and…

Multiagent Systems · Computer Science 2026-04-14 Yiqing Liu, Hantao Yao, Wu Liu, Allen He, Yongdong Zhang

Bridging Efficiency and Transparency: Explainable CoT Compression in Multimodal Large Reasoning Models

Long chains of thought (Long CoTs) are widely employed in multimodal reasoning models to tackle complex tasks by capturing detailed visual information. However, these Long CoTs are often excessively lengthy and contain redundant reasoning…

Artificial Intelligence · Computer Science 2026-02-11 Yizhi Wang, Linan Yue, Min-Ling Zhang

MemReread: Enhancing Agentic Long-Context Reasoning via Memory-Guided Rereading

To tackle long-context reasoning tasks without the quadratic complexity of standard attention mechanisms, approaches based on agent memory have emerged, which typically maintain a dynamically updated memory when linearly processing document…

Computation and Language · Computer Science 2026-05-12 Baibei Ji, Xiaoyang Weng, Juntao Li, Zecheng Tang, Yihang Lou, Min Zhang

Optical Context Compression Is Just (Bad) Autoencoding

DeepSeek-OCR shows that rendered text can be reconstructed from a small number of vision tokens, sparking excitement about using vision as a compression medium for long textual contexts. But this pipeline requires rendering token embeddings…

Computer Vision and Pattern Recognition · Computer Science 2026-04-07 Ivan Yee Lee, Cheng Yang, Taylor Berg-Kirkpatrick