Related papers: SpreadsheetLLM: Encoding Spreadsheets for Large La…

Vision Language Models for Spreadsheet Understanding: Challenges and Opportunities

This paper explores capabilities of Vision Language Models on spreadsheet comprehension. We propose three self-supervised challenges with corresponding evaluation metrics to comprehensively evaluate VLMs on Optical Character Recognition…

Computer Vision and Pattern Recognition · Computer Science 2024-09-27 Shiyu Xia , Junyu Xiong , Haoyu Dong , Jianbo Zhao , Yuzhang Tian , Mengyu Zhou , Yeye He , Shi Han , Dongmei Zhang

SheetDesigner: MLLM-Powered Spreadsheet Layout Generation with Rule-Based and Vision-Based Reflection

Spreadsheets are critical to data-centric tasks, with rich, structured layouts that enable efficient information transmission. Given the time and expertise required for manual spreadsheet layout design, there is an urgent need for automated…

Artificial Intelligence · Computer Science 2025-09-10 Qin Chen , Yuanyi Ren , Xiaojun Ma , Mugeng Liu , Han Shi , Dongmei Zhang

PocketLLM: Ultimate Compression of Large Language Models via Meta Networks

As Large Language Models (LLMs) continue to grow in size, storing and transmitting them on edge devices becomes increasingly challenging. Traditional methods like quantization and pruning struggle to achieve extreme compression of LLMs…

Machine Learning · Computer Science 2025-11-25 Ye Tian , Chengcheng Wang , Jing Han , Yehui Tang , Kai Han

EntroLLM: Entropy Encoded Weight Compression for Efficient Large Language Model Inference on Edge Devices

Large Language Models (LLMs) achieve strong performance across tasks, but face storage and compute challenges on edge devices. We propose EntroLLM, a compression framework combining mixed quantization and entropy coding to reduce storage…

Machine Learning · Computer Science 2026-05-05 Arnab Sanyal , Gourav Datta , Prithwish Mukherjee , Sandeep P. Chinchali , Michael Orshansky

A Survey on Model Compression for Large Language Models

Large Language Models (LLMs) have transformed natural language processing tasks successfully. Yet, their large size and high computational needs pose challenges for practical use, especially in resource-limited settings. Model compression…

Computation and Language · Computer Science 2024-07-31 Xunyu Zhu , Jian Li , Yong Liu , Can Ma , Weiping Wang

UltraSketchLLM: Saliency-Driven Sketching for Ultra-Low Bit LLM Compression

The rapid growth of large language models (LLMs) has outpaced the memory constraints of edge devices, necessitating extreme weight compression beyond the 1-bit limit. While quantization reduces model size, it is fundamentally limited to 1…

Machine Learning · Computer Science 2025-06-24 Sunan Zou , Ziyun Zhang , Xueting Sun , Guojie Luo

Towards Robust Real-World Spreadsheet Understanding with Multi-Agent Multi-Format Reasoning

Spreadsheets are central to real-world applications such as enterprise reporting, auditing, and scientific data management. Despite their ubiquity, existing large language model based approaches typically treat tables as plain text,…

Computation and Language · Computer Science 2026-04-15 Houxing Ren , Mingjie Zhan , Zimu Lu , Ke Wang , Yunqiao Yang , Haotian Hou , Hongsheng Li

SpreadsheetBench: Towards Challenging Real World Spreadsheet Manipulation

We introduce SpreadsheetBench, a challenging spreadsheet manipulation benchmark exclusively derived from real-world scenarios, designed to immerse current large language models (LLMs) in the actual workflow of spreadsheet users. Unlike…

Computation and Language · Computer Science 2024-10-18 Zeyao Ma , Bohan Zhang , Jing Zhang , Jifan Yu , Xiaokang Zhang , Xiaohan Zhang , Sijia Luo , Xi Wang , Jie Tang

MeshLLM: Empowering Large Language Models to Progressively Understand and Generate 3D Mesh

We present MeshLLM, a novel framework that leverages large language models (LLMs) to understand and generate text-serialized 3D meshes. Our approach addresses key limitations in existing methods, including the limited dataset scale when…

Graphics · Computer Science 2025-08-06 Shuangkang Fang , I-Chao Shen , Yufeng Wang , Yi-Hsuan Tsai , Yi Yang , Shuchang Zhou , Wenrui Ding , Takeo Igarashi , Ming-Hsuan Yang

From Rows to Reasoning: A Retrieval-Augmented Multimodal Framework for Spreadsheet Understanding

Large Language Models (LLMs) struggle to reason over large-scale enterprise spreadsheets containing thousands of numeric rows, multiple linked sheets, and embedded visual content such as charts and receipts. Prior state-of-the-art…

Computation and Language · Computer Science 2026-02-10 Anmol Gulati , Sahil Sen , Waqar Sarguroh , Kevin Paul

LayoutLLM: Large Language Model Instruction Tuning for Visually Rich Document Understanding

This paper proposes LayoutLLM, a more flexible document analysis method for understanding imaged documents. Visually Rich Document Understanding tasks, such as document image classification and information extraction, have gained…

Computation and Language · Computer Science 2024-03-22 Masato Fujitake

DocSLM: A Small Vision-Language Model for Long Multimodal Document Understanding

Large Vision-Language Models (LVLMs) have demonstrated strong multimodal reasoning capabilities on long and complex documents. However, their high memory footprint makes them impractical for deployment on resource-constrained edge devices.…

Computer Vision and Pattern Recognition · Computer Science 2025-11-24 Tanveer Hannan , Dimitrios Mallios , Parth Pathak , Faegheh Sardari , Thomas Seidl , Gedas Bertasius , Mohsen Fayyaz , Sunando Sengupta

TableLLM: Enabling Tabular Data Manipulation by LLMs in Real Office Usage Scenarios

We introduce TableLLM, a robust large language model (LLM) with 8 billion parameters, purpose-built for proficiently handling tabular data manipulation tasks, whether they are embedded within documents or spreadsheets, catering to…

Computation and Language · Computer Science 2025-02-18 Xiaokang Zhang , Sijia Luo , Bohan Zhang , Zeyao Ma , Jing Zhang , Yang Li , Guanlin Li , Zijun Yao , Kangli Xu , Jinchang Zhou , Daniel Zhang-Li , Jifan Yu , Shu Zhao , Juanzi Li , Jie Tang

Measuring Distributional Shifts in Text: The Advantage of Language Model-Based Embeddings

An essential part of monitoring machine learning models in production is measuring input and output data drift. In this paper, we present a system for measuring distributional shifts in natural language data and highlight and investigate…

Computation and Language · Computer Science 2023-12-06 Gyandev Gupta , Bashir Rastegarpanah , Amalendu Iyer , Joshua Rubin , Krishnaram Kenthapadi

SpreadsheetArena: Decomposing Preference in LLM Generation of Spreadsheet Workbooks

Large language models (LLMs) are increasingly tasked with producing and manipulating structured artifacts. We consider the task of end-to-end spreadsheet generation, where language models are prompted to produce spreadsheet artifacts to…

Computation and Language · Computer Science 2026-03-12 Srivatsa Kundurthy , Clara Na , Michael Handley , Zach Kirshner , Chen Bo Calvin Zhang , Manasi Sharma , Emma Strubell , John Ling

Large Language Models for Spreadsheets: Benchmarking Progress and Evaluating Performance with FLARE

Large Language Models (LLMs) have demonstrated some significant capabilities across various domains; however, their effectiveness in spreadsheet related tasks remains underexplored. This study introduces a foundation for a comprehensive…

Software Engineering · Computer Science 2025-06-24 Simon Thorne

LensVLM: Selective Context Expansion for Compressed Visual Representation of Text

Vision Language Models (VLMs) offer the exciting possibility of processing text as rendered images, bypassing the need for tokenizing the text into long token sequences. Since VLM image encoders map fixed-size images to a fixed number of…

Computer Vision and Pattern Recognition · Computer Science 2026-05-11 Roy Xie , Dan Friedman , Donghan Yu , Bowen Pan , Christopher Fifty , Jang-Hyun Kim , Xianzhi Du , Zhe Gan , Vivek Rathod , Bhuwan Dhingra

EmbedLLM: Learning Compact Representations of Large Language Models

With hundreds of thousands of language models available on Huggingface today, efficiently evaluating and utilizing these models across various downstream, tasks has become increasingly critical. Many existing methods repeatedly learn…

Computation and Language · Computer Science 2024-10-18 Richard Zhuang , Tianhao Wu , Zhaojin Wen , Andrew Li , Jiantao Jiao , Kannan Ramchandran

SheetAgent: Towards A Generalist Agent for Spreadsheet Reasoning and Manipulation via Large Language Models

Spreadsheets are ubiquitous across the World Wide Web, playing a critical role in enhancing work efficiency across various domains. Large language model (LLM) has been recently attempted for automatic spreadsheet manipulation but has not…

Artificial Intelligence · Computer Science 2025-03-04 Yibin Chen , Yifu Yuan , Zeyu Zhang , Yan Zheng , Jinyi Liu , Fei Ni , Jianye Hao , Hangyu Mao , Fuzheng Zhang

Enhancing Large Language Model Efficiencyvia Symbolic Compression: A Formal Approach Towards Interpretability

Large language models (LLMs) face significant token efficiency bottlenecks in code generation and logical reasoning tasks, a challenge that directly impacts inference cost and model interpretability. This paper proposes a formal framework…

Artificial Intelligence · Computer Science 2025-02-03 Lumen AI , Tengzhou No. 1 Middle School , Shihao Ji , Zihui Song , Fucheng Zhong , Jisen Jia , Zhaobo Wu , Zheyi Cao , Tianhao Xu