Related papers: LILO: Learning Interpretable Libraries by Compress…

LEGO-Compiler: Enhancing Neural Compilation Through Translation Composability

Large language models (LLMs) have the potential to revolutionize how we design and implement compilers and code translation tools. However, existing LLMs struggle to handle long and complex programs. We introduce LEGO-Compiler, a novel…

Programming Languages · Computer Science 2025-05-28 Shuoming Zhang , Jiacheng Zhao , Chunwei Xia , Zheng Wang , Yunji Chen , Xiaobing Feng , Huimin Cui

Leroy: Library Learning for Imperative Programming Languages

Library learning is the process of building a library of common functionalities from a given set of programs. Typically, this process is applied in the context of aiding program synthesis: concise functions can help the synthesizer produce…

Programming Languages · Computer Science 2024-10-10 Abhiram Bellur , Razan Alghamdi , Kidus Workneh , Joseph Izraelevitz

CodeOCR: On the Effectiveness of Vision Language Models in Code Understanding

Large Language Models (LLMs) have achieved remarkable success in source code understanding, yet as software systems grow in scale, computational efficiency has become a critical bottleneck. Currently, these models rely on a text-based…

Computation and Language · Computer Science 2026-04-29 Yuling Shi , Chaoxiang Xie , Zhensu Sun , Yeheng Chen , Chenxu Zhang , Longfei Yun , Chengcheng Wan , Hongyu Zhang , David Lo , Xiaodong Gu

ReadMe.LLM: A Framework to Help LLMs Understand Your Library

Large Language Models (LLMs) often struggle with code generation tasks involving niche software libraries. Existing code generation techniques with only human-oriented documentation can fail -- even when the LLM has access to web search and…

Software Engineering · Computer Science 2025-05-09 Sandya Wijaya , Jacob Bolano , Alejandro Gomez Soteres , Shriyanshu Kode , Yue Huang , Anant Sahai

ReGAL: Refactoring Programs to Discover Generalizable Abstractions

While large language models (LLMs) are increasingly being used for program synthesis, they lack the global view needed to develop useful abstractions; they generally predict programs one at a time, often repeating the same functionality.…

Software Engineering · Computer Science 2024-06-07 Elias Stengel-Eskin , Archiki Prasad , Mohit Bansal

Learning to Superoptimize Real-world Programs

Program optimization is the process of modifying software to execute more efficiently. Superoptimizers attempt to find the optimal program by employing significantly more expensive search and constraint solving techniques. Generally, these…

Machine Learning · Computer Science 2022-04-06 Alex Shypula , Pengcheng Yin , Jeremy Lacomis , Claire Le Goues , Edward Schwartz , Graham Neubig

Tackling Distribution Shift in LLM via KILO: Knowledge-Instructed Learning for Continual Adaptation

Large Language Models (LLMs) often suffer from performance degradation when faced with domain shifts, primarily due to catastrophic forgetting. In this work, we propose KILO (Knowledge-Instructed Learning for Continual Adaptation), a novel…

Computation and Language · Computer Science 2025-08-06 Iing Muttakhiroh , Thomas Fevens

LINC: A Neurosymbolic Approach for Logical Reasoning by Combining Language Models with First-Order Logic Provers

Logical reasoning, i.e., deductively inferring the truth value of a conclusion from a set of premises, is an important task for artificial intelligence with wide potential impacts on science, mathematics, and society. While many…

Computation and Language · Computer Science 2024-02-15 Theo X. Olausson , Alex Gu , Benjamin Lipkin , Cedegao E. Zhang , Armando Solar-Lezama , Joshua B. Tenenbaum , Roger Levy

SymCode: A Neurosymbolic Approach to Mathematical Reasoning via Verifiable Code Generation

Large Language Models (LLMs) often struggle with complex mathematical reasoning, where prose-based generation leads to unverified and arithmetically unsound solutions. Current prompting strategies like Chain of Thought still operate within…

Computation and Language · Computer Science 2026-01-27 Sina Bagheri Nezhad , Yao Li , Ameeta Agrawal

From Restructuring to Stabilization: A Large-Scale Experiment on Iterative Code Readability Refactoring with Large Language Models

Large language models (LLMs) are increasingly used for automated code refactoring tasks. Although these models can quickly refactor code, the quality may exhibit inconsistencies and unpredictable behavior. In this article, we systematically…

Software Engineering · Computer Science 2026-02-26 Norman Peitek , Julia Hess , Sven Apel

Large Language Models are Interpretable Learners

The trade-off between expressiveness and interpretability remains a core challenge when building human-centric predictive models for classification and decision-making. While symbolic rules offer interpretability, they often lack…

Artificial Intelligence · Computer Science 2024-06-26 Ruochen Wang , Si Si , Felix Yu , Dorothea Wiesmann , Cho-Jui Hsieh , Inderjit Dhillon

Combining Large Language Models and Gradient-Free Optimization for Automatic Control Policy Synthesis

Large Language models (LLMs) have shown promise as generators of symbolic control policies, producing interpretable program-like representations through iterative search. However, these models are not capable of separating the functional…

Machine Learning · Computer Science 2025-10-02 Carlo Bosio , Matteo Guarrera , Alberto Sangiovanni-Vincentelli , Mark W. Mueller

Enhancing Large Language Model Efficiencyvia Symbolic Compression: A Formal Approach Towards Interpretability

Large language models (LLMs) face significant token efficiency bottlenecks in code generation and logical reasoning tasks, a challenge that directly impacts inference cost and model interpretability. This paper proposes a formal framework…

Artificial Intelligence · Computer Science 2025-02-03 Lumen AI , Tengzhou No. 1 Middle School , Shihao Ji , Zihui Song , Fucheng Zhong , Jisen Jia , Zhaobo Wu , Zheyi Cao , Tianhao Xu

A Unified Language Model for Large Scale Search, Recommendation, and Reasoning

LLMs are increasingly applied to recommendation, retrieval, and reasoning, yet deploying a single end-to-end model that can jointly support these behaviors over large, heterogeneous catalogs remains challenging. Such systems must generate…

Information Retrieval · Computer Science 2026-03-19 Marco De Nadai , Edoardo D'Amico , Max Lefarov , Alexandre Tamborrino , Divita Vohra , Mark VanMiddlesworth , Shawn Lin , Jacqueline Wood , Jan Stypka , Eliza Klyce , Keshi Dai , Timothy Christopher Heath , Martin D. Gould , Yves Raimond , Sandeep Ghael , Tony Jebara , Andreas Damianou , Vladan Radosavljevic , Paul N. Bennett , Mounia Lalmas , Praveen Chandar

Teaching Code Refactoring Using LLMs

This Innovative Practice full paper explores how Large Language Models (LLMs) can enhance the teaching of code refactoring in software engineering courses through real-time, context-aware feedback. Refactoring improves code quality but is…

Software Engineering · Computer Science 2025-08-14 Anshul Khairnar , Aarya Rajoju , Edward F. Gehringer

Exploring Task Performance with Interpretable Models via Sparse Auto-Encoders

Large Language Models (LLMs) are traditionally viewed as black-box algorithms, therefore reducing trustworthiness and obscuring potential approaches to increasing performance on downstream tasks. In this work, we apply an effective LLM…

Computation and Language · Computer Science 2025-07-10 Shun Wang , Tyler Loakman , Youbo Lei , Yi Liu , Bohao Yang , Yuting Zhao , Dong Yang , Chenghua Lin

ToolLibGen: Scalable Automatic Tool Creation and Aggregation for LLM Reasoning

Large Language Models (LLMs) equipped with external tools have demonstrated enhanced performance on complex reasoning tasks. The widespread adoption of this tool-augmented reasoning is hindered by the scarcity of domain-specific tools. For…

Computation and Language · Computer Science 2025-10-10 Murong Yue , Zhiwei Liu , Liangwei Yang , Jianguo Zhang , Zuxin Liu , Haolin Chen , Ziyu Yao , Silvio Savarese , Caiming Xiong , Shelby Heinecke , Huan Wang

Learn to Relax with Large Language Models: Solving Constraint Optimization Problems via Bidirectional Coevolution

Large Language Model (LLM)-based optimization has recently shown promise for autonomous problem solving, yet most approaches still cast LLMs as passive constraint checkers rather than proactive strategy designers, limiting their…

Artificial Intelligence · Computer Science 2026-04-06 Beidan Liu , Zhengqiu Zhu , Chen Gao , Tianle Pu , Yong Zhao , Wei Qi , Quanjun Yin

Toward Neurosymbolic Program Comprehension

Recent advancements in Large Language Models (LLMs) have paved the way for Large Code Models (LCMs), enabling automation in complex software engineering tasks, such as code generation, software testing, and program comprehension, among…

Software Engineering · Computer Science 2025-02-05 Alejandro Velasco , Aya Garryyeva , David N. Palacio , Antonio Mastropaolo , Denys Poshyvanyk

DaMoC: Efficiently Selecting the Optimal Large Language Model for Fine-tuning Domain Tasks Based on Data and Model Compression

Large language models (LLMs) excel in general tasks but struggle with domain-specific ones, requiring fine-tuning with specific data. With many open-source LLMs available, selecting the best model for fine-tuning downstream tasks is…

Computation and Language · Computer Science 2025-09-05 Wei Huang , Huang Wei , Yinggui Wang