Related papers: Decoding Lua: Formal Semantics for the Developer a…

Understanding Lua's Garbage Collection -- Towards a Formalized Static Analyzer

We provide the semantics of garbage collection (GC) for the Lua programming language. Of interest are the inclusion of finalizers (akin to destructors in object-oriented languages) and weak tables (a particular implementation of weak…

Programming Languages · Computer Science 2020-05-28 Mallku Soldevila, Beta Ziliani, Daniel Fridlender

PLSemanticsBench: Large Language Models As Programming Language Interpreters

As large language models (LLMs) excel at code reasoning, a natural question arises: can an LLM execute programs (i.e., act as an interpreter) purely based on a programming language's formal semantics? If so, it will enable rapid prototyping…

Programming Languages · Computer Science 2025-10-08 Aditya Thimmaiah, Jiyang Zhang, Jayanth Srinivasa, Junyi Jessy Li, Milos Gligoric

Beyond BLEU: A Semantic Evaluation Method for Code Translation

Code translation is one of the core capabilities of LLMs. However, evaluating the correctness of translations remains difficult, as commonly used metrics such as BLEU measure only syntactic similarity, disregarding program semantics. We…

Programming Languages · Computer Science 2026-05-08 Julius Näumann, Sven Keidel, Amir Molzam Sharifloo, Mira Mezini

An Experiment with a User Manual of a Programming Language Based on a Denotational Semantics

Denotational models should provide an opportunity for the revision of current practices seen in the manuals of programming languages. New styles should on the one hand be based on denotational models but on the other not assume that today…

Programming Languages · Computer Science 2019-05-30 Andrzej Blikle

On Robustness of Prompt-based Semantic Parsing with Large Pre-trained Language Model: An Empirical Study on Codex

Semantic parsing is a technique aimed at constructing a structured representation of the meaning of a natural-language question. Recent advancements in few-shot language models trained on code have demonstrated superior performance in…

Computation and Language · Computer Science 2023-03-10 Terry Yue Zhuo, Zhuang Li, Yujin Huang, Fatemeh Shiri, Weiqing Wang, Gholamreza Haffari, Yuan-Fang Li

Mechanized semantics

The goal of this lecture is to show how modern theorem provers (in this case, the Coq proof assistant) can be used to mechanize the specification of programming languages and their semantics, and to reason over individual programs and…

Programming Languages · Computer Science 2010-10-28 Xavier Leroy

Evaluating Quantized Large Language Models for Code Generation on Low-Resource Language Benchmarks

Democratization of AI is an important topic within the broader topic of the digital divide. This issue is relevant to LLMs, which are becoming popular as AI co-pilots but suffer from a lack of accessibility due to high computational demand.…

Software Engineering · Computer Science 2024-10-22 Enkhbold Nyamsuren

Reducing Hallucinations in LLM-Generated Code via Semantic Triangulation

Large language models (LLMs) can generate executable code from natural language descriptions, but the resulting programs frequently contain bugs due to hallucinations. In the absence of formal specifications, existing approaches attempt to…

Software Engineering · Computer Science 2026-03-31 Yihan Dai, Sijie Liang, Haotian Xu, Peichu Xie, Sergey Mechtaev

Automata-based constraints for language model decoding

Language models (LMs) are often expected to generate strings in some formal language; for example, structured data, API calls, or code snippets. Although LMs can be tuned to improve their adherence to formal syntax, this does not guarantee…

Computation and Language · Computer Science 2024-08-06 Terry Koo, Frederick Liu, Luheng He

Improving Symbolic Translation of Language Models for Logical Reasoning

The use of formal language for deductive logical reasoning aligns well with language models (LMs), where translating natural language (NL) into first-order logic (FOL) and employing an external solver results in a verifiable and therefore…

Computation and Language · Computer Science 2026-01-15 Ramya Keerthy Thatikonda, Jiuzhou Han, Wray Buntine, Ehsan Shareghi

Reflection-based language support for the heterogeneous capture and restoration of running computations

This work is devoted to the study of the problem of user-level capture and restoration of running computations in heterogeneous environments. Support for those operations has traditionally been offered through ready-made solutions for…

Programming Languages · Computer Science 2015-03-17 Anolan Milanés, Noemi Rodriguez, Roberto Ierusalimschy

Understanding Tool-Augmented Agents for Lean Formalization: A Factorial Analysis

Automatic translation of natural language mathematics into faithful Lean 4 code is hindered by the fundamental dissonance between informal set-theoretic intuition and strict formal type theory. This gap often causes LLMs to hallucinate…

Software Engineering · Computer Science 2026-04-21 Ke Zhang, Patricio Gallardo, Maziar Raissi, Sudhir Murthy

SLaDe: A Portable Small Language Model Decompiler for Optimized Assembly

Decompilation is a well-studied area with numerous high-quality tools available. These are frequently used for security tasks and to port legacy code. However, they regularly generate difficult-to-read programs and require a large amount of…

Programming Languages · Computer Science 2024-02-16 Jordi Armengol-Estapé, Jackson Woodruff, Chris Cummins, Michael F. P. O'Boyle

Towards a Mathematics Formalisation Assistant using Large Language Models

Mathematics formalisation is the task of writing mathematics (i.e., definitions, theorem statements, proofs) in natural language, as found in books and papers, into a formal language that can then be checked for correctness by a program. It…

Computation and Language · Computer Science 2022-11-15 Ayush Agrawal, Siddhartha Gadgil, Navin Goyal, Ashvni Narayanan, Anand Tadipatri

Meta-Modeling Semantics of UML

The Unified Modelling Language is emerging as a de-facto standard for modelling object-oriented systems. However, the semantics document that is a part of the standard definition primarily provides a description of the language's syntax and…

Software Engineering · Computer Science 2014-09-25 Andy Evans, Kevin Lano, Robert France, Bernhard Rumpe

Formal Specifications from Natural Language

We study the generalization abilities of language models when translating natural language into formal specifications with complex semantics. In particular, we fine-tune language models on three datasets consisting of English sentences and…

Software Engineering · Computer Science 2022-10-21 Christopher Hahn, Frederik Schmitt, Julia J. Tillman, Niklas Metzger, Julian Siber, Bernd Finkbeiner

Operational methods in semantics

The focus of these lecture notes is on abstract models and basic ideas and results that relate to the operational semantics of programming languages largely conceived. The approach is to start with an abstract description of the computation…

Programming Languages · Computer Science 2025-10-15 Roberto M. Amadio

Redex -> Coq: towards a theory of decidability of Redex's reduction semantics

We propose the first steps in the development of a tool to automate the translation of Redex models into a (hopefully) semantically equivalent model in Coq, and to provide tactics to help in the certification of fundamental properties of…

Logic in Computer Science · Computer Science 2024-02-07 Mallku Soldevila, Rodrigo Ribeiro, Beta Ziliani

Visualizing the Evaluation of Functional Programs for Debugging

In this position paper, we present a prototype of a visualizer for functional programs. Such programs, whose evaluation model is the reduction of an expression to a value through repeated application of rewriting rules, and which tend to…

Programming Languages · Computer Science 2024-11-04 John Whitington, Tom Ridge

RomanSetu: Efficiently unlocking multilingual capabilities of Large Language Models via Romanization

This study addresses the challenge of extending Large Language Models (LLMs) to non-English languages that use non-Roman scripts. We propose an approach that utilizes the romanized form of text as an interface for LLMs, hypothesizing that…

Computation and Language · Computer Science 2024-06-25 Jaavid Aktar Husain, Raj Dabre, Aswanth Kumar, Jay Gala, Thanmay Jayakumar, Ratish Puduppully, Anoop Kunchukuttan