Related papers: A framework for lexical representation

Towards a Common Framework for Autoformalization

Autoformalization has emerged as a term referring to the automation of formalization, specifically the formalization of mathematics using interactive theorem provers (proof assistants). Its rapid development has been driven by progress in…

Artificial Intelligence · Computer Science 2025-12-16 Agnieszka Mensfelt, David Tena Cucala, Santiago Franco, Angeliki Koutsoukou-Argyraki, Vince Trencsenyi, Kostas Stathis

A Formalisation of Abstract Argumentation in Higher-Order Logic

We present an approach for representing abstract argumentation frameworks based on an encoding into classical higher-order logic. This provides a uniform framework for computer-assisted assessment of abstract argumentation frameworks using…

Artificial Intelligence · Computer Science 2021-10-19 Alexander Steen, David Fuenmayor

Large Language Models: A Mathematical Formulation

Large language models (LLMs) process and predict sequences containing text to answer questions, and address tasks including document summarization, providing recommendations, writing software and solving quantitative problems. We provide a…

Numerical Analysis · Mathematics 2026-02-02 Ricardo Baptista, Andrew Stuart, Son Tran

GRAMPAL: A Morphological Processor for Spanish implemented in Prolog

A model for the full treatment of Spanish inflection for verbs, nouns and adjectives is presented. This model is based on feature unification and it relies upon a lexicon of allomorphs both for stems and morphemes. Word forms are built by…

cmp-lg · Computer Science 2016-08-15 Antonio Moreno, José M. Goñi

Lexicalization and Grammar Development

In this paper we present a fully lexicalized grammar formalism as a particularly attractive framework for the specification of natural language grammars. We discuss in detail Feature-based, Lexicalized Tree Adjoining Grammars (FB-LTAGs), a…

cmp-lg · Computer Science 2008-02-03 B. Srinivas, Dania Egedi, Christy Doran, Tilman Becker

A Layered Implementation Framework for Regular Languages

I present the most fundamental features of an implemented system designed to manipulate representations of regular languages. The system is structured into two layers, allowing regular languages to be represented in an increasingly compact,…

Formal Languages and Automata Theory · Computer Science 2025-09-24 Baudouin Le Charlier

Accelerating Multilingual Language Model for Excessively Tokenized Languages

Recent advancements in large language models (LLMs) have remarkably enhanced performances on a variety of tasks in multiple languages. However, tokenizers in LLMs trained primarily on English-centric corpora often overly fragment a text…

Computation and Language · Computer Science 2024-08-07 Jimin Hong, Gibbeum Lee, Jaewoong Cho

Compositional Morphology for Word Representations and Language Modelling

This paper presents a scalable method for integrating compositional morphological representations into a vector-based probabilistic language model. Our approach is evaluated in the context of log-bilinear language models, rendered suitably…

Computation and Language · Computer Science 2014-05-19 Jan A. Botha, Phil Blunsom

The Typology of Polysemy: A Multilingual Distributional Framework

Lexical semantic typology has identified important cross-linguistic generalizations about the variation and commonalities in polysemy patterns: how languages package up meanings into words. Recent computational research has enabled…

Computation and Language · Computer Science 2020-06-04 Ella Rabinovich, Yang Xu, Suzanne Stevenson

A Survey of Languages for Formalizing Mathematics

In order to work with mathematical content in computer systems, it is necessary to represent it in formal languages. Ideally, these are supported by tools that verify the correctness of the content, allow computing with it, and produce…

Logic in Computer Science · Computer Science 2020-05-27 Cezary Kaliszyk, Florian Rabe

Unifying Inference-Time Planning Language Generation

A line of work in planning uses LLMs not to generate a plan, but to generate a formal representation in some planning language, which can be input into a symbolic solver to deterministically find a plan. While showing improved trust and…

Computation and Language · Computer Science 2025-10-08 Prabhu Prakash Kagitha, Bo Sun, Ishan Desai, Andrew Zhu, Cassie Huang, Manling Li, Ziyang Li, Li Zhang

A Unified Example-Based and Lexicalist Approach to Machine Translation

We present an approach to Machine Translation that combines the ideas and methodologies of the Example-Based and Lexicalist theoretical frameworks. The approach has been implemented in a multilingual Machine Translation system.

Computation and Language · Computer Science 2007-05-23 Davide Turcato, Paul McFetridge, Fred Popowich, Janine Toole

A Freely Available Wide Coverage Morphological Analyzer for English

This paper presents a morphological lexicon for English that handles more than 317000 inflected forms derived from over 90000 stems. The lexicon is available in two formats. The first can be used by an implementation of a two-level…

cmp-lg · Computer Science 2008-02-03 Daniel Karp, Yves Schabes, Martin Zaidel, Dania Egedi

A Unified Multilingual Handwriting Recognition System using multigrams sub-lexical units

We address the design of a unified multilingual system for handwriting recognition. Most multilingual systems rest on specialized models that are trained on a single language, and one of them is selected at test time. While some…

Computer Vision and Pattern Recognition · Computer Science 2018-08-29 Wassim Swaileh, Yann Soullard, Thierry Paquet

Unified Form Language: A domain-specific language for weak formulations of partial differential equations

We present the Unified Form Language (UFL), which is a domain-specific language for representing weak formulations of partial differential equations with a view to numerical approximation. Features of UFL include support for variational…

Mathematical Software · Computer Science 2013-04-29 Martin S. Alnaes, Anders Logg, Kristian B. Oelgaard, Marie E. Rognes, Garth N. Wells

REMARK-LLM: A Robust and Efficient Watermarking Framework for Generative Large Language Models

We present REMARK-LLM, a novel, efficient, and robust watermarking framework designed for texts generated by large language models (LLMs). Synthesizing human-like content using LLMs necessitates vast computational resources and extensive…

Cryptography and Security · Computer Science 2024-04-09 Ruisi Zhang, Shehzeen Samarah Hussain, Paarth Neekhara, Farinaz Koushanfar

Autoformalization in the Era of Large Language Models: A Survey

Autoformalization, the process of transforming informal mathematical propositions into verifiable formal representations, is a foundational task in automated theorem proving, offering a new perspective on the use of mathematics in both…

Artificial Intelligence · Computer Science 2025-07-04 Ke Weng, Lun Du, Sirui Li, Wangyue Lu, Haozhe Sun, Hengyu Liu, Tiancheng Zhang

Consistent Autoformalization for Constructing Mathematical Libraries

Autoformalization is the task of automatically translating mathematical content written in natural language to a formal language expression. The growing language interpretation capabilities of Large Language Models (LLMs), including in…

Computation and Language · Computer Science 2025-06-16 Lan Zhang, Xin Quan, Andre Freitas

A Concept-Based Explainability Framework for Large Multimodal Models

Large multimodal models (LMMs) combine unimodal encoders and large language models (LLMs) to perform multimodal tasks. Despite recent advancements towards the interpretability of these models, understanding internal representations of LMMs…

Machine Learning · Computer Science 2024-12-03 Jayneel Parekh, Pegah Khayatan, Mustafa Shukor, Alasdair Newson, Matthieu Cord

FACE: A General Framework for Mapping Collaborative Filtering Embeddings into LLM Tokens

Recently, large language models (LLMs) have been explored for integration with collaborative filtering (CF)-based recommendation systems, which are crucial for personalizing user experiences. However, a key challenge is that LLMs struggle…

Information Retrieval · Computer Science 2025-10-20 Chao Wang, Yixin Song, Jinhui Ye, Chuan Qin, Dazhong Shen, Lingfeng Liu, Xiang Wang, Yanyong Zhang