Related papers: Mathematical Language Processing Project

Natural Language Premise Selection: Finding Supporting Statements for Mathematical Text

Mathematical text is written using a combination of words and mathematical expressions. This combination, along with a specific way of structuring sentences, makes it challenging for state-of-the-art NLP tools to understand and reason on top of…

Computation and Language · Computer Science 2020-05-01 Deborah Ferreira, Andre Freitas

Mathematical Language Processing: Automatic Grading and Feedback for Open Response Mathematical Questions

While computer and communication technologies have provided effective means to scale up many aspects of education, the submission and grading of assessments such as homework assignments and tests remains a weak link. In this paper, we study…

Machine Learning · Statistics 2015-01-20 Andrew S. Lan, Divyanshu Vats, Andrew E. Waters, Richard G. Baraniuk

Interpreting and Improving Large Language Models in Arithmetic Calculation

Large language models (LLMs) have demonstrated remarkable potential across numerous applications and have shown an emergent ability to tackle complex reasoning tasks, such as mathematical computations. However, even for the simplest…

Computation and Language · Computer Science 2024-09-04 Wei Zhang, Chaoqun Wan, Yonggang Zhang, Yiu-ming Cheung, Xinmei Tian, Xu Shen, Jieping Ye

Solving Math Word Problems Using Estimation Verification and Equation Generation

Large Language Models (LLMs) excel at various tasks, including problem-solving and question-answering. However, LLMs often find Math Word Problems (MWPs) challenging because solving them requires a range of reasoning and mathematical…

Artificial Intelligence · Computer Science 2025-09-24 Mitchell Piehl, Dillon Wilson, Ananya Kalita, Jugal Kalita

Learning to Match Mathematical Statements with Proofs

We introduce a novel task consisting in assigning a proof to a given mathematical statement. The task is designed to improve the processing of research-level mathematical texts. Applying Natural Language Processing (NLP) tools to research…

Computation and Language · Computer Science 2021-02-04 Maximin Coavoux, Shay B. Cohen

MWPRanker: An Expression Similarity Based Math Word Problem Retriever

Math Word Problems (MWPs) in online assessments help test the ability of the learner to make critical inferences by interpreting the linguistic information in them. To test the mathematical reasoning capabilities of the learners, sometimes…

Information Retrieval · Computer Science 2023-07-06 Mayank Goel, Venktesh V, Vikram Goyal

Semantic Representations of Word Senses and Concepts

Representing the semantics of linguistic items in a machine-interpretable form has been a major goal of Natural Language Processing since its earliest days. Among the range of different linguistic items, words have attracted the most…

Computation and Language · Computer Science 2016-08-04 José Camacho-Collados, Ignacio Iacobacci, Roberto Navigli, Mohammad Taher Pilehvar

Learning to Map Sentences to Logical Form: Structured Classification with Probabilistic Categorial Grammars

This paper addresses the problem of mapping natural language sentences to lambda-calculus encodings of their meaning. We describe a learning algorithm that takes as input a training set of sentences labeled with expressions in the lambda…

Computation and Language · Computer Science 2012-07-09 Luke S. Zettlemoyer, Michael Collins

Distributed Representations for Compositional Semantics

The mathematical representation of semantics is a key issue for Natural Language Processing (NLP). A lot of research has been devoted to finding ways of representing the semantics of individual words in vector spaces. Distributional…

Computation and Language · Computer Science 2014-11-13 Karl Moritz Hermann

POS Tagging and its Applications for Mathematics

Content analysis of scientific publications is a nontrivial task, but a useful and important one for scientific information services. In the Gutenberg era it was a domain of human experts; in the digital age many machine-based methods,…

Digital Libraries · Computer Science 2014-06-12 Ulf Schöneberg, Wolfram Sperber

Enhancing Semantic Understanding in Pointer Analysis using Large Language Models

Pointer analysis has been studied for over four decades. However, existing frameworks continue to suffer from the propagation of incorrect facts. A major limitation stems from their insufficient semantic understanding of code, resulting in…

Software Engineering · Computer Science 2025-09-01 Baijun Cheng, Kailong Wang, Ling Shi, Haoyu Wang, Yao Guo, Ding Li, Xiangqun Chen

Unravelling Interlanguage Facts via Explainable Machine Learning

Native language identification (NLI) is the task of training (via supervised machine learning) a classifier that guesses the native language of the author of a text. This task has been extensively researched in the last decade, and the…

Computation and Language · Computer Science 2022-08-03 Barbara Berti, Andrea Esuli, Fabrizio Sebastiani

Hypothesis Generation via LLM-Automated Language Bias for ILP

Inductive Logic Programming (ILP) is a principled approach for generalizing regularities from data and constructing hypotheses as interpretable logic programs. However, a key limitation is its reliance on expert-crafted language bias - the…

Artificial Intelligence · Computer Science 2026-01-21 Yang Yang, Jiemin Wu, Yutao Yue

Large Language Models: A Mathematical Formulation

Large language models (LLMs) process and predict sequences containing text to answer questions, and address tasks including document summarization, providing recommendations, writing software and solving quantitative problems. We provide a…

Numerical Analysis · Mathematics 2026-02-02 Ricardo Baptista, Andrew Stuart, Son Tran

How to Compute the Probability of a Word

Language models (LMs) estimate a probability distribution over strings in a natural language; these distributions are crucial for computing perplexity and surprisal in linguistics research. While we are usually concerned with measuring…

Computation and Language · Computer Science 2024-10-15 Tiago Pimentel, Clara Meister

LLM Processes: Numerical Predictive Distributions Conditioned on Natural Language

Machine learning practitioners often face significant challenges in formally integrating their prior knowledge and beliefs into predictive models, limiting the potential for nuanced and context-aware analyses. Moreover, the expertise needed…

Machine Learning · Statistics 2024-12-23 James Requeima, John Bronskill, Dami Choi, Richard E. Turner, David Duvenaud

An Information-Theoretic Approach to Analyze NLP Classification Tasks

Understanding the importance of the inputs on the output is useful across many tasks. This work provides an information-theoretic framework to analyse the influence of inputs for text classification tasks. Natural language processing (NLP)…

Computation and Language · Computer Science 2024-02-05 Luran Wang, Mark Gales, Vatsal Raina

Understanding and Mitigating Classification Errors Through Interpretable Token Patterns

State-of-the-art NLP methods achieve human-like performance on many tasks, but make errors nevertheless. Characterizing these errors in easily interpretable terms gives insight into whether a classifier is prone to making systematic errors,…

Computation and Language · Computer Science 2023-11-21 Michael A. Hedderich, Jonas Fischer, Dietrich Klakow, Jilles Vreeken

Natural Language Processing: Structure and Complexity

We introduce a method for analyzing the complexity of natural language processing tasks, and for predicting the difficulty of new NLP tasks. Our complexity measures are derived from the Kolmogorov complexity of a class of automata…

cmp-lg · Computer Science 2016-08-31 Wlodek Zadrozny

Natural Language Semantics and Computability

This paper is a reflexion on the computability of natural language semantics. It does not contain a new model or new results in the formal semantics of natural language: it is rather a computational analysis of the logical models and…

Computation and Language · Computer Science 2016-05-16 Richard Moot, Christian Retoré