Related papers: Lexical State Analyzer

A Lexical Analysis Tool with Ambiguity Support

Lexical ambiguities naturally arise in languages. We present Lamb, a lexical analyzer that produces a lexical analysis graph describing all the possible sequences of tokens that can be found within the input string. Parsers can process such…

Computation and Language · Computer Science 2012-03-01 Luis Quesada , Fernando Berzal , Francisco J. Cortijo

Learning and Enforcing Context-Sensitive Control for LLMs

Controlling the output of Large Language Models (LLMs) through context-sensitive constraints has emerged as a promising approach to overcome the limitations of Context-Free Grammars (CFGs) in guaranteeing generation validity. However, such…

Computation and Language · Computer Science 2026-04-14 Mohammad Albinhassan , Pranava Madhyastha , Mark Law , Alessandra Russo

Taming Context-Sensitive Languages with Principled Stateful Parsing

Historically, true context-sensitive parsing has seldom been applied to programming languages, due to its inherent complexity. However, many mainstream programming and markup languages (C, Haskell, Python, XML, and more) possess…

Programming Languages · Computer Science 2016-09-20 Nicolas Laurent , Kim Mens

Do Not Fire the Linguist: Grammatical Profiles Help Language Models Detect Semantic Change

Morphological and syntactic changes in word usage (as captured, e.g., by grammatical profiles) have been shown to be good predictors of a word's meaning change. In this work, we explore whether large pre-trained contextualised language…

Computation and Language · Computer Science 2022-04-13 Mario Giulianelli , Andrey Kutuzov , Lidia Pivovarova

A context sensitive real-time Spell Checker with language adaptability

We present a novel language adaptable spell checking system which detects spelling errors and suggests context sensitive corrections in real-time. We show that our system can be extended to new languages with minimal language-specific…

Computation and Language · Computer Science 2019-10-25 Prabhakar Gupta

Learning Context-aware Classifier for Semantic Segmentation

Semantic segmentation is still a challenging task for parsing diverse contexts in different scenes, thus the fixed classifier might not be able to well address varying feature distributions during testing. Different from the mainstream…

Computer Vision and Pattern Recognition · Computer Science 2023-03-22 Zhuotao Tian , Jiequan Cui , Li Jiang , Xiaojuan Qi , Xin Lai , Yixin Chen , Shu Liu , Jiaya Jia

Lexical Features Are More Vulnerable, Syntactic Features Have More Predictive Power

Understanding the vulnerability of linguistic features extracted from noisy text is important for both developing better health text classification models and for interpreting vulnerabilities of natural language models. In this paper, we…

Computation and Language · Computer Science 2019-10-02 Jekaterina Novikova , Aparna Balagopalan , Ksenia Shkaruta , Frank Rudzicz

Context-sensitive Spelling Correction Using Google Web 1T 5-Gram Information

In computing, spell checking is the process of detecting and sometimes providing spelling suggestions for incorrectly spelled words in a text. Basically, a spell checker is a computer program that uses a dictionary of words to perform spell…

Computation and Language · Computer Science 2012-04-27 Youssef Bassil , Mohammad Alwani

Grammatical Error Correction: A Survey of the State of the Art

Grammatical Error Correction (GEC) is the task of automatically detecting and correcting errors in text. The task not only includes the correction of grammatical errors, such as missing prepositions and mismatched subject-verb agreement,…

Computation and Language · Computer Science 2023-12-05 Christopher Bryant , Zheng Yuan , Muhammad Reza Qorib , Hannan Cao , Hwee Tou Ng , Ted Briscoe

Evaluating statistical language models as pragmatic reasoners

The relationship between communicated language and intended meaning is often probabilistic and sensitive to context. Numerous strategies attempt to estimate such a mapping, often leveraging recursive Bayesian models of communication. In…

Computation and Language · Computer Science 2023-05-03 Benjamin Lipkin , Lionel Wong , Gabriel Grand , Joshua B Tenenbaum

Machine Learning Techniques with Ontology for Subjective Answer Evaluation

Computerized Evaluation of English Essays is performed using Machine learning techniques like Latent Semantic Analysis (LSA), Generalized LSA, Bilingual Evaluation Understudy and Maximum Entropy. Ontology, a concept map of domain knowledge,…

Artificial Intelligence · Computer Science 2016-05-10 M. Syamala Devi , Himani Mittal

SAGE:Specification-Aware Grammar Extraction for Automated Test Case Generation with LLMs

Grammar-based test case generation has proven effective for competitive programming problems, but generating valid and general grammars from natural language specifications remains a key challenge, especially under limited supervision.…

Computation and Language · Computer Science 2025-06-16 Aditi , Hyunwoo Park , Sicheol Sung , Yo-Sub Han , Sang-Ki Ko

Grammar Repair with Examples and Tree Automata: Extended Version

Context-free grammars (CFGs) are the de-facto formalism for declaratively describing concrete syntax for programming languages and generating parsers. One of the major challenges in defining a desired syntax is ruling out all possible…

Programming Languages · Computer Science 2026-02-23 Yunjeong Lee , Gokul Rajiv , Ilya Sergey

Are Large Language Models Good In-context Learners for Financial Sentiment Analysis?

Recently, large language models (LLMs) with hundreds of billions of parameters have demonstrated the emergent ability, surpassing traditional methods in various domains even without fine-tuning over domain-specific data. However, when it…

Computation and Language · Computer Science 2025-03-10 Xinyu Wei , Luojia Liu

Targeted Syntactic Evaluation of Language Models

We present a dataset for evaluating the grammaticality of the predictions of a language model. We automatically construct a large number of minimally different pairs of English sentences, each consisting of a grammatical and an…

Computation and Language · Computer Science 2018-08-29 Rebecca Marvin , Tal Linzen

Investigating the Shortcomings of LLMs in Step-by-Step Legal Reasoning

Reasoning abilities of LLMs have been a key focus in recent years. One challenging reasoning domain with interesting nuances is legal reasoning, which requires careful application of rules, and precedents while balancing deductive and…

Computation and Language · Computer Science 2025-02-11 Venkatesh Mishra , Bimsara Pathiraja , Mihir Parmar , Sat Chidananda , Jayanth Srinivasa , Gaowen Liu , Ali Payani , Chitta Baral

An Enhanced Latent Semantic Analysis Approach for Arabic Document Summarization

The fast-growing amount of information on the Internet makes the research in automatic document summarization very urgent. It is an effective solution for information overload. Many approaches have been proposed based on different…

Computation and Language · Computer Science 2018-08-01 Kamal Al-Sabahi , Zuping Zhang , Jun Long , Khaled Alwesabi

How Does Latent Semantic Analysis Work? A Visualisation Approach

By using a small example, an analogy to photographic compression, and a simple visualization using heatmaps, we show that latent semantic analysis (LSA) is able to extract what appears to be semantic meaning of words from a set of documents…

Computation and Language · Computer Science 2014-02-05 Jan Koeman , William Rea

Ungrammatical-syntax-based In-context Example Selection for Grammatical Error Correction

In the era of large language models (LLMs), in-context learning (ICL) stands out as an effective prompting strategy that explores LLMs' potency across various tasks. However, applying LLMs to grammatical error correction (GEC) is still a…

Computation and Language · Computer Science 2024-03-29 Chenming Tang , Fanyi Qu , Yunfang Wu

Detecting Text Formality: A Study of Text Classification Approaches

Formality is one of the important characteristics of text documents. The automatic detection of the formality level of a text is potentially beneficial for various natural language processing tasks. Before, two large-scale datasets were…

Computation and Language · Computer Science 2023-09-11 Daryna Dementieva , Nikolay Babakov , Alexander Panchenko