Related papers: Numerically Grounded Language Models for Semantic …

Clinical Text Prediction with Numerically Grounded Conditional Language Models

Assisted text input techniques can save time and effort and improve text quality. In this paper, we investigate how grounded and conditional extensions to standard neural language models can bring improvements in the tasks of word…

Computation and Language · Computer Science 2016-10-21 Georgios P. Spithourakis , Steffen E. Petersen , Sebastian Riedel

Towards Computationally Verifiable Semantic Grounding for Language Models

The paper presents an approach to semantic grounding of language models (LMs) that conceptualizes the LM as a conditional model generating text given a desired semantic message formalized as a set of entity-relationship triples. It embeds…

Computation and Language · Computer Science 2022-11-17 Chris Alberti , Kuzman Ganchev , Michael Collins , Sebastian Gehrmann , Ciprian Chelba

Grammatical Error Generation Based on Translated Fragments

We perform neural machine translation of sentence fragments in order to create large amounts of training data for English grammatical error correction. Our method aims at simulating mistakes made by second language learners, and produces a…

Computation and Language · Computer Science 2021-04-21 Eetu Sjöblom , Mathias Creutz , Teemu Vahtola

Probing Semantic Grounding in Language Models of Code with Representational Similarity Analysis

Representational Similarity Analysis is a method from cognitive neuroscience, which helps in comparing representations from two different sources of data. In this paper, we propose using Representational Similarity Analysis to probe the…

Computation and Language · Computer Science 2022-07-19 Shounak Naik , Rajaswa Patil , Swati Agarwal , Veeky Baths

Neural Network Translation Models for Grammatical Error Correction

Phrase-based statistical machine translation (SMT) systems have previously been used for the task of grammatical error correction (GEC) to achieve state-of-the-art accuracy. The superiority of SMT systems comes from their ability to learn…

Computation and Language · Computer Science 2016-06-02 Shamil Chollampatt , Kaveh Taghipour , Hwee Tou Ng

Interpretability for Language Learners Using Example-Based Grammatical Error Correction

Grammatical Error Correction (GEC) should not focus only on high accuracy of corrections but also on interpretability for language learning. However, existing neural-based GEC models mainly aim at improving accuracy, and their…

Computation and Language · Computer Science 2022-03-15 Masahiro Kaneko , Sho Takase , Ayana Niwa , Naoaki Okazaki

Getting Started with Neural Models for Semantic Matching in Web Search

The vocabulary mismatch problem is a long-standing problem in information retrieval. Semantic matching holds the promise of solving the problem. Recent advances in language technology have given rise to unsupervised neural models for…

Information Retrieval · Computer Science 2016-11-11 Kezban Dilek Onal , Ismail Sengor Altingovde , Pinar Karagoz , Maarten de Rijke

Attention-based Encoder-Decoder Networks for Spelling and Grammatical Error Correction

Automatic spelling and grammatical correction systems are one of the most widely used tools within natural language applications. In this thesis, we assume the task of error correction as a type of monolingual machine translation where the…

Computation and Language · Computer Science 2018-10-02 Sina Ahmadi

The Unreasonable Effectiveness of Transformer Language Models in Grammatical Error Correction

Recent work on Grammatical Error Correction (GEC) has highlighted the importance of language modeling in that it is certainly possible to achieve good performance by comparing the probabilities of the proposed edits. At the same time,…

Computation and Language · Computer Science 2019-06-06 Dimitrios Alikaniotis , Vipul Raheja

Spelling Error Correction Using a Nested RNN Model and Pseudo Training Data

We propose a nested recurrent neural network (nested RNN) model for English spelling error correction and generate pseudo data based on phonetic similarity to train it. The model fuses orthographic information and context as a whole and is…

Computation and Language · Computer Science 2018-11-02 Hao Li , Yang Wang , Xinyu Liu , Zhichao Sheng , Si Wei

Misspelling Correction with Pre-trained Contextual Language Model

Spelling irregularities, known now as spelling mistakes, have been found for several centuries. As humans, we are able to understand most of the misspelled words based on their location in the sentence, perceived pronunciation, and context.…

Computation and Language · Computer Science 2021-01-12 Yifei Hu , Xiaonan Jing , Youlim Ko , Julia Taylor Rayz

Exploring the Capacity of a Large-scale Masked Language Model to Recognize Grammatical Errors

In this paper, we explore the capacity of a language model-based method for grammatical error detection in detail. We first show that 5 to 10% of training data are enough for a BERT-based error detection method to achieve performance…

Computation and Language · Computer Science 2021-08-30 Ryo Nagata , Manabu Kimura , Kazuaki Hanawa

Contextual Grounding of Natural Language Entities in Images

In this paper, we introduce a contextual grounding approach that captures the context in corresponding text entities and image regions to improve the grounding accuracy. Specifically, the proposed architecture accepts pre-trained text token…

Computer Vision and Pattern Recognition · Computer Science 2019-11-07 Farley Lai , Ning Xie , Derek Doran , Asim Kadav

A Simple but Effective Classification Model for Grammatical Error Correction

We treat grammatical error correction (GEC) as a classification problem in this study, where for different types of errors, a target word is identified, and the classifier predicts the correct word form from a set of possible choices. We…

Computation and Language · Computer Science 2018-07-03 Zhu Kaili , Chuan Wang , Ruobing Li , Yang Liu , Tianlei Hu , Hui Lin

Physics of Language Models: Part 2.2, How to Learn From Mistakes on Grade-School Math Problems

Language models have demonstrated remarkable performance in solving reasoning tasks; however, even the strongest models still occasionally make reasoning mistakes. Recently, there has been active research aimed at improving reasoning…

Computation and Language · Computer Science 2024-08-30 Tian Ye , Zicheng Xu , Yuanzhi Li , Zeyuan Allen-Zhu

Evaluation and Enhancement of Semantic Grounding in Large Vision-Language Models

Large Vision-Language Models (LVLMs) offer remarkable benefits for a variety of vision-language tasks. However, a challenge hindering their application in real-world scenarios, particularly regarding safety, robustness, and reliability, is…

Computer Vision and Pattern Recognition · Computer Science 2024-01-17 Jiaying Lu , Jinmeng Rao , Kezhen Chen , Xiaoyuan Guo , Yawen Zhang , Baochen Sun , Carl Yang , Jie Yang

Teaching Language Models to Check Grounded Claim Factuality with Human Test-Taking Strategies

Grounded claim factuality checking is important for large language model (LLM) applications such as retrieval-augmented generation, as it helps users assess the correctness of generated outputs. Existing metrics using entailment classifiers…

Computation and Language · Computer Science 2026-05-29 Yuxuan Ye , Raul Santos-Rodriguez , Edwin Simpson

Neuron Patching: Semantic-based Neuron-level Language Model Repair for Code Generation

Language Models (LMs) have become widely used in software engineering, especially for tasks such as code generation, where they are referred to as code LMs. These models have proven effective in generating code, making it easier for…

Software Engineering · Computer Science 2024-11-21 Jian Gu , Aldeida Aleti , Chunyang Chen , Hongyu Zhang

Exploiting Unlabeled Data for Neural Grammatical Error Detection

Identifying and correcting grammatical errors in the text written by non-native writers has received increasing attention in recent years. Although a number of annotated corpora have been established to facilitate data-driven grammatical…

Computation and Language · Computer Science 2016-11-30 Zhuoran Liu , Yang Liu

Laying Anchors: Semantically Priming Numerals in Language Modeling

Off-the-shelf pre-trained language models have become the de facto standard in NLP pipelines for a multitude of downstream tasks. However, the inability of these models to properly encode numerals limits their performance on tasks requiring…

Computation and Language · Computer Science 2024-08-09 Mandar Sharma , Rutuja Murlidhar Taware , Pravesh Koirala , Nikhil Muralidhar , Naren Ramakrishnan