Related papers: Idiomatic Expression Identification using Semantic…

200 papers

We describe an algorithm for automatic classification of idiomatic and literal expressions. Our starting point is that words in a given text segment, such as a paragraph, that are high-ranking representatives of a common topic of discussion…

Computation and Language · Computer Science 2018-02-28 Jing Peng, Anna Feldman, Ekaterina Vylomova

Idiomatic and figurative language form a large portion of colloquial speech and writing. With social media, this informal language has become more easily observable to people and trainers of large language models (LLMs) alike. While the…

Computation and Language · Computer Science 2025-12-04 Blake Matheny, Phuong Minh Nguyen, Minh Le Nguyen, Stephanie Reynolds

Idiomatic expressions (IEs), characterized by their non-compositionality, are an important part of natural language. They have been a classical challenge to NLP, including pre-trained language models that drive today's state-of-the-art.…

Computation and Language · Computer Science 2022-07-11 Ziheng Zeng, Suma Bhat

Idiomatic expressions are an integral part of human languages, often used to express complex ideas in compressed or conventional ways (e.g. eager beaver as a keen and enthusiastic person). However, their interpretations may not be…

Computation and Language · Computer Science 2024-11-06 Wei He, Tiago Kramer Vieira, Marcos Garcia, Carolina Scarton, Marco Idiart, Aline Villavicencio

Idioms are figurative expressions whose meanings often cannot be inferred from their individual words, making them difficult to process computationally and posing challenges for human experimental studies. This survey reviews datasets…

Computation and Language · Computer Science 2025-08-19 Michael Flor, Xinyi Liu, Anna Feldman

Despite their success in a variety of NLP tasks, pre-trained language models, due to their heavy reliance on compositionality, fail in effectively capturing the meanings of multiword expressions (MWEs), especially idioms. Therefore,…

Computation and Language · Computer Science 2021-09-10 Harish Tayyar Madabushi, Edward Gow-Smith, Carolina Scarton, Aline Villavicencio

The same multi-word expressions may have different meanings in different sentences. They can be mainly divided into two categories, which are literal meaning and idiomatic meaning. Non-contextual-based methods perform poorly on this…

Computation and Language · Computer Science 2022-04-14 Zheng Chu, Ziqing Yang, Yiming Cui, Zhigang Chen, Ming Liu

Idiomatic expressions have always been a bottleneck for language comprehension and natural language understanding, specifically for tasks like Machine Translation (MT). MT systems predominantly produce literal translations of idiomatic…

Computation and Language · Computer Science 2020-06-18 Prateek Saxena, Soma Paul

Idioms are common in everyday language, but often pose a challenge to translators because their meanings do not follow from the meanings of their parts. Despite significant advances, machine translation systems still struggle to translate…

Computation and Language · Computer Science 2023-10-24 Emmy Liu, Aditi Chaudhary, Graham Neubig

Idioms pose a fundamental challenge for language models, as their meaning cannot be inferred from surface form alone. Understanding such expressions, therefore, requires semantic abstraction beyond lexical overlap. We introduce IdioLink, a…

Computation and Language · Computer Science 2026-05-22 Kai Golan Hashiloni, Daniel Fadlon, Lior Livyatan, Ofri Hefetz, Jiahuan Pei, Kfir Bar

In this work, we explore idiomatic language processing with Large Language Models (LLMs). We introduce the Idiomatic language Test Suite IdioTS, a new dataset of difficult examples specifically designed by language experts to assess the…

Computation and Language · Computer Science 2024-05-20 Francesca De Luca Fornaciari, Begoña Altuna, Itziar Gonzalez-Dios, Maite Melero

Figurative language is ubiquitous in English. Yet, the vast majority of NLP research focuses on literal language. Existing text representations by design rely on compositionality, while figurative language is often non-compositional. In…

Computation and Language · Computer Science 2022-03-03 Tuhin Chakrabarty, Yejin Choi, Vered Shwartz

Neural Machine Translation (NMT) has been widely used in recent years with significant improvements for many language pairs. Although state-of-the-art NMT systems are generating progressively better translations, idiom translation remains…

Computation and Language · Computer Science 2018-02-14 Marzieh Fadaee, Arianna Bisazza, Christof Monz

Human processing of idioms relies on understanding the contextual sentences in which idioms occur, as well as language-intrinsic features such as frequency and speaker-intrinsic factors like familiarity. While LLMs have shown high…

Computation and Language · Computer Science 2025-07-17 Maggie Mi, Aline Villavicencio, Nafise Sadat Moosavi

Idiomatic expressions can be problematic for natural language processing applications as their meaning cannot be inferred from their constituting words. A lack of successful methodological approaches and sufficiently large datasets prevents…

Computation and Language · Computer Science 2021-11-11 Tadej Škvorc, Polona Gantar, Marko Robnik-Šikonja

We present a fairly large, Potential Idiomatic Expression (PIE) dataset for Natural Language Processing (NLP) in English. The challenges with NLP systems with regard to tasks such as Machine Translation (MT), word sense disambiguation…

Computation and Language · Computer Science 2022-04-26 Tosin P. Adewumi, Roshanak Vadoodi, Aparajita Tripathy, Konstantina Nikolaidou, Foteini Liwicki, Marcus Liwicki

We investigate the processing of idiomatic expressions in transformer-based language models using a novel set of techniques for circuit discovery and analysis. First discovering circuits via a modified path patching algorithm, we find that…

Computation and Language · Computer Science 2025-11-21 Andrew Gomes

Predicting context-dependent and non-literal utterances like sarcastic and ironic expressions still remains a challenging task in NLP, as it goes beyond linguistic patterns, encompassing common sense and shared knowledge as crucial…

Computation and Language · Computer Science 2018-09-27 Suzana Ilić, Edison Marrese-Taylor, Jorge A. Balazs, Yutaka Matsuo

Why should computers interpret language incrementally? In recent years psycholinguistic evidence for incremental interpretation has become more and more compelling, suggesting that humans perform semantic interpretation before constituent…

cmp-lg · Computer Science 2016-08-31 David Milward, Robin Cooper

We study a new application for text generation -- idiomatic sentence generation -- which aims to transfer literal phrases in sentences into their idiomatic counterparts. Inspired by psycholinguistic theories of idiom use in one's native…

Computation and Language · Computer Science 2021-05-12 Jianing Zhou, Hongyu Gong, Srihari Nanniyur, Suma Bhat