Related papers: Syntactic variation of support verb constructions

Paracompositionality, MWEs and Argument Substitution

Multi-word expressions, verb-particle constructions, idiomatically combining phrases, and phrasal idioms have something in common: not all of their elements contribute to the argument structure of the predicate implicated by the expression.…

Computation and Language · Computer Science 2018-05-23 Cem Bozsahin, Arzu Burcu Guven

Semantics of Multiword Expressions in Transformer-Based Models: A Survey

Multiword expressions (MWEs) are composed of multiple words and exhibit variable degrees of compositionality. As such, their meanings are notoriously difficult to model, and it is unclear to what extent this issue affects transformer…

Computation and Language · Computer Science 2024-01-30 Filip Miletić, Sabine Schulte im Walde

Corpus-based Method for Automatic Identification of Support Verbs for Nominalizations

Nominalization is a highly productive phenomenon in most languages. The process of nominalization ejects a verb from its syntactic role into a nominal position. The original verb is often replaced by a semantically emptied support verb…

cmp-lg · Computer Science 2016-08-31 Gregory Grefenstette, Simone Teufel

Choosing features for classifying multiword expressions

Multiword expressions (MWEs) are a heterogeneous set with a glaring need for classifications. Designing a satisfactory classification involves choosing features. In the case of MWEs, many features are a priori available. Not all features…

Computation and Language · Computer Science 2026-05-13 Eric Laporte

To Be or Not To Be a Verbal Multiword Expression: A Quest for Discriminating Features

Automatic identification of multiword expressions (MWEs) is a pre-requisite for semantically-oriented downstream applications. This task is challenging because MWEs, especially verbal ones (VMWEs), exhibit surface variability. However, this…

Computation and Language · Computer Science 2020-07-23 Caroline Pasquer, Agata Savary, Jean-Yves Antoine, Carlos Ramisch, Nicolas Labroche, Arnaud Giacometti

CoAM: Corpus of All-Type Multiword Expressions

Multiword expressions (MWEs) refer to idiomatic sequences of multiple words. MWE identification, i.e., detecting MWEs in text, can play a key role in downstream tasks such as machine translation, but existing datasets for the task are…

Computation and Language · Computer Science 2025-07-11 Yusuke Ide, Joshua Tanner, Adam Nohejl, Jacob Hoffman, Justin Vasselli, Hidetaka Kamigaito, Taro Watanabe

A Curious Class of Adpositional Multiword Expressions in Korean

Multiword expressions (MWEs) have been widely studied in cross-lingual annotation frameworks such as PARSEME. However, Korean MWEs remain underrepresented in these efforts. In particular, Korean multiword adpositions lack systematic…

Computation and Language · Computer Science 2026-02-19 Junghyun Min, Na-Rae Han, Jena D. Hwang, Nathan Schneider

Unsupervised Multilingual Word Embeddings

Multilingual Word Embeddings (MWEs) represent words from multiple languages in a single distributional vector space. Unsupervised MWE (UMWE) methods acquire multilingual embeddings without cross-lingual supervision, which is a significant…

Computation and Language · Computer Science 2018-09-07 Xilun Chen, Claire Cardie

Unsupervised Paraphrasing of Multiword Expressions

We propose an unsupervised approach to paraphrasing multiword expressions (MWEs) in context. Our model employs only monolingual corpus data and pre-trained language models (without fine-tuning), and does not make use of any external…

Computation and Language · Computer Science 2023-06-05 Takashi Wada, Yuji Matsumoto, Timothy Baldwin, Jey Han Lau

Detecting Multiword Expression Type Helps Lexical Complexity Assessment

Multiword expressions (MWEs) represent lexemes that should be treated as single lexical units due to their idiosyncratic nature. Multiple NLP applications have been shown to benefit from MWE identification, however the research on lexical…

Computation and Language · Computer Science 2020-05-13 Ekaterina Kochmar, Sian Gooding, Matthew Shardlow

A Multivariate Model for Representing Semantic Non-compositionality

Semantically non-compositional phrases constitute an intriguing research topic in Natural Language Processing. Semantic non-compositionality -- the situation when the meaning of a phrase cannot be derived from the meaning of its components,…

Computation and Language · Computer Science 2019-08-16 Meghdad Farahmand

Unsupervised Discovery of Unaccusative and Unergative Verbs

We present an unsupervised method to detect English unergative and unaccusative verbs. These categories allow us to identify verbs participating in the causative-inchoative alternation without knowing the semantic roles of the verb. The…

Computation and Language · Computer Science 2021-11-02 Sharid Loáiciga, Luca Bevacqua, Christian Hardmeier

AStitchInLanguageModels: Dataset and Methods for the Exploration of Idiomaticity in Pre-Trained Language Models

Despite their success in a variety of NLP tasks, pre-trained language models, due to their heavy reliance on compositionality, fail in effectively capturing the meanings of multiword expressions (MWEs), especially idioms. Therefore,…

Computation and Language · Computer Science 2021-09-10 Harish Tayyar Madabushi, Edward Gow-Smith, Carolina Scarton, Aline Villavicencio

BERT(s) to Detect Multiword Expressions

Multiword expressions (MWEs) present groups of words in which the meaning of the whole is not derived from the meaning of its parts. The task of processing MWEs is crucial in many natural language processing (NLP) applications, including…

Computation and Language · Computer Science 2022-08-17 Damith Premasiri, Tharindu Ranasinghe

A Data-Driven Approach to Idiomaticity Based on Experts' Criteria in Theoretical Linguistics

The article observes data analysis of 286 multi-word expressions (MWEs) based on 16 lexical, grammatical and other criteria described in theoretical books and papers on the notion of idiomaticity. MWEs were collected from the same…

Computation and Language · Computer Science 2026-05-20 Elena Mikhalkova, Anastasiya Vishnyakova, Anastasiya Drozdova, Polina Gavin, Aleksander Zhmykhov, Timofey Protasov

Joint Semantic Synthesis and Morphological Analysis of the Derived Word

Much like sentences are composed of words, words themselves are composed of smaller units. For example, the English word questionably can be analyzed as question+able+ly. However, this structural decomposition of the word does not directly…

Computation and Language · Computer Science 2018-11-13 Ryan Cotterell, Hinrich Schütze

Word Representations, Tree Models and Syntactic Functions

Word representations induced from models with discrete latent variables (e.g. HMMs) have been shown to be beneficial in many NLP applications. In this work, we exploit labeled syntactic dependency trees and formalize the induction problem…

Computation and Language · Computer Science 2016-02-08 Simon Šuster, Gertjan van Noord, Ivan Titov

Cross-Linguistic Syntactic Evaluation of Word Prediction Models

A range of studies have concluded that neural word prediction models can distinguish grammatical from ungrammatical sentences with high accuracy. However, these studies are based primarily on monolingual evidence from English. To…

Computation and Language · Computer Science 2020-05-22 Aaron Mueller, Garrett Nicolai, Panayiota Petrou-Zeniou, Natalia Talmina, Tal Linzen

Dancing with Deer: A Constructional Perspective on MWEs in the Era of LLMs

In this chapter, we argue for the benefits of understanding multiword expressions from the perspective of usage-based, construction grammar approaches. We begin with a historical overview of how construction grammar was developed in order…

Computation and Language · Computer Science 2025-08-25 Claire Bonial, Julia Bonn, Harish Tayyar Madabushi

Death and Lightness: Using a Demographic Model to Find Support Verbs

Some verbs have a particular kind of binary ambiguity: they can carry their normal, full meaning, or they can be merely acting as a prop for the nominal object. It has been suggested that there is a detectable pattern in the relationship…

cmp-lg · Computer Science 2008-02-03 Mark Dras, Mike Johnson