English
Related papers

Related papers: Unsupervised Morphological Paradigm Completion

200 papers

In this paper, we describe the findings of the SIGMORPHON 2020 shared task on unsupervised morphological paradigm completion (SIGMORPHON 2020 Task 2), a novel task in the field of inflectional morphology. Participants were asked to submit…

Computation and Language · Computer Science 2020-05-29 Katharina Kann , Arya McCarthy , Garrett Nicolai , Mans Hulden

The generation of complex derived word forms has been an overlooked problem in NLP; we fill this gap by applying neural sequence-to-sequence models to the task. We overview the theoretical motivation for a paradigmatic treatment of…

Computation and Language · Computer Science 2025-02-18 Ryan Cotterell , Ekaterina Vylomova , Huda Khayrallah , Christo Kirov , David Yarowsky

This paper focuses on unsupervised modeling of morphological families, collectively comprising a forest over the language vocabulary. This formulation enables us to capture edgewise properties reflecting single-step morphological…

Computation and Language · Computer Science 2017-02-24 Jiaming Luo , Karthik Narasimhan , Regina Barzilay

Critical to natural language generation is the production of correctly inflected text. In this paper, we isolate the task of predicting a fully inflected sentence from its partially lemmatized version. Unlike traditional morphological…

Computation and Language · Computer Science 2019-05-07 Ekaterina Vylomova , Ryan Cotterell , Timothy Baldwin , Trevor Cohn , Jason Eisner

Neural models for the various flavours of morphological inflection tasks have proven to be extremely accurate given ample labeled data -- data that may be slow and costly to obtain. In this work we aim to overcome this annotation bottleneck…

Computation and Language · Computer Science 2021-10-13 Omer Goldman , Reut Tsarfaty

Morpho-syntactic lexicons provide information about the morphological and syntactic roles of words in a language. Such lexicons are not available for all languages and even when available, their coverage can be limited. We present a…

Computation and Language · Computer Science 2016-01-26 Manaal Faruqui , Ryan McDonald , Radu Soricut

Most state-of-the-art systems today produce morphological analysis based only on orthographic patterns. In contrast, we propose a model for unsupervised morphological analysis that integrates orthographic and semantic views of words. We…

Computation and Language · Computer Science 2015-03-10 Karthik Narasimhan , Regina Barzilay , Tommi Jaakkola

We propose to cast the task of morphological inflection - mapping a lemma to an indicated inflected form - for resource-poor languages as a meta-learning problem. Treating each language as a separate task, we use data from high-resource…

Computation and Language · Computer Science 2020-04-29 Katharina Kann , Samuel R. Bowman , Kyunghyun Cho

The use of Deep Neural Network architectures for Language Modeling has recently seen a tremendous increase in interest in the field of NLP with the advent of transfer learning and the shift in focus from rule-based and predictive models…

Computation and Language · Computer Science 2019-12-04 Octavia-Maria Sulea , Steve Young

Publicly available, large pretrained LanguageModels (LMs) generate text with remarkable quality, but only sequentially from left to right. As a result, they are not immediately applicable to generation tasks that break the unidirectional…

Computation and Language · Computer Science 2021-12-28 Peter West , Ximing Lu , Ari Holtzman , Chandra Bhagavatula , Jena Hwang , Yejin Choi

This work treats the paradigm discovery problem (PDP), the task of learning an inflectional morphological system from unannotated sentences. We formalize the PDP and develop evaluation metrics for judging systems. Using currently available…

Computation and Language · Computer Science 2020-05-05 Alexander Erdmann , Micha Elsner , Shijie Wu , Ryan Cotterell , Nizar Habash

The CoNLL-SIGMORPHON 2017 shared task on supervised morphological generation required systems to be trained and tested in each of 52 typologically diverse languages. In sub-task 1, submitted systems were asked to predict a specific…

Morphological inflection generation is the task of generating the inflected form of a given lemma corresponding to a particular linguistic transformation. We model the problem of inflection generation as a character sequence to sequence…

Computation and Language · Computer Science 2016-03-23 Manaal Faruqui , Yulia Tsvetkov , Graham Neubig , Chris Dyer

We present a language independent, unsupervised method for building word embeddings using morphological expansion of text. Our model handles the problem of data sparsity and yields improved word embeddings by relying on training word…

Computation and Language · Computer Science 2017-11-16 Syed Sarfaraz Akhtar , Arihant Gupta , Avijit Vajpayee , Arjit Srivastava , Manish Shrivastava

Recent years have brought great advances into solving morphological tasks, mostly due to powerful neural models applied to various tasks as (re)inflection and analysis. Yet, such morphological tasks cannot be considered solved, especially…

Computation and Language · Computer Science 2023-06-23 David Guriel , Omer Goldman , Reut Tsarfaty

Morphological tasks use large multi-lingual datasets that organize words into inflection tables, which then serve as training and evaluation data for various tasks. However, a closer inspection of these data reveals profound…

Computation and Language · Computer Science 2022-10-20 Omer Goldman , Reut Tsarfaty

Automatic morphological processing can aid downstream natural language processing applications, especially for low-resource languages, and assist language documentation efforts for endangered languages. Having long been multilingual, the…

Computation and Language · Computer Science 2022-03-18 Adam Wiemerslage , Miikka Silfverberg , Changbing Yang , Arya D. McCarthy , Garrett Nicolai , Eliana Colunga , Katharina Kann

Self-supervised objectives have driven major advances in NLP by leveraging large-scale unlabeled data, but such resources are scarce for many of the world's languages. Surprisingly, they have not been explored much for character-level…

Computation and Language · Computer Science 2025-06-06 Adam Wiemerslage , Katharina von der Wense

Lemmatization is a natural language processing (NLP) task which consists of producing, from a given inflected word, its canonical form or lemma. Lemmatization is one of the basic tasks that facilitate downstream NLP applications, and is of…

Computation and Language · Computer Science 2023-10-23 Olia Toporkov , Rodrigo Agerri

This paper presents a joint model for performing unsupervised morphological analysis on words, and learning a character-level composition function from morphemes to word embeddings. Our model splits individual words into segments, and…

Computation and Language · Computer Science 2016-06-09 Kris Cao , Marek Rei
‹ Prev 1 2 3 10 Next ›