Related papers: CommonMorph: Participatory Morphological Documenta…

200 papers

The Universal Morphology (UniMorph) project is a collaborative effort providing broad-coverage instantiated normalized morphological inflection tables for hundreds of diverse world languages. The project comprises two major thrusts: a…

Computation and Language · Computer Science 2022-06-22 Khuyagbaatar Batsuren , Omer Goldman , Salam Khalifa , Nizar Habash , Witold Kieraś , Gábor Bella , Brian Leonard , Garrett Nicolai , Kyle Gorman , Yustinus Ghanggo Ate , Maria Ryskina , Sabrina J. Mielke , Elena Budianskaya , Charbel El-Khaissi , Tiago Pimentel , Michael Gasser , William Lane , Mohit Raj , Matt Coler , Jaime Rafael Montoya Samame , Delio Siticonatzi Camaiteri , Benoît Sagot , Esaú Zumaeta Rojas , Didier López Francis , Arturo Oncevay , Juan López Bautista , Gema Celeste Silva Villegas , Lucas Torroba Hennigen , Adam Ek , David Guriel , Peter Dirix , Jean-Philippe Bernardy , Andrey Scherbakov , Aziyana Bayyr-ool , Antonios Anastasopoulos , Roberto Zariquiey , Karina Sheifer , Sofya Ganieva , Hilaria Cruz , Ritván Karahóǧa , Stella Markantonatou , George Pavlidis , Matvey Plugaryov , Elena Klyachko , Ali Salehi , Candy Angulo , Jatayu Baxi , Andrew Krizhanovsky , Natalia Krizhanovskaya , Elizabeth Salesky , Clara Vania , Sardana Ivanova , Jennifer White , Rowan Hall Maudslay , Josef Valvoda , Ran Zmigrod , Paula Czarnowska , Irene Nikkarinen , Aelita Salchak , Brijesh Bhatt , Christopher Straughn , Zoey Liu , Jonathan North Washington , Yuval Pinter , Duygu Ataman , Marcin Wolinski , Totok Suhardijanto , Anna Yablonskaya , Niklas Stoehr , Hossep Dolatian , Zahroh Nuriah , Shyam Ratan , Francis M. Tyers , Edoardo M. Ponti , Grant Aiton , Aryaman Arora , Richard J. Hatcher , Ritesh Kumar , Jeremiah Young , Daria Rodionova , Anastasia Yemelina , Taras Andrushko , Igor Marchenko , Polina Mashkovtseva , Alexandra Serova , Emily Prud'hommeaux , Maria Nepomniashchaya , Fausto Giunchiglia , Eleanor Chodroff , Mans Hulden , Miikka Silfverberg , Arya D. McCarthy , David Yarowsky , Ryan Cotterell , Reut Tsarfaty , Ekaterina Vylomova

The Universal Morphology (UniMorph) project is a collaborative effort to improve how NLP handles complex morphology across the world's languages. The project releases annotated morphological data using a universal tagset, the UniMorph schema.…

Automatic morphological processing can aid downstream natural language processing applications, especially for low-resource languages, and assist language documentation efforts for endangered languages. Having long been multilingual, the…

Computation and Language · Computer Science 2022-03-18 Adam Wiemerslage , Miikka Silfverberg , Changbing Yang , Arya D. McCarthy , Garrett Nicolai , Eliana Colunga , Katharina Kann

The metaphor studies community has developed numerous valuable labelled corpora in various languages over the years. Many of these resources are not only unknown to the NLP community, but are also often not easily shared among the…

Computation and Language · Computer Science 2025-03-11 Joanne Boisson , Arif Mehmood , Jose Camacho-Collados

Morphological tasks use large multi-lingual datasets that organize words into inflection tables, which then serve as training and evaluation data for various tasks. However, a closer inspection of these data reveals profound…

Computation and Language · Computer Science 2022-10-20 Omer Goldman , Reut Tsarfaty

Computational morphology handles the language processing at the word level. It is one of the foundational tasks in the NLP pipeline for the development of higher level NLP applications. It mainly deals with the processing of words and word…

Computation and Language · Computer Science 2024-06-11 Jatayu Baxi , Brijesh Bhatt

We generalized a voice morphing algorithm capable of handling temporally variable, multiple-attributes, and multiple instances. The generalized morphing provides a new strategy for investigating speech diversity. However, excessive…

Human-Computer Interaction · Computer Science 2024-04-23 Hideki Kawahara , Masanori Morise

Empirical natural language processing (NLP) systems in application domains (e.g., healthcare, finance, education) involve interoperation among multiple components, ranging from data ingestion, human annotation, to text retrieval, analysis,…

Users often have to integrate information about entities from multiple data sources. This task is challenging as each data source may represent information about the same entity in a distinct form, e.g., each data source may use a different…

Databases · Computer Science 2019-10-24 Ben McCamish , Christopher Buss , Arash Termehchy , David Maier

Creating new documents by synthesizing information from existing sources is an important part of knowledge work in many domains. This process often involves gathering content from multiple documents, organizing it, and then transforming it…

Human-Computer Interaction · Computer Science 2026-03-02 Balasaravanan Thoravi Kumaravel

We present SoundMorpher, an open-world sound morphing method designed to generate perceptually uniform morphing trajectories. Traditional sound morphing techniques typically assume a linear relationship between the morphing factor and sound…

Sound · Computer Science 2024-12-17 Xinlei Niu , Jing Zhang , Charles Patrick Martin

We present MultiMorph, a fast and efficient method for constructing anatomical atlases on the fly. Atlases capture the canonical structure of a collection of images and are essential for quantifying anatomical variability across…

Computer Vision and Pattern Recognition · Computer Science 2025-04-02 S. Mazdak Abulnaga , Andrew Hoopes , Neel Dey , Malte Hoffmann , Marianne Rakic , Bruce Fischl , John Guttag , Adrian Dalca

Recent advances in large language models (LLMs) have led to new summarization strategies, offering an extensive toolkit for extracting important information. However, these approaches are frequently limited by their reliance on isolated…

Artificial Intelligence · Computer Science 2024-06-21 Pranav Janjani , Mayank Palan , Sarvesh Shirude , Ninad Shegokar , Sunny Kumar , Faruk Kazi

Translation into morphologically-rich languages challenges neural machine translation (NMT) models with extremely sparse vocabularies where atomic treatment of surface forms is unrealistic. This problem is typically addressed by either…

Computation and Language · Computer Science 2020-02-28 Duygu Ataman , Wilker Aziz , Alexandra Birch

Morphologically rich languages accentuate two properties of distributional vector space models: 1) the difficulty of inducing accurate representations for low-frequency word forms; and 2) insensitivity to distinct lexical relations that…

Computation and Language · Computer Science 2017-06-02 Ivan Vulić , Nikola Mrkšić , Roi Reichart , Diarmuid Ó Séaghdha , Steve Young , Anna Korhonen

In recent years, a flurry of morphological datasets has emerged, most notably UniMorph, a multi-lingual repository of inflection tables. However, the flat structure of the current morphological annotation schema makes the treatment of some…

Computation and Language · Computer Science 2022-03-22 David Guriel , Omer Goldman , Reut Tsarfaty

Morphological modeling in neural machine translation (NMT) is a promising approach to achieving open-vocabulary machine translation for morphologically-rich languages. However, existing methods such as sub-word tokenization and…

Computation and Language · Computer Science 2024-04-04 Antoine Nzeyimana

Canonical morphological segmentation is the process of analyzing words into the standard (aka underlying) forms of their constituent morphemes. This is a core task in language documentation, and NLP systems have the potential to…

Computation and Language · Computer Science 2024-10-16 Enora Rice , Ali Marashian , Luke Gessler , Alexis Palmer , Katharina von der Wense

We present an integrated architecture for word-level and sentence-level processing in a unification-based paradigm. The core of the system is a CLP implementation of a unification engine for feature structures supporting relational values.…

cmp-lg · Computer Science 2008-02-03 Harald Trost , Johannes Matiasek

Various NLP tasks require a complex hierarchical structure over nodes, where each node is a cluster of items. Examples include generating entailment graphs, hierarchical cross-document coreference resolution, annotating event and subevent…

Computation and Language · Computer Science 2023-11-21 Arie Cattan , Tom Hope , Doug Downey , Roy Bar-Haim , Lilach Eden , Yoav Kantor , Ido Dagan