Related papers: CommonMorph: Participatory Morphological Documenta…

UniMorph 4.0: Universal Morphology

The Universal Morphology (UniMorph) project is a collaborative effort providing broad-coverage instantiated normalized morphological inflection tables for hundreds of diverse world languages. The project comprises two major thrusts: a…

Computation and Language · Computer Science 2022-06-22 Khuyagbaatar Batsuren , Omer Goldman , Salam Khalifa , Nizar Habash , Witold Kieraś , Gábor Bella , Brian Leonard , Garrett Nicolai , Kyle Gorman , Yustinus Ghanggo Ate , Maria Ryskina , Sabrina J. Mielke , Elena Budianskaya , Charbel El-Khaissi , Tiago Pimentel , Michael Gasser , William Lane , Mohit Raj , Matt Coler , Jaime Rafael Montoya Samame , Delio Siticonatzi Camaiteri , Benoît Sagot , Esaú Zumaeta Rojas , Didier López Francis , Arturo Oncevay , Juan López Bautista , Gema Celeste Silva Villegas , Lucas Torroba Hennigen , Adam Ek , David Guriel , Peter Dirix , Jean-Philippe Bernardy , Andrey Scherbakov , Aziyana Bayyr-ool , Antonios Anastasopoulos , Roberto Zariquiey , Karina Sheifer , Sofya Ganieva , Hilaria Cruz , Ritván Karahóǧa , Stella Markantonatou , George Pavlidis , Matvey Plugaryov , Elena Klyachko , Ali Salehi , Candy Angulo , Jatayu Baxi , Andrew Krizhanovsky , Natalia Krizhanovskaya , Elizabeth Salesky , Clara Vania , Sardana Ivanova , Jennifer White , Rowan Hall Maudslay , Josef Valvoda , Ran Zmigrod , Paula Czarnowska , Irene Nikkarinen , Aelita Salchak , Brijesh Bhatt , Christopher Straughn , Zoey Liu , Jonathan North Washington , Yuval Pinter , Duygu Ataman , Marcin Wolinski , Totok Suhardijanto , Anna Yablonskaya , Niklas Stoehr , Hossep Dolatian , Zahroh Nuriah , Shyam Ratan , Francis M. Tyers , Edoardo M. Ponti , Grant Aiton , Aryaman Arora , Richard J. Hatcher , Ritesh Kumar , Jeremiah Young , Daria Rodionova , Anastasia Yemelina , Taras Andrushko , Igor Marchenko , Polina Mashkovtseva , Alexandra Serova , Emily Prud'hommeaux , Maria Nepomniashchaya , Fausto Giunchiglia , Eleanor Chodroff , Mans Hulden , Miikka Silfverberg , Arya D. McCarthy , David Yarowsky , Ryan Cotterell , Reut Tsarfaty , Ekaterina Vylomova

UniMorph 2.0: Universal Morphology

The Universal Morphology UniMorph project is a collaborative effort to improve how NLP handles complex morphology across the world's languages. The project releases annotated morphological data using a universal tagset, the UniMorph schema.…

Computation and Language · Computer Science 2020-02-26 Christo Kirov , Ryan Cotterell , John Sylak-Glassman , Géraldine Walther , Ekaterina Vylomova , Patrick Xia , Manaal Faruqui , Sabrina J. Mielke , Arya D. McCarthy , Sandra Kübler , David Yarowsky , Jason Eisner , Mans Hulden

Morphological Processing of Low-Resource Languages: Where We Are and What's Next

Automatic morphological processing can aid downstream natural language processing applications, especially for low-resource languages, and assist language documentation efforts for endangered languages. Having long been multilingual, the…

Computation and Language · Computer Science 2022-03-18 Adam Wiemerslage , Miikka Silfverberg , Changbing Yang , Arya D. McCarthy , Garrett Nicolai , Eliana Colunga , Katharina Kann

MetaphorShare: A Dynamic Collaborative Repository of Open Metaphor Datasets

The metaphor studies community has developed numerous valuable labelled corpora in various languages over the years. Many of these resources are not only unknown to the NLP community, but are also often not easily shared among the…

Computation and Language · Computer Science 2025-03-11 Joanne Boisson , Arif Mehmood , Jose Camacho-Collados

Morphology Without Borders: Clause-Level Morphology

Morphological tasks use large multi-lingual datasets that organize words into inflection tables, which then serve as training and evaluation data for various tasks. However, a closer inspection of these data reveals profound…

Computation and Language · Computer Science 2022-10-20 Omer Goldman , Reut Tsarfaty

Recent advancements in computational morphology : A comprehensive survey

Computational morphology handles the language processing at the word level. It is one of the foundational tasks in the NLP pipeline for the development of higher level NLP applications. It mainly deals with the processing of words and word…

Computation and Language · Computer Science 2024-06-11 Jatayu Baxi , Brijesh Bhatt

Interactive tools for making temporally variable, multiple-attributes, and multiple-instances morphing accessible: Flexible manipulation of divergent speech instances for explorational research and education

We generalized a voice morphing algorithm capable of handling temporally variable, multiple-attributes, and multiple instances. The generalized morphing provides a new strategy for investigating speech diversity. However, excessive…

Human-Computer Interaction · Computer Science 2024-04-23 Hideki Kawahara , Masanori Morise

A Data-Centric Framework for Composable NLP Workflows

Empirical natural language processing (NLP) systems in application domains (e.g., healthcare, finance, education) involve interoperation among multiple components, ranging from data ingestion, human annotation, to text retrieval, analysis,…

Computation and Language · Computer Science 2021-09-03 Zhengzhong Liu , Guanxiong Ding , Avinash Bukkittu , Mansi Gupta , Pengzhi Gao , Atif Ahmed , Shikun Zhang , Xin Gao , Swapnil Singhavi , Linwei Li , Wei Wei , Zecong Hu , Haoran Shi , Haoying Zhang , Xiaodan Liang , Teruko Mitamura , Eric P. Xing , Zhiting Hu

Integrating Information About Entities Progressively

Users often have to integrate information about entities from multiple data sources. This task is challenging as each data source may represent information about the same entity in a distinct form, e.g., each data source may use a different…

Databases · Computer Science 2019-10-24 Ben McCamish , Christopher Buss , Arash Termehchy , David Maier

Doc To The Future: Infomorphs for Interactive, Multimodal Document Transformation and Generation

Creating new documents by synthesizing information from existing sources is an important part of knowledge work in many domains. This process often involves gathering content from multiple documents, organizing it, and then transforming it…

Human-Computer Interaction · Computer Science 2026-03-02 Balasaravanan Thoravi Kumaravel

SoundMorpher: Perceptually-Uniform Sound Morphing with Diffusion Model

We present SoundMorpher, an open-world sound morphing method designed to generate perceptually uniform morphing trajectories. Traditional sound morphing techniques typically assume a linear relationship between the morphing factor and sound…

Sound · Computer Science 2024-12-17 Xinlei Niu , Jing Zhang , Charles Patrick Martin

MultiMorph: On-demand Atlas Construction

We present MultiMorph, a fast and efficient method for constructing anatomical atlases on the fly. Atlases capture the canonical structure of a collection of images and are essential for quantifying anatomical variability across…

Computer Vision and Pattern Recognition · Computer Science 2025-04-02 S. Mazdak Abulnaga , Andrew Hoopes , Neel Dey , Malte Hoffmann , Marianne Rakic , Bruce Fischl , John Guttag , Adrian Dalca

Converging Dimensions: Information Extraction and Summarization through Multisource, Multimodal, and Multilingual Fusion

Recent advances in large language models (LLMs) have led to new summarization strategies, offering an extensive toolkit for extracting important information. However, these approaches are frequently limited by their reliance on isolated…

Artificial Intelligence · Computer Science 2024-06-21 Pranav Janjani , Mayank Palan , Sarvesh Shirude , Ninad Shegokar , Sunny Kumar , Faruk Kazi

A Latent Morphology Model for Open-Vocabulary Neural Machine Translation

Translation into morphologically-rich languages challenges neural machine translation (NMT) models with extremely sparse vocabularies where atomic treatment of surface forms is unrealistic. This problem is typically addressed by either…

Computation and Language · Computer Science 2020-02-28 Duygu Ataman , Wilker Aziz , Alexandra Birch

Morph-fitting: Fine-Tuning Word Vector Spaces with Simple Language-Specific Rules

Morphologically rich languages accentuate two properties of distributional vector space models: 1) the difficulty of inducing accurate representations for low-frequency word forms; and 2) insensitivity to distinct lexical relations that…

Computation and Language · Computer Science 2017-06-02 Ivan Vulić , Nikola Mrkšić , Roi Reichart , Diarmuid Ó Séaghdha , Steve Young , Anna Korhonen

Morphological Reinflection with Multiple Arguments: An Extended Annotation schema and a Georgian Case Study

In recent years, a flurry of morphological datasets had emerged, most notably UniMorph, a multi-lingual repository of inflection tables. However, the flat structure of the current morphological annotation schema makes the treatment of some…

Computation and Language · Computer Science 2022-03-22 David Guriel , Omer Goldman , Reut Tsarfaty

Low-resource neural machine translation with morphological modeling

Morphological modeling in neural machine translation (NMT) is a promising approach to achieving open-vocabulary machine translation for morphologically-rich languages. However, existing methods such as sub-word tokenization and…

Computation and Language · Computer Science 2024-04-04 Antoine Nzeyimana

TAMS: Translation-Assisted Morphological Segmentation

Canonical morphological segmentation is the process of analyzing words into the standard (aka underlying) forms of their constituent morphemes. This is a core task in language documentation, and NLP systems have the potential to…

Computation and Language · Computer Science 2024-10-16 Enora Rice , Ali Marashian , Luke Gessler , Alexis Palmer , Katharina von der Wense

Morphology with a Null-Interface

We present an integrated architecture for word-level and sentence-level processing in a unification-based paradigm. The core of the system is a CLP implementation of a unification engine for feature structures supporting relational values.…

cmp-lg · Computer Science 2008-02-03 Harald Trost , Johannes Matiasek

CHAMP: Efficient Annotation and Consolidation of Cluster Hierarchies

Various NLP tasks require a complex hierarchical structure over nodes, where each node is a cluster of items. Examples include generating entailment graphs, hierarchical cross-document coreference resolution, annotating event and subevent…

Computation and Language · Computer Science 2023-11-21 Arie Cattan , Tom Hope , Doug Downey , Roy Bar-Haim , Lilach Eden , Yoav Kantor , Ido Dagan