Related papers: Yet Another Format of Universal Dependencies for K…

Enhancing Korean Dependency Parsing with Morphosyntactic Features

This paper introduces UniDive for Korean, an integrated framework that bridges Universal Dependencies (UD) and Universal Morphology (UniMorph) to enhance the representation and processing of Korean morphosyntax. Korean's rich inflectional…

Computation and Language · Computer Science 2025-03-28 Jungyeul Park, Yige Chen, Kyuwon Kim, KyungTae Lim, Chulwoo Park

K-UD: Revising Korean Universal Dependencies Guidelines

Critique has surfaced concerning the existing linguistic annotation framework for Korean Universal Dependencies (UDs), particularly in relation to syntactic relationships. In this paper, our primary objective is to refine the definition of…

Computation and Language · Computer Science 2024-12-03 Kyuwon Kim, Yige Chen, Eunkyul Leah Jo, KyungTae Lim, Jungyeul Park, Chulwoo Park

Analysis of the Penn Korean Universal Dependency Treebank (PKT-UD): Manual Revision to Build Robust Parsing Model in Korean

In this paper, we first open on important issues regarding the Penn Korean Universal Treebank (PKT-UD) and address these issues by revising the entire corpus manually with the aim of producing cleaner UD annotations that are more faithful…

Computation and Language · Computer Science 2020-05-27 Tae Hwan Oh, Ji Yoon Han, Hyonsu Choe, Seokwon Park, Han He, Jinho D. Choi, Na-Rae Han, Jena D. Hwang, Hansaem Kim

K-UniMorph: Korean Universal Morphology and its Feature Schema

We present in this work a new Universal Morphology dataset for Korean. Previously, the Korean language has been underrepresented in the field of morphological paradigms amongst hundreds of diverse world languages. Hence, we propose this…

Computation and Language · Computer Science 2023-05-18 Eunkyul Leah Jo, Kyuwon Kim, Xihan Wu, KyungTae Lim, Jungyeul Park, Chulwoo Park

A resource-based Korean morphological annotation system

We describe a resource-based method of morphological annotation of written Korean text. Korean is an agglutinative language. The output of our system is a graph of morphemes annotated with accurate linguistic information. The language…

Computation and Language · Computer Science 2007-11-22 Hyun-Gue Huh, Eric Laporte

Second language Korean Universal Dependency treebank v1.2: Focus on data augmentation and annotation scheme refinement

We expand the second language (L2) Korean Universal Dependencies (UD) treebank with 5,454 manually annotated sentences. The annotation guidelines are also revised to better align with the UD framework. Using this enhanced treebank, we…

Computation and Language · Computer Science 2025-03-20 Hakyung Sung, Gyu-Ho Shin

Chart-driven Connectionist Categorial Parsing of Spoken Korean

While most of the speech and natural language systems which were developed for English and other Indo-European languages neglect the morphological processing and integrate speech and natural language at the word level, for the agglutinative…

cmp-lg · Computer Science 2008-02-03 WonIl Lee, Geunbae Lee, Jong-Hyeok Lee

Parsing Universal Dependencies without training

We propose UDP, the first training-free parser for Universal Dependencies (UD). Our algorithm is based on PageRank and a small set of head attachment rules. It features two-step decoding to guarantee that function words are attached as leaf…

Computation and Language · Computer Science 2017-01-13 Héctor Martínez Alonso, Željko Agić, Barbara Plank, Anders Søgaard

Integrated speech and morphological processing in a connectionist continuous speech understanding for Korean

A new tightly coupled speech and natural language integration model is presented for a TDNN-based continuous possibly large vocabulary speech recognition system for Korean. Unlike popular n-best techniques developed for integrating mainly…

cmp-lg · Computer Science 2008-02-03 Geunbae Lee, Jong-Hyeok Lee

Universal Semantic Parsing

Universal Dependencies (UD) offer a uniform cross-lingual syntactic representation, with the aim of advancing multilingual applications. Recent work shows that semantic parsing can be accomplished by transforming syntactic dependencies to…

Computation and Language · Computer Science 2017-08-30 Siva Reddy, Oscar Täckström, Slav Petrov, Mark Steedman, Mirella Lapata

A Syllable-based Technique for Word Embeddings of Korean Words

Word embedding has become a fundamental component to many NLP tasks such as named entity recognition and machine translation. However, popular models that learn such embeddings are unaware of the morphology of words, so it is not directly…

Computation and Language · Computer Science 2017-08-08 Sanghyuk Choi, Taeuk Kim, Jinseok Seol, Sang-goo Lee

Morphological annotation of Korean with Directly Maintainable Resources

This article describes an exclusively resource-based method of morphological annotation of written Korean text. Korean is an agglutinative language. Our annotator is designed to process text before the operation of a syntactic parser. In…

Computation and Language · Computer Science 2007-11-22 Ivan Berlocher, Hyun-Gue Huh, Eric Laporte, Jee-Sun Nam

A Hybrid Approach to Dependency Parsing: Combining Rules and Morphology with Deep Learning

Fully data-driven, deep learning-based models are usually designed as language-independent and have been shown to be successful for many natural language processing tasks. However, when the studied language is low-resourced and the amount…

Computation and Language · Computer Science 2022-09-21 Şaziye Betül Özateş, Arzucan Özgür, Tunga Güngör, Balkız Öztürk

UD-KSL Treebank v1.3: A semi-automated framework for aligning XPOS-extracted units with UPOS tags

The present study extends recent work on Universal Dependencies annotations for second-language (L2) Korean by introducing a semi-automated framework that identifies morphosyntactic constructions from XPOS sequences and aligns those…

Computation and Language · Computer Science 2025-06-12 Hakyung Sung, Gyu-Ho Shin, Chanyoung Lee, You Kyung Sung, Boo Kyung Jung

Universal Dependencies to Logical Forms with Negation Scope

Many language technology applications would benefit from the ability to represent negation and its scope on top of widely-used linguistic resources. In this paper, we investigate the possibility of obtaining a first-order logic…

Computation and Language · Computer Science 2017-02-14 Federico Fancellu, Siva Reddy, Adam Lopez, Bonnie Webber

Universal Dependencies v2: An Evergrowing Multilingual Treebank Collection

Universal Dependencies is an open community effort to create cross-linguistically consistent treebank annotation for many languages within a dependency-based lexicalist framework. The annotation consists in a linguistically motivated word…

Computation and Language · Computer Science 2020-04-23 Joakim Nivre, Marie-Catherine de Marneffe, Filip Ginter, Jan Hajič, Christopher D. Manning, Sampo Pyysalo, Sebastian Schuster, Francis Tyers, Daniel Zeman

A Compositional Typed Semantics for Universal Dependencies

Languages may encode similar meanings using different sentence structures. This makes it a challenge to provide a single set of formal rules that can derive meanings from sentences in many languages at once. To overcome the challenge, we…

Computation and Language · Computer Science 2024-03-05 Laurestine Bradford, Timothy John O'Donnell, Siva Reddy

Improving Korean NLP Tasks with Linguistically Informed Subword Tokenization and Sub-character Decomposition

We introduce a morpheme-aware subword tokenization method that utilizes sub-character decomposition to address the challenges of applying Byte Pair Encoding (BPE) to Korean, a language characterized by its rich morphology and unique writing…

Computation and Language · Computer Science 2023-11-08 Taehee Jeon, Bongseok Yang, Changhwan Kim, Yoonseob Lim

Phonological modeling for continuous speech recognition in Korean

A new scheme to represent phonological changes during continuous speech recognition is suggested. A phonological tag coupled with its morphological tag is designed to represent the conditions of Korean phonological changes. A pairwise…

cmp-lg · Computer Science 2008-02-03 WonIl Lee, Geunbae Lee, Jong-Hyeok Lee

Marrying Universal Dependencies and Universal Morphology

The Universal Dependencies (UD) and Universal Morphology (UniMorph) projects each present schemata for annotating the morphosyntactic details of language. Each project also provides corpora of annotated text in many languages - UD at the…

Computation and Language · Computer Science 2019-10-28 Arya D. McCarthy, Miikka Silfverberg, Ryan Cotterell, Mans Hulden, David Yarowsky