English
Related papers

Related papers: Yet Another Format of Universal Dependencies for K…

200 papers

This paper introduces UniDive for Korean, an integrated framework that bridges Universal Dependencies (UD) and Universal Morphology (UniMorph) to enhance the representation and processing of Korean {morphosyntax}. Korean's rich inflectional…

Computation and Language · Computer Science 2025-03-28 Jungyeul Park , Yige Chen , Kyuwon Kim , KyungTae Lim , Chulwoo Park

Critique has surfaced concerning the existing linguistic annotation framework for Korean Universal Dependencies (UDs), particularly in relation to syntactic relationships. In this paper, our primary objective is to refine the definition of…

Computation and Language · Computer Science 2024-12-03 Kyuwon Kim , Yige Chen , Eunkyul Leah Jo , KyungTae Lim , Jungyeul Park , Chulwoo Park

In this paper, we first open on important issues regarding the Penn Korean Universal Treebank (PKT-UD) and address these issues by revising the entire corpus manually with the aim of producing cleaner UD annotations that are more faithful…

Computation and Language · Computer Science 2020-05-27 Tae Hwan Oh , Ji Yoon Han , Hyonsu Choe , Seokwon Park , Han He , Jinho D. Choi , Na-Rae Han , Jena D. Hwang , Hansaem Kim

We present in this work a new Universal Morphology dataset for Korean. Previously, the Korean language has been underrepresented in the field of morphological paradigms amongst hundreds of diverse world languages. Hence, we propose this…

Computation and Language · Computer Science 2023-05-18 Eunkyul Leah Jo , Kyuwon Kim , Xihan Wu , KyungTae Lim , Jungyeul Park , Chulwoo Park

We describe a resource-based method of morphological annotation of written Korean text. Korean is an agglutinative language. The output of our system is a graph of morphemes annotated with accurate linguistic information. The language…

Computation and Language · Computer Science 2007-11-22 Hyun-Gue Huh , Eric Laporte

We expand the second language (L2) Korean Universal Dependencies (UD) treebank with 5,454 manually annotated sentences. The annotation guidelines are also revised to better align with the UD framework. Using this enhanced treebank, we…

Computation and Language · Computer Science 2025-03-20 Hakyung Sung , Gyu-Ho Shin

While most of the speech and natural language systems which were developed for English and other Indo-European languages neglect the morphological processing and integrate speech and natural language at the word level, for the agglutinative…

cmp-lg · Computer Science 2008-02-03 WonIl Lee , Geunbae Lee , Jong-Hyeok Lee

We propose UDP, the first training-free parser for Universal Dependencies (UD). Our algorithm is based on PageRank and a small set of head attachment rules. It features two-step decoding to guarantee that function words are attached as leaf…

Computation and Language · Computer Science 2017-01-13 Héctor Martínez Alonso , Željko Agić , Barbara Plank , Anders Søgaard

A new tightly coupled speech and natural language integration model is presented for a TDNN-based continuous possibly large vocabulary speech recognition system for Korean. Unlike popular n-best techniques developed for integrating mainly…

cmp-lg · Computer Science 2008-02-03 Geunbae Lee , Jong-Hyeok Lee

Universal Dependencies (UD) offer a uniform cross-lingual syntactic representation, with the aim of advancing multilingual applications. Recent work shows that semantic parsing can be accomplished by transforming syntactic dependencies to…

Computation and Language · Computer Science 2017-08-30 Siva Reddy , Oscar Täckström , Slav Petrov , Mark Steedman , Mirella Lapata

Word embedding has become a fundamental component to many NLP tasks such as named entity recognition and machine translation. However, popular models that learn such embeddings are unaware of the morphology of words, so it is not directly…

Computation and Language · Computer Science 2017-08-08 Sanghyuk Choi , Taeuk Kim , Jinseok Seol , Sang-goo Lee

This article describes an exclusively resource-based method of morphological annotation of written Korean text. Korean is an agglutinative language. Our annotator is designed to process text before the operation of a syntactic parser. In…

Computation and Language · Computer Science 2007-11-22 Ivan Berlocher , Hyun-Gue Huh , Eric Laporte , Jee-Sun Nam

Fully data-driven, deep learning-based models are usually designed as language-independent and have been shown to be successful for many natural language processing tasks. However, when the studied language is low-resourced and the amount…

Computation and Language · Computer Science 2022-09-21 Şaziye Betül Özateş , Arzucan Özgür , Tunga Güngör , Balkız Öztürk

The present study extends recent work on Universal Dependencies annotations for second-language (L2) Korean by introducing a semi-automated framework that identifies morphosyntactic constructions from XPOS sequences and aligns those…

Computation and Language · Computer Science 2025-06-12 Hakyung Sung , Gyu-Ho Shin , Chanyoung Lee , You Kyung Sung , Boo Kyung Jung

Many language technology applications would benefit from the ability to represent negation and its scope on top of widely-used linguistic resources. In this paper, we investigate the possibility of obtaining a first-order logic…

Computation and Language · Computer Science 2017-02-14 Federico Fancellu , Siva Reddy , Adam Lopez , Bonnie Webber

Universal Dependencies is an open community effort to create cross-linguistically consistent treebank annotation for many languages within a dependency-based lexicalist framework. The annotation consists in a linguistically motivated word…

Languages may encode similar meanings using different sentence structures. This makes it a challenge to provide a single set of formal rules that can derive meanings from sentences in many languages at once. To overcome the challenge, we…

Computation and Language · Computer Science 2024-03-05 Laurestine Bradford , Timothy John O'Donnell , Siva Reddy

We introduce a morpheme-aware subword tokenization method that utilizes sub-character decomposition to address the challenges of applying Byte Pair Encoding (BPE) to Korean, a language characterized by its rich morphology and unique writing…

Computation and Language · Computer Science 2023-11-08 Taehee Jeon , Bongseok Yang , Changhwan Kim , Yoonseob Lim

A new scheme to represent phonological changes during continuous speech recognition is suggested. A phonological tag coupled with its morphological tag is designed to represent the conditions of Korean phonological changes. A pairwise…

cmp-lg · Computer Science 2008-02-03 WonIl Lee , Geunbae Lee , Jong-Hyeok Lee

The Universal Dependencies (UD) and Universal Morphology (UniMorph) projects each present schemata for annotating the morphosyntactic details of language. Each project also provides corpora of annotated text in many languages - UD at the…

Computation and Language · Computer Science 2019-10-28 Arya D. McCarthy , Miikka Silfverberg , Ryan Cotterell , Mans Hulden , David Yarowsky
‹ Prev 1 2 3 10 Next ›