English
Related papers

Related papers: Morphological annotation of Korean with Directly M…

200 papers

We describe a resource-based method of morphological annotation of written Korean text. Korean is an agglutinative language. The output of our system is a graph of morphemes annotated with accurate linguistic information. The language…

Computation and Language · Computer Science 2007-11-22 Hyun-Gue Huh , Eric Laporte

Due to the fact that Korean is a highly agglutinative, character-rich language, previous work on Korean morphological analysis typically employs the use of sub-character features known as graphemes or otherwise utilizes comprehensive prior…

Computation and Language · Computer Science 2018-06-29 Andrew Matteson , Chanhee Lee , Young-Bum Kim , Heuiseok Lim

While most of the speech and natural language systems which were developed for English and other Indo-European languages neglect the morphological processing and integrate speech and natural language at the word level, for the agglutinative…

cmp-lg · Computer Science 2008-02-03 WonIl Lee , Geunbae Lee , Jong-Hyeok Lee

Intention identification is a core issue in dialog management. However, due to the non-canonicality of the spoken language, it is difficult to extract the content automatically from the conversation-style utterances. This is much more…

Computation and Language · Computer Science 2019-07-10 Won Ik Cho , Young Ki Moon , Woo Hyun Kang , Nam Soo Kim

In the paper, we propose a novel way of improving named entity recognition in the Korean language using its language-specific features. While the field of named entity recognition has been studied extensively in recent years, the mechanism…

Computation and Language · Computer Science 2024-05-15 Yige Chen , KyungTae Lim , Jungyeul Park

Word embedding has become a fundamental component to many NLP tasks such as named entity recognition and machine translation. However, popular models that learn such embeddings are unaware of the morphology of words, so it is not directly…

Computation and Language · Computer Science 2017-08-08 Sanghyuk Choi , Taeuk Kim , Jinseok Seol , Sang-goo Lee

A new tightly coupled speech and natural language integration model is presented for a TDNN-based continuous possibly large vocabulary speech recognition system for Korean. Unlike popular n-best techniques developed for integrating mainly…

cmp-lg · Computer Science 2008-02-03 Geunbae Lee , Jong-Hyeok Lee

This paper describes word {segmentation} granularity in Korean language processing. From a word separated by blank space, which is termed an eojeol, to a sequence of morphemes in Korean, there are multiple possible levels of word…

Computation and Language · Computer Science 2023-09-08 Jungyeul Park , Mija Kim

We present in this work a new Universal Morphology dataset for Korean. Previously, the Korean language has been underrepresented in the field of morphological paradigms amongst hundreds of diverse world languages. Hence, we propose this…

Computation and Language · Computer Science 2023-05-18 Eunkyul Leah Jo , Kyuwon Kim , Xihan Wu , KyungTae Lim , Jungyeul Park , Chulwoo Park

Research on Korean grammatical error correction (GEC) is limited, compared to other major languages such as English. We attribute this problematic circumstance to the lack of a carefully designed evaluation benchmark for Korean GEC. In this…

Computation and Language · Computer Science 2023-05-25 Soyoung Yoon , Sungjoon Park , Gyuwan Kim , Junhee Cho , Kihyo Park , Gyutae Kim , Minjoon Seo , Alice Oh

This paper introduces UniDive for Korean, an integrated framework that bridges Universal Dependencies (UD) and Universal Morphology (UniMorph) to enhance the representation and processing of Korean {morphosyntax}. Korean's rich inflectional…

Computation and Language · Computer Science 2025-03-28 Jungyeul Park , Yige Chen , Kyuwon Kim , KyungTae Lim , Chulwoo Park

Most of the post-processing methods for character recognition rely on contextual information of character and word-fragment levels. However, due to linguistic characteristics of Korean, such low-level information alone is not sufficient for…

cmp-lg · Computer Science 2008-02-03 Geunbae Lee , Jong-Hyeok Lee , JinHee Yoo

For readability and disambiguation of the written text, appropriate word segmentation is recommended for documentation, and it also holds for the digitized texts. If the language is agglutinative while far from scriptio continua, for…

Computation and Language · Computer Science 2021-05-05 Won Ik Cho , Sung Jun Cheon , Woo Hyun Kang , Ji Won Kim , Nam Soo Kim

This memoir explores two fundamental aspects of Natural Language Processing (NLP): the creation of linguistic resources and the evaluation of NLP system performance. Over the past decade, my work has focused on developing a morpheme-based…

Computation and Language · Computer Science 2026-02-16 Jungyeul Park

In this study, we propose a morpheme-based scheme for Korean dependency parsing and adopt the proposed scheme to Universal Dependencies. We present the linguistic rationale that illustrates the motivation and the necessity of adopting the…

Computation and Language · Computer Science 2022-09-21 Yige Chen , Eunkyul Leah Jo , Yundong Yao , KyungTae Lim , Miikka Silfverberg , Francis M. Tyers , Jungyeul Park

We propose a simple yet effective approach for improving Korean word representations using additional linguistic annotation (i.e. Hanja). We employ cross-lingual transfer learning in training word representations by leveraging the fact that…

Computation and Language · Computer Science 2019-11-01 Kang Min Yoo , Taeuk Kim , Sang-goo Lee

Automatic font generation (AFG) is the process of creating a new font using only a few examples of the style images. Generating fonts for complex languages like Korean and Chinese, particularly in handwritten styles, presents significant…

Computer Vision and Pattern Recognition · Computer Science 2025-05-01 Abdul Sami , Avinash Kumar , Irfanullah Memon , Youngwon Jo , Muhammad Rizwan , Jaeyoung Choi

In this study, we introduce KOPL, a novel framework for handling Korean OOV words with Phoneme representation Learning. Our work is based on the linguistic property of Korean as a phonemic script, the high correlation between phonemes and…

Computation and Language · Computer Science 2025-07-08 Nayeon Kim , Eojin Jeon , Jun-Hyung Park , SangKeun Lee

Syntactic elements, such as word order and case markers, are fundamental in natural language processing. Recent studies show that syntactic information boosts language model performance and offers clues for people to understand their…

Computation and Language · Computer Science 2024-07-15 Jong Myoung Kim , Young-Jun Lee , Yong-jin Han , Sangkeun Jung , Ho-Jin Choi

In this paper we present a lexicon-based approach to the problem of morphological processing. Full-form words, lemmas and grammatical tags are interconnected in a DAWG. Thus, the process of analysis/synthesis is reduced to a search in the…

Computation and Language · Computer Science 2007-05-23 Kyriakos N. Sgarbas , Nikos D. Fakotakis , George K. Kokkinakis
‹ Prev 1 2 3 10 Next ›