Related papers: Pattern Based Term Extraction Using ACABIT System

Pattern Matching and Discourse Processing in Information Extraction from Japanese Text

Information extraction is the task of automatically picking up information of interest from an unconstrained text. Information of interest is usually extracted in two steps. First, sentence level processing locates relevant pieces of…

Artificial Intelligence · Computer Science 2008-02-03 T. Kitani, Y. Eriguchi, M. Hara

Japanese Sentiment Classification using a Tree-Structured Long Short-Term Memory with Attention

Previous approaches to training syntax-based sentiment classification models required phrase-level annotated corpora, which are not readily available in many languages other than English. Thus, we propose the use of tree-structured Long…

Computation and Language · Computer Science 2018-10-02 Ryosuke Miyazaki, Mamoru Komachi

Feature-Less End-to-End Nested Term Extraction

In this paper, we propose a deep learning-based end-to-end method for domain-specific automatic term extraction (ATE). It considers possible term spans within a fixed length in the sentence and predicts whether they can be…

Computation and Language · Computer Science 2019-09-10 Yuze Gao, Yu Yuan

Extracting linguistic speech patterns of Japanese fictional characters using subword units

This study extracted and analyzed the linguistic speech patterns that characterize Japanese anime or game characters. Conventional morphological analyzers, such as MeCab, segment words with high performance, but they are unable to segment…

Computation and Language · Computer Science 2022-03-08 Mika Kishino, Kanako Komiya

The Recent Advances in Automatic Term Extraction: A survey

Automatic term extraction (ATE) is a Natural Language Processing (NLP) task that eases the effort of manually identifying terms from domain-specific corpora by providing a list of candidate terms. As units of knowledge in a specific field…

Computation and Language · Computer Science 2023-01-18 Hanh Thi Hong Tran, Matej Martinc, Jaya Caporusso, Antoine Doucet, Senja Pollak

A Machine-Learning Approach to Estimating the Referential Properties of Japanese Noun Phrases

The referential properties of noun phrases in the Japanese language, which has no articles, are useful for article generation in Japanese-English machine translation and for anaphora resolution in Japanese noun phrases. They are generally…

Computation and Language · Computer Science 2007-05-23 Masaki Murata, Kiyotaka Uchimoto, Qing Ma, Hitoshi Isahara

Abstract Generation based on Rhetorical Structure Extraction

We have developed an automatic abstract generation system for Japanese expository writings based on rhetorical structure extraction. The system first extracts the rhetorical structure, the compound of the rhetorical relations between…

cmp-lg · Computer Science 2008-02-03 Kenji Ono, Kazuo Sumita, Seiji Miike (Research and Development Center, Toshiba Corporation, Komukai-Toshiba-cho 1, Saiwai-ku, Kawasaki, 210, Japan)

Pattern-based Subterm Selection in Isabelle

This article presents a pattern-based language designed to select (a set of) subterms of a given term in a concise and robust way. Building on this language, we implement a single-step rewriting tactic in the Isabelle theorem prover, which…

Logic in Computer Science · Computer Science 2021-11-09 Lars Noschinski, Christoph Traut

Automatically Suggesting Diverse Example Sentences for L2 Japanese Learners Using Pre-Trained Language Models

Providing example sentences that are diverse and aligned with learners' proficiency levels is essential for fostering effective language acquisition. This study examines the use of Pre-trained Language Models (PLMs) to produce example…

Computation and Language · Computer Science 2025-06-05 Enrico Benedetti, Akiko Aizawa, Florian Boudin

Preference Learning in Terminology Extraction: A ROC-based approach

A key data preparation step in Text Mining, Term Extraction selects the terms, or collocation of words, attached to specific concepts. In this paper, the task of extracting relevant collocations is achieved through a supervised learning…

Machine Learning · Computer Science 2016-08-16 Jérôme Azé, Mathieu Roche, Yves Kodratoff, Michèle Sebag

Construction of a Japanese Word Similarity Dataset

An evaluation of distributed word representation is generally conducted using a word similarity task and/or a word analogy task. There are many datasets readily available for these tasks in English. However, evaluating distributed…

Computation and Language · Computer Science 2018-02-23 Yuya Sakaizawa, Mamoru Komachi

Possessive Pronouns as Determiners in Japanese-to-English Machine Translation

Possessive pronouns are used as determiners in English when no equivalent would be used in a Japanese sentence with the same meaning. This paper proposes a heuristic method of generating such possessive pronouns even when there is no…

cmp-lg · Computer Science 2008-02-03 Francis Bond, Kentaro Ogura, Satoru Ikehara

Bilingual Terminology Extraction Using Multi-level Termhood

Purpose: Terminology is the set of technical words or expressions used in specific contexts, which denotes the core concept in a formal discipline and is usually applied in the fields of machine translation, information retrieval,…

Computation and Language · Computer Science 2013-02-20 Chengzhi Zhang, Dan Wu

Japanese/English Cross-Language Information Retrieval: Exploration of Query Translation and Transliteration

Cross-language information retrieval (CLIR), where queries and documents are in different languages, has of late become one of the major topics within the information retrieval community. This paper proposes a Japanese/English CLIR system,…

Computation and Language · Computer Science 2007-05-23 Atsushi Fujii, Tetsuya Ishikawa

Analysis of Japanese Compound Nouns using Collocational Information

Analyzing compound nouns is one of the crucial issues for natural language processing systems, in particular for those systems that aim at a wide coverage of domains. In this paper, we propose a method to analyze structures of Japanese…

cmp-lg · Computer Science 2008-02-03 Kobayasi Yosiyuki, Tokunaga Takenobu, Tanaka Hozumi

An Aspect Extraction Framework using Different Embedding Types, Learning Models, and Dependency Structure

Aspect-based sentiment analysis has gained significant attention in recent years due to its ability to provide fine-grained insights for sentiment expressions related to specific features of entities. An important component of aspect-based…

Computation and Language · Computer Science 2025-03-06 Ali Erkan, Tunga Güngör

Pattern Sampling for Shapelet-based Time Series Classification

Subsequence-based time series classification algorithms provide accurate and interpretable models, but training these models is extremely computation intensive. The asymptotic time complexity of subsequence-based algorithms remains a…

Machine Learning · Computer Science 2021-02-18 Atif Raza, Stefan Kramer

Exploring Cultures through Pattern Mining - Practices from Generative Beauty Workshops

This paper presents a method for understanding personal ways of thinking and doing in daily lives among different countries by mining their ways as patterns in a sense of pattern language. Pattern language is a methodology of describing…

Computers and Society · Computer Science 2015-03-04 Jei-Hee Hong, Yuma Akado, Sakurako Kogure, Alice Sasabe, Keishi Saruwatari, Takashi Iba

A Rule-Based Approach For Aligning Japanese-Spanish Sentences From A Comparable Corpora

The performance of a Statistical Machine Translation (SMT) system is directly proportional to the quality and size of the parallel corpus it uses. However, for some language pairs there is a considerable lack of such corpora. The long…

Computation and Language · Computer Science 2012-11-20 Jessica C. Ramírez, Yuji Matsumoto

Neural Machine Translation Model with a Large Vocabulary Selected by Branching Entropy

Neural machine translation (NMT), a new approach to machine translation, has achieved promising results comparable to those of traditional approaches such as statistical machine translation (SMT). Despite its recent success, NMT cannot…

Computation and Language · Computer Science 2017-09-07 Zi Long, Ryuichiro Kimura, Takehito Utsuro, Tomoharu Mitsuhashi, Mikio Yamamoto