Related papers: Improving Term Extraction Using Particle Swarm Opt…

User-Oriented Summaries Using a PSO Based Scoring Optimization Method

Automatic text summarization tools have a great impact on many fields, such as medicine, law, and scientific research in general. As information overload increases, automatic summaries allow handling the growing volume of documents, usually…

Machine Learning · Computer Science 2019-06-28 Augusto Villa-Monte , Laura Lanzarini , Aurelio F. Bariviera , José A. Olivas

Unsupervised Technical Domain Terms Extraction using Term Extractor

Terminology extraction, also known as term extraction, is a subtask of information extraction. The goal of terminology extraction is to extract relevant words or phrases from a given corpus automatically. This paper focuses on the…

Computation and Language · Computer Science 2021-01-25 Suman Dowlagar , Radhika Mamidi

Bilingual Terminology Extraction Using Multi-level Termhood

Purpose: Terminology is the set of technical words or expressions used in specific contexts, which denotes the core concept in a formal discipline and is usually applied in the fields of machine translation, information retrieval,…

Computation and Language · Computer Science 2013-02-20 Chengzhi Zhang , Dan Wu

Preference Learning in Terminology Extraction: A ROC-based approach

A key data preparation step in Text Mining, Term Extraction selects the terms, or collocation of words, attached to specific concepts. In this paper, the task of extracting relevant collocations is achieved through a supervised learning…

Machine Learning · Computer Science 2016-08-16 Jérôme Azé , Mathieu Roche , Yves Kodratoff , Michèle Sebag

Automatic extraction of requirements expressed in industrial standards : a way towards machine readable standards ?

The project, under industrial funding, presented in this publication aims at the semantic analysis of a normative document describing requirements applicable to electrical appliances. The objective of the project is to build a semantic…

Information Retrieval · Computer Science 2021-12-28 Helene de Ribaupierre , Anne-Francoise Cutting-Decelle , Nathalie Baumier , Serge Blumental

Improving Term Extraction with Terminological Resources

Studies of different term extractors on a corpus of the biomedical domain revealed decreasing performances when applied to highly technical texts. The difficulty or impossibility of customising them to new domains is an additional…

Computation and Language · Computer Science 2007-05-23 Sophie Aubin , Thierry Hamon

Extracting domain-specific terms using contextual word embeddings

Automated terminology extraction refers to the task of extracting meaningful terms from domain-specific texts. This paper proposes a novel machine learning approach to terminology extraction, which combines features from traditional term…

Computation and Language · Computer Science 2025-02-25 Andraž Repar , Nada Lavrač , Senja Pollak

A Bayesian Interpretation of the Particle Swarm Optimization and Its Kernel Extension

Particle swarm optimization is a popular method for solving difficult optimization problems. There have been attempts to formulate the method in formal probabilistic or stochastic terms (e.g. bare bones particle swarm) with the aim to…

Neural and Evolutionary Computing · Computer Science 2012-11-19 Peter Andras

Extraction of Keyphrases from Text: Evaluation of Four Algorithms

This report presents an empirical evaluation of four algorithms for automatically extracting keywords and keyphrases from documents. The four algorithms are compared using five different collections of documents. For each document, we have…

Machine Learning · Computer Science 2007-05-23 Peter D. Turney

The Recent Advances in Automatic Term Extraction: A survey

Automatic term extraction (ATE) is a Natural Language Processing (NLP) task that eases the effort of manually identifying terms from domain-specific corpora by providing a list of candidate terms. As units of knowledge in a specific field…

Computation and Language · Computer Science 2023-01-18 Hanh Thi Hong Tran , Matej Martinc , Jaya Caporusso , Antoine Doucet , Senja Pollak

FRAKE: Fusional Real-time Automatic Keyword Extraction

Keyword extraction is the process of identifying the words or phrases that express the main concepts of text to the best of one's ability. Electronic infrastructure creates a considerable amount of text every day and at all times. This…

Computation and Language · Computer Science 2021-10-04 Aidin Zehtab-Salmasi , Mohammad-Reza Feizi-Derakhshi , Mohamad-Ali Balafar

Automatic Extraction of Agriculture Terms from Domain Text: A Survey of Tools and Techniques

Agriculture is a key component in any country's development. Domain-specific knowledge resources serve to gain insight into the domain. Existing knowledge resources such as AGROVOC and NAL Thesaurus are developed and maintained by the…

Information Retrieval · Computer Science 2020-09-25 Niladri Chatterjee , Neha Kaushik

A Review of Keyphrase Extraction

Keyphrase extraction is a textual information processing task concerned with the automatic extraction of representative and characteristic phrases from a document that express all the key aspects of its content. Keyphrases constitute a…

Computation and Language · Computer Science 2019-07-31 Eirini Papagiannopoulou , Grigorios Tsoumakas

TEST: A Terminology Extraction System for Technology Related Terms

Tracking developments in the highly dynamic data-technology landscape are vital to keeping up with novel technologies and tools, in the various areas of Artificial Intelligence (AI). However, It is difficult to keep track of all the…

Information Retrieval · Computer Science 2019-03-08 Murhaf Hossari , Soumyabrata Dev , John D. Kelleher

Data Augmentation Techniques for Process Extraction from Scientific Publications

We present data augmentation techniques for process extraction tasks in scientific publications. We cast the process extraction task as a sequence labeling task where we identify all the entities in a sentence and label them according to…

Computation and Language · Computer Science 2025-04-16 Yuni Susanti

Theme-weighted Ranking of Keywords from Text Documents using Phrase Embeddings

Keyword extraction is a fundamental task in natural language processing that facilitates mapping of documents to a concise set of representative single and multi-word phrases. Keywords from text documents are primarily extracted using…

Computation and Language · Computer Science 2018-07-17 Debanjan Mahata , John Kuriakose , Rajiv Ratn Shah , Roger Zimmermann , John R. Talburt

Graph-based Semantical Extractive Text Analysis

In the past few decades, there has been an explosion in the amount of available data produced from various sources with different topics. The availability of this enormous data necessitates us to adopt effective computational tools to…

Computation and Language · Computer Science 2022-12-20 Mina Samizadeh

Improving Statistical Multimedia Information Retrieval Model by using Ontology

A typical IR system that delivers and stores information is affected by problem of matching between user query and available content on web. Use of Ontology represents the extracted terms in form of network graph consisting of nodes, edges,…

Information Retrieval · Computer Science 2017-03-23 Gagandeep Singh Narula , Vishal Jain

Enhancing Topic Extraction in Recommender Systems with Entropy Regularization

In recent years, many recommender systems have utilized textual data for topic extraction to enhance interpretability. However, our findings reveal a noticeable deficiency in the coherence of keywords within topics, resulting in low…

Computation and Language · Computer Science 2023-06-14 Xuefei Jiang , Dairui Liu , Ruihai Dong

Topic Aware Contextualized Embeddings for High Quality Phrase Extraction

Keyphrase extraction from a given document is the task of automatically extracting salient phrases that best describe the document. This paper proposes a novel unsupervised graph-based ranking method to extract high-quality phrases from a…

Information Retrieval · Computer Science 2022-01-27 Venktesh V , Mukesh Mohania , Vikram Goyal