Related papers: Toward Network-based Keyword Extraction from Multi…

Toward Selectivity Based Keyword Extraction for Croatian News

Preliminary report on network based keyword extraction for Croatian is an unsupervised method for keyword extraction from the complex network. We build our approach with a new network measure the node selectivity, motivated by the research…

Computation and Language · Computer Science 2018-02-15 Slobodan Beliga , Ana Meštrović , Sanda Martinčić-Ipšić

Complex Network based Supervised Keyword Extractor

In this paper, we present a supervised framework for automatic keyword extraction from single document. We model the text as complex network, and construct the feature set by extracting select node properties from it. Several node…

Information Retrieval · Computer Science 2019-09-27 Swagata Duari , Vasudha Bhatnagar

Theme-weighted Ranking of Keywords from Text Documents using Phrase Embeddings

Keyword extraction is a fundamental task in natural language processing that facilitates mapping of documents to a concise set of representative single and multi-word phrases. Keywords from text documents are primarily extracted using…

Computation and Language · Computer Science 2018-07-17 Debanjan Mahata , John Kuriakose , Rajiv Ratn Shah , Roger Zimmermann , John R. Talburt

Keyword and Keyphrase Extraction Using Centrality Measures on Collocation Networks

Keyword and keyphrase extraction is an important problem in natural language processing, with applications ranging from summarization to semantic search to document clustering. Graph-based approaches to keyword and keyphrase extraction…

Computation and Language · Computer Science 2014-01-28 Shibamouli Lahiri , Sagnik Ray Choudhury , Cornelia Caragea

Keywords lie far from the mean of all words in local vector space

Keyword extraction is an important document process that aims at finding a small set of terms that concisely describe a document's topics. The most popular state-of-the-art unsupervised approaches belong to the family of the graph-based…

Computation and Language · Computer Science 2020-08-24 Eirini Papagiannopoulou , Grigorios Tsoumakas , Apostolos N. Papadopoulos

Extractive Multi Document Summarization using Dynamical Measurements of Complex Networks

Due to the large amount of textual information available on Internet, it is of paramount relevance to use techniques that find relevant and concise content. A typical task devoted to the identification of informative sentences in documents…

Computation and Language · Computer Science 2018-03-23 Jorge V. Tohalino , Diego R. Amancio

Neural Net Model for Featured Word Extraction

Search engines perform the task of retrieving information related to the user-supplied query words. This task has two parts; one is finding "featured words" which describe an article best and the other is finding a match among these words…

Neural and Evolutionary Computing · Computer Science 2007-05-23 A. Das , M. Marko , A. Probst , M. A. Porter , C. Gershenson

Using virtual edges to extract keywords from texts modeled as complex networks

Detecting keywords in texts is important for many text mining applications. Graph-based methods have been commonly used to automatically find the key concepts in texts, however, relevant information provided by embeddings has not been…

Computation and Language · Computer Science 2022-05-05 Jorge A. V. Tohalino , Thiago C. Silva , Diego R. Amancio

Topic Aware Contextualized Embeddings for High Quality Phrase Extraction

Keyphrase extraction from a given document is the task of automatically extracting salient phrases that best describe the document. This paper proposes a novel unsupervised graph-based ranking method to extract high-quality phrases from a…

Information Retrieval · Computer Science 2022-01-27 Venktesh V , Mukesh Mohania , Vikram Goyal

Linguistic data mining with complex networks: a stylometric-oriented approach

By representing a text by a set of words and their co-occurrences, one obtains a word-adjacency network being a reduced representation of a given language sample. In this paper, the possibility of using network representation to extract…

Computation and Language · Computer Science 2019-01-18 Tomasz Stanisz , Jarosław Kwapień , Stanisław Drożdż

A Model for Personalized Keyword Extraction from Web Pages using Segmentation

The World Wide Web caters to the needs of billions of users in heterogeneous groups. Each user accessing the World Wide Web might have his / her own specific interest and would expect the web to respond to the specific requirements. The…

Information Retrieval · Computer Science 2017-11-22 K. S. Kuppusamy , G. Aghila

Complex networks analysis of language complexity

Methods from statistical physics, such as those involving complex networks, have been increasingly used in quantitative analysis of linguistic phenomena. In this paper, we represented pieces of text with different levels of simplification…

Physics and Society · Physics 2013-02-20 Diego R. Amancio , Sandra M. Aluisio , Osvaldo N. Oliveira , Luciano da F. Costa

RaKUn: Rank-based Keyword extraction via Unsupervised learning and Meta vertex aggregation

Keyword extraction is used for summarizing the content of a document and supports efficient document retrieval, and is as such an indispensable part of modern text-based systems. We explore how load centrality, a graph-theoretic measure…

Computation and Language · Computer Science 2019-11-12 Blaž Škrlj , Andraž Repar , Senja Pollak

An Algorithm to Self-Extract Secondary Keywords and Their Combinations Based on Abstracts Collected using Primary Keywords from Online Digital Libraries

The high-level contribution of this paper is the development and implementation of an algorithm to selfextract secondary keywords and their combinations (combo words) based on abstracts collected using standard primary keywords for research…

Information Retrieval · Computer Science 2010-07-15 Natarajan Meghanathan , Nataliya Kostyuk , Raphael Isokpehi , Hari Cohly

Effective extractive summarization using frequency-filtered entity relationship graphs

Word frequency-based methods for extractive summarization are easy to implement and yield reasonable results across languages. However, they have significant limitations - they ignore the role of context, they offer uneven coverage of…

Computation and Language · Computer Science 2018-10-25 Archit Sakhadeo , Nisheeth Srivastava

Term-community-based topic detection with variable resolution

Network-based procedures for topic detection in huge text collections offer an intuitive alternative to probabilistic topic models. We present in detail a method that is especially designed with the requirements of domain experts in mind.…

Computation and Language · Computer Science 2021-07-27 Andreas Hamm , Simon Odrowski

Cross-Domain Keyword Extraction with Keyness Patterns

Domain dependence and annotation subjectivity pose challenges for supervised keyword extraction. Based on the premises that second-order keyness patterns are existent at the community level and learnable from annotated keyword extraction…

Information Retrieval · Computer Science 2024-09-30 Dongmei Zhou , Xuri Tang

DivGraphPointer: A Graph Pointer Network for Extracting Diverse Keyphrases

Keyphrase extraction from documents is useful to a variety of applications such as information retrieval and document summarization. This paper presents an end-to-end method called DivGraphPointer for extracting a set of diversified…

Computation and Language · Computer Science 2019-05-21 Zhiqing Sun , Jian Tang , Pan Du , Zhi-Hong Deng , Jian-Yun Nie

A Semantic approach for effective document clustering using WordNet

Now a days, the text document is spontaneously increasing over the internet, e-mail and web pages and they are stored in the electronic database format. To arrange and browse the document it becomes difficult. To overcome such problem the…

Computation and Language · Computer Science 2013-03-05 Leena H. Patil , Mohammed Atique

Abstractive Summarization Improved by WordNet-based Extractive Sentences

Recently, the seq2seq abstractive summarization models have achieved good results on the CNN/Daily Mail dataset. Still, how to improve abstractive methods with extractive methods is a good research direction, since extractive methods have…

Computation and Language · Computer Science 2018-08-07 Niantao Xie , Sujian Li , Huiling Ren , Qibin Zhai