Related papers: Abstract Mining

An Algorithm to Self-Extract Secondary Keywords and Their Combinations Based on Abstracts Collected using Primary Keywords from Online Digital Libraries

The high-level contribution of this paper is the development and implementation of an algorithm to self-extract secondary keywords and their combinations (combo words) based on abstracts collected using standard primary keywords for research…

Information Retrieval · Computer Science 2010-07-15 Natarajan Meghanathan, Nataliya Kostyuk, Raphael Isokpehi, Hari Cohly

PubTree: A Hierarchical Search Tool for the MEDLINE Database

Keeping track of the ever-increasing body of scientific literature is an escalating challenge. We present PubTree, a hierarchical search tool that efficiently searches the PubMed/MEDLINE dataset based upon a decision tree constructed using…

Information Retrieval · Computer Science 2017-02-28 William Rowe, Paul D. Dobson, Bede Constantinides, Mark Platt

Which Clustering Do You Want? Inducing Your Ideal Clustering with Minimal Feedback

While traditional research on text clustering has largely focused on grouping documents by topic, it is conceivable that a user may want to cluster documents along other dimensions, such as the author's mood, gender, age, or sentiment.…

Information Retrieval · Computer Science 2014-01-22 Sajib Dasgupta, Vincent Ng

Towards Constructing a Corpus for Studying the Effects of Treatments and Substances Reported in PubMed Abstracts

We present the construction of an annotated corpus of PubMed abstracts reporting about positive, negative or neutral effects of treatments or substances. Our ultimate goal is to annotate one sentence (rationale) for each abstract and to use…

Computation and Language · Computer Science 2019-12-05 Evgeni Stefchov, Galia Angelova, Preslav Nakov

Rank Based Clustering For Document Retrieval From Biomedical Databases

Nowadays, search engines are widely used for extracting information from various resources throughout the world. The majority of searches lie in the biomedical field, retrieving related documents from various…

Information Retrieval · Computer Science 2009-12-14 Jayanthi Manicassamy, P. Dhavachelvan

Document Clustering using K-Medoids

People are always searching for information, for which they tend to use the internet, but it holds such a huge assemblage of data that it becomes difficult for the reader to get the most accurate data. To make it easier for people to gather…

Information Retrieval · Computer Science 2015-04-07 Monica Jha

Enhancing Unsupervised Keyword Extraction in Academic Papers through Integrating Highlights with Abstract

Automatic keyword extraction from academic papers is a key area of interest in natural language processing and information retrieval. Although previous research has mainly focused on utilizing abstract and references for keyword extraction,…

Information Retrieval · Computer Science 2026-04-22 Yi Xiang, Chengzhi Zhang

Using Artificial Intuition in Distinct, Minimalist Classification of Scientific Abstracts for Management of Technology Portfolios

Classification of scientific abstracts is useful for strategic activities but challenging to automate because the sparse text provides few contextual clues. Metadata associated with the scientific publication can be used to improve…

Digital Libraries · Computer Science 2025-09-09 Prateek Ranka, Fred Morstatter, Alexandra Graddy-Reed, Andrea Belz

PDC -- a probabilistic distributional clustering algorithm: a case study on suicide articles in PubMed

The need to organize a large collection in a manner that facilitates human comprehension is crucial given the ever-increasing volumes of information. In this work, we present PDC (probabilistic distributional clustering), a novel algorithm…

Computation and Language · Computer Science 2020-03-09 Rezarta Islamaj, Lana Yeganova, Won Kim, Natalie Xie, W. John Wilbur, Zhiyong Lu

Centroid-based summarization of multiple documents: sentence extraction, utility-based evaluation, and user studies

We present a multi-document summarizer, called MEAD, which generates summaries using cluster centroids produced by a topic detection and tracking system. We also describe two new techniques, based on sentence utility and subsumption, which…

Computation and Language · Computer Science 2007-05-23 Dragomir R. Radev, Hongyan Jing, Malgorzata Budzikowska

A Case Study in Text Mining: Interpreting Twitter Data From World Cup Tweets

Cluster analysis is a field of data analysis that extracts underlying patterns in data. One application of cluster analysis is in text-mining, the analysis of large collections of text to find similarities between documents. We used a…

Machine Learning · Statistics 2014-08-26 Daniel Godfrey, Caley Johns, Carl Meyer, Shaina Race, Carol Sadek

Bibliometric Perspectives on Medical Innovation using the Medical Subject Headings (MeSH) of PubMed

Multiple perspectives on the nonlinear processes of medical innovations can be distinguished and combined using the Medical Subject Headings (MeSH) of the Medline database. Focusing on three main branches-"diseases," "drugs and chemicals,"…

Digital Libraries · Computer Science 2012-04-10 Loet Leydesdorff, Daniele Rotolo, Ismael Rafols

Estimating the Effective Topics of Articles and journals Abstract Using LDA And K-Means Clustering Algorithm

Analyzing journals and articles abstract text or documents using topic modelling and text clustering has become a modern solution for the increasing number of text documents. Topic modelling and text clustering are both intensely involved…

Information Retrieval · Computer Science 2025-08-25 Shadikur Rahman, Umme Ayman Koana, Aras M. Ismael, Karmand Hussein Abdalla

Exploring text datasets by visualizing relevant words

When working with a new dataset, it is important to first explore and familiarize oneself with it, before applying any advanced machine learning algorithms. However, to the best of our knowledge, no tools exist that quickly and reliably…

Computation and Language · Computer Science 2017-07-18 Franziska Horn, Leila Arras, Grégoire Montavon, Klaus-Robert Müller, Wojciech Samek

Visualization of association graphs for assisting the interpretation of classifications

Given a query on the PASCAL database maintained by the INIST, we design user interfaces to visualize and browse two types of graphs extracted from abstracts: 1) the graph of all associations between authors (co-author graph), 2) the graph…

Applications · Statistics 2008-11-06 Eric San Juan, Ivana Roche

Identifying user habits through data mining on call data records

In this paper we propose a framework for identifying patterns and regularities in the pseudo-anonymized Call Data Records (CDR) pertaining to a generic subscriber of a mobile operator. We face the challenging task of automatically deriving…

Data Structures and Algorithms · Computer Science 2017-11-23 Filippo Maria Bianchi, Antonello Rizzi, Alireza Sadeghian, Corrado Moiso

Accessing accurate documents by mining auxiliary document information

Earlier techniques of text mining included algorithms like k-means, Naive Bayes, SVM which classify and cluster the text document for mining relevant information about the documents. The need for improving the mining techniques has us…

Information Retrieval · Computer Science 2016-05-10 Jinju Joby, Jyothi Korra

Proposition-Level Clustering for Multi-Document Summarization

Text clustering methods were traditionally incorporated into multi-document summarization (MDS) as a means for coping with considerable information repetition. Particularly, clusters were leveraged to indicate information saliency as well…

Computation and Language · Computer Science 2022-05-23 Ori Ernst, Avi Caciularu, Ori Shapira, Ramakanth Pasunuru, Mohit Bansal, Jacob Goldberger, Ido Dagan

Document Clustering using K-Means and K-Medoids

With the huge upsurge of information in day-to-day life, it has become difficult to assemble relevant information in the nick of time. But people are always short of time and need everything quickly. Hence clustering was introduced to…

Information Retrieval · Computer Science 2015-03-02 Rakesh Chandra Balabantaray, Chandrali Sarma, Monica Jha

Query Clustering using Segment Specific Context Embeddings

This paper presents a novel query clustering approach to capture the broad interest areas of users querying search engines. We make use of a recent advance in NLP, word2vec, and extend it to get query2vec, vector representations of queries,…

Information Retrieval · Computer Science 2016-11-08 S. K. Kolluru, Prasenjit Mukherjee