Subject Specific Stream Classification Preprocessing Algorithm for Twitter Data Stream

Computation and Language 2017-05-30 v1

Abstract

The micro-blogging service Twitter is a lucrative source for data mining applications on global sentiment. However, because each data item can mention any of a wide variety of subjects, it is inefficient to run a data mining algorithm on the raw data. This paper discusses an algorithm that accurately classifies the entire stream into a given number of mutually exclusive, collectively exhaustive streams, on each of which the data mining algorithm can then be run separately, yielding more relevant results with high efficiency.

Cite

@article{arxiv.1705.09995,
  title  = {Subject Specific Stream Classification Preprocessing Algorithm for Twitter Data Stream},
  author = {Nisansa de Silva and Danaja Maldeniya and Chamilka Wijeratne},
  journal= {arXiv preprint arXiv:1705.09995},
  year   = {2017}
}

Comments

6 pages