English

Typesafe Modeling in Text Mining

Programming Languages 2011-08-02 v1 Information Retrieval

Abstract

Based on the concept of annotation-based agents, this report introduces tools and a formal notation for defining and running text mining experiments using a statically typed domain-specific language embedded in Scala. Using machine learning for classification as an example, the framework is used to develop and document text mining experiments, and to show how the concept of generic, typesafe annotation corresponds to a general information model that goes beyond text processing.

Keywords

Cite

@article{arxiv.1108.0363,
  title  = {Typesafe Modeling in Text Mining},
  author = {Fabian Steeg},
  journal= {arXiv preprint arXiv:1108.0363},
  year   = {2011}
}

Comments

63 pages, in German

R2 v1 2026-06-21T18:44:53.935Z