English

CLAIRLIB Documentation v1.03

Information Retrieval 2007-12-21 v1 Computation and Language

Abstract

The Clair library is intended to simplify a number of generic tasks in Natural Language Processing (NLP), Information Retrieval (IR), and Network Analysis. Its architecture also allows for external software to be plugged in with very little effort. Functionality native to Clairlib includes Tokenization, Summarization, LexRank, Biased LexRank, Document Clustering, Document Indexing, PageRank, Biased PageRank, Web Graph Analysis, Network Generation, Power Law Distribution Analysis, Network Analysis (clustering coefficient, degree distribution plotting, average shortest path, diameter, triangles, shortest path matrices, connected components), Cosine Similarity, Random Walks on Graphs, Statistics (distributions, tests), Tf, Idf, Community Finding.

Keywords

Cite

@article{arxiv.0712.3298,
  title  = {CLAIRLIB Documentation v1.03},
  author = {Dragomir Radev and Mark Hodges and Anthony Fader and Mark Joseph and Joshua Gerrish and Mark Schaller and Jonathan dePeri and Bryan Gibson},
  journal= {arXiv preprint arXiv:0712.3298},
  year   = {2007}
}

Comments

for download and additional information, please see http://www.clairlib.org

R2 v1 2026-06-21T09:55:58.962Z