Logic Mill -- A Knowledge Navigation System

Computation and Language 2024-10-14 v2

Abstract

Logic Mill is a scalable, openly accessible software system that identifies semantically similar documents within a single domain-specific corpus or across multi-domain corpora. It uses Natural Language Processing (NLP) techniques to generate numerical representations of documents; currently, a large pre-trained language model produces these representations. The system focuses on scientific publications and patent documents and contains more than 200 million documents. It is accessible via a simple Application Programming Interface (API) or a web interface. Moreover, it is continuously updated and can be extended to text corpora from other domains. We see this system as a general-purpose tool for future research applications in the social sciences and other domains.
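
To illustrate the general idea of finding similar documents from numerical representations, here is a minimal sketch. It is not the Logic Mill API or implementation: the abstract does not name a similarity measure, so cosine similarity is an assumption, and the function names, document IDs, and three-dimensional vectors are hypothetical stand-ins for real model embeddings.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def most_similar(query, corpus, top_k=2):
    """Return the top_k (doc_id, score) pairs, highest similarity first."""
    scored = [(doc_id, cosine_similarity(query, vec)) for doc_id, vec in corpus.items()]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:top_k]


# Hypothetical toy embeddings; a real system would obtain these from a language model.
corpus = {
    "paper_a": [0.9, 0.1, 0.0],
    "patent_b": [0.8, 0.2, 0.1],
    "paper_c": [0.0, 0.1, 0.9],
}
query = [1.0, 0.0, 0.0]

ranking = most_similar(query, corpus)
for doc_id, score in ranking:
    print(doc_id, round(score, 3))
```

Because every document is reduced to a vector, scientific publications and patents can be compared in the same space, which is what makes cross-corpus search possible. At the scale of hundreds of millions of documents, an exhaustive scan like this one would typically be replaced by an approximate nearest-neighbour index.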

Cite

@article{arxiv.2301.00200,
  title  = {Logic Mill -- A Knowledge Navigation System},
  author = {Sebastian Erhardt and Mainak Ghosh and Erik Buunk and Michael E. Rose and Dietmar Harhoff},
  journal= {arXiv preprint arXiv:2301.00200},
  year   = {2024}
}

Comments

10 pages, 3 figures, 1 table
