English

Equation Embeddings

Machine Learning 2018-03-28 v1 Computation and Language Machine Learning

Abstract

We present an unsupervised approach for discovering semantic representations of mathematical equations. Equations are challenging to analyze because each is unique, or nearly unique. Our method, which we call equation embeddings, finds good representations of equations by using the representations of their surrounding words. We used equation embeddings to analyze four collections of scientific articles from the arXiv, covering four computer science domains (NLP, IR, AI, and ML) and \sim98.5k equations. Quantitatively, we found that equation embeddings provide better models when compared to existing word embedding approaches. Qualitatively, we found that equation embeddings provide coherent semantic representations of equations and can capture semantic similarity to other equations and to words.

Keywords

Cite

@article{arxiv.1803.09123,
  title  = {Equation Embeddings},
  author = {Kriste Krstovski and David M. Blei},
  journal= {arXiv preprint arXiv:1803.09123},
  year   = {2018}
}

Comments

12 pages, 2 figures