English

Neural String Edit Distance

Computation and Language 2022-04-28 v2 Machine Learning

Abstract

We propose the neural string edit distance model for string-pair matching and string transduction based on learnable string edit distance. We modify the original expectation-maximization learned edit distance algorithm into a differentiable loss function, allowing us to integrate it into a neural network providing a contextual representation of the input. We evaluate on cognate detection, transliteration, and grapheme-to-phoneme conversion, and show that we can trade off between performance and interpretability in a single framework. Using contextual representations, which are difficult to interpret, we match the performance of state-of-the-art string-pair matching models. Using static embeddings and a slightly different loss function, we force interpretability, at the expense of an accuracy drop.

Keywords

Cite

@article{arxiv.2104.08388,
  title  = {Neural String Edit Distance},
  author = {Jindřich Libovický and Alexander Fraser},
  journal= {arXiv preprint arXiv:2104.08388},
  year   = {2022}
}

Comments

14 pages, 5 figures; Workshop on Structured Prediction for NLP @ACL 2022, camera-ready