English

Efficient Algorithms for Recognizing Weighted Tree-Adjoining Languages

Computation and Language 2023-10-25 v1 Formal Languages and Automata Theory

Abstract

The class of tree-adjoining languages can be characterized by various two-level formalisms, consisting of a context-free grammar (CFG) or pushdown automaton (PDA) controlling another CFG or PDA. These four formalisms are equivalent to tree-adjoining grammars (TAG), linear indexed grammars (LIG), pushdown-adjoining automata (PAA), and embedded pushdown automata (EPDA). We define semiring-weighted versions of the above two-level formalisms, and we design new algorithms for computing their stringsums (the weight of all derivations of a string) and allsums (the weight of all derivations). From these, we also immediately obtain stringsum and allsum algorithms for TAG, LIG, PAA, and EPDA. For LIG, our algorithm is more time-efficient by a factor of O(nN)\mathcal{O}(n|\mathcal{N}|) (where nn is the string length and N|\mathcal{N}| is the size of the nonterminal set) and more space-efficient by a factor of O(Γ)\mathcal{O}(|\Gamma|) (where Γ|\Gamma| is the size of the stack alphabet) than the algorithm of Vijay-Shanker and Weir (1989). For EPDA, our algorithm is both more space-efficient and time-efficient than the algorithm of Alonso et al. (2001) by factors of O(Γ2)\mathcal{O}(|\Gamma|^2) and O(Γ3)\mathcal{O}(|\Gamma|^3), respectively. Finally, we give the first PAA stringsum and allsum algorithms.

Keywords

Cite

@article{arxiv.2310.15276,
  title  = {Efficient Algorithms for Recognizing Weighted Tree-Adjoining Languages},
  author = {Alexandra Butoi and Tim Vieira and Ryan Cotterell and David Chiang},
  journal= {arXiv preprint arXiv:2310.15276},
  year   = {2023}
}

Comments

23 pages, 9 figures. Accepted at EMNLP 2023