Efficient Algorithms for Recognizing Weighted Tree-Adjoining Languages
Abstract
The class of tree-adjoining languages can be characterized by various two-level formalisms, consisting of a context-free grammar (CFG) or pushdown automaton (PDA) controlling another CFG or PDA. These four formalisms are equivalent to tree-adjoining grammars (TAG), linear indexed grammars (LIG), pushdown-adjoining automata (PAA), and embedded pushdown automata (EPDA). We define semiring-weighted versions of the above two-level formalisms, and we design new algorithms for computing their stringsums (the weight of all derivations of a string) and allsums (the weight of all derivations). From these, we also immediately obtain stringsum and allsum algorithms for TAG, LIG, PAA, and EPDA. For LIG, our algorithm is more time-efficient by a factor of (where is the string length and is the size of the nonterminal set) and more space-efficient by a factor of (where is the size of the stack alphabet) than the algorithm of Vijay-Shanker and Weir (1989). For EPDA, our algorithm is both more space-efficient and time-efficient than the algorithm of Alonso et al. (2001) by factors of and , respectively. Finally, we give the first PAA stringsum and allsum algorithms.
Keywords
Cite
@article{arxiv.2310.15276,
title = {Efficient Algorithms for Recognizing Weighted Tree-Adjoining Languages},
author = {Alexandra Butoi and Tim Vieira and Ryan Cotterell and David Chiang},
journal= {arXiv preprint arXiv:2310.15276},
year = {2023}
}
Comments
23 pages, 9 figures. Accepted at EMNLP 2023