English

Learning Languages with Decidable Hypotheses

Logic in Computer Science 2020-11-20 v1 Computation and Language Formal Languages and Automata Theory Machine Learning

Abstract

In language learning in the limit, the most common type of hypothesis is to give an enumerator for a language. This so-called WW-index allows for naming arbitrary computably enumerable languages, with the drawback that even the membership problem is undecidable. In this paper we use a different system which allows for naming arbitrary decidable languages, namely programs for characteristic functions (called CC-indices). These indices have the drawback that it is now not decidable whether a given hypothesis is even a legal CC-index. In this first analysis of learning with CC-indices, we give a structured account of the learning power of various restrictions employing CC-indices, also when compared with WW-indices. We establish a hierarchy of learning power depending on whether CC-indices are required (a) on all outputs; (b) only on outputs relevant for the class to be learned and (c) only in the limit as final, correct hypotheses. Furthermore, all these settings are weaker than learning with WW-indices (even when restricted to classes of computable languages). We analyze all these questions also in relation to the mode of data presentation. Finally, we also ask about the relation of semantic versus syntactic convergence and derive the map of pairwise relations for these two kinds of convergence coupled with various forms of data presentation.

Keywords

Cite

@article{arxiv.2011.09866,
  title  = {Learning Languages with Decidable Hypotheses},
  author = {Julian Berger and Maximilian Böther and Vanja Doskoč and Jonathan Gadea Harder and Nicolas Klodt and Timo Kötzing and Winfried Lötzsch and Jannik Peters and Leon Schiller and Lars Seifert and Armin Wells and Simon Wietheger},
  journal= {arXiv preprint arXiv:2011.09866},
  year   = {2020}
}