Enforcing Encoder-Decoder Modularity in Sequence-to-Sequence Models

Siddharth Dalmia; Abdelrahman Mohamed; Mike Lewis; Florian Metze; Luke Zettlemoyer

Computation and Language; Machine Learning. 2019-11-12 (v1)

Abstract

Inspired by the modular software design principles of independence, interchangeability, and clarity of interface, we introduce a method for enforcing encoder-decoder modularity in sequence-to-sequence (seq2seq) models without sacrificing overall model quality or full differentiability. We discretize the encoder output units into a predefined, interpretable vocabulary space using the Connectionist Temporal Classification (CTC) loss. Our modular systems achieve near state-of-the-art performance on the 300h Switchboard benchmark, with word error rates (WER) of 8.3% and 17.6% on the SWB and CH subsets, respectively, using seq2seq models whose encoder and decoder modules are independent and interchangeable.
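
To make the role of the CTC loss more concrete, the following is a minimal pure-Python sketch of the standard CTC forward recursion. It is not the authors' implementation. The function names, the blank index of 0, and the toy two-symbol vocabulary are assumptions made for illustration only. It assumes the encoder emits one probability distribution over a fixed vocabulary per frame, which is the kind of interpretable interface the abstract describes.

```python
import math


def ctc_neg_log_likelihood(probs, target, blank=0):
    """Negative log-likelihood of `target` under CTC (hypothetical helper).

    probs:  list of T per-frame distributions, each a list over the vocabulary.
    target: list of label ids, without blanks.
    """
    # Extended target with blanks interleaved: [b, y1, b, y2, ..., b].
    ext = [blank]
    for label in target:
        ext += [label, blank]
    S, T = len(ext), len(probs)

    # alpha[s]: total probability of all partial alignments ending at ext[s].
    alpha = [0.0] * S
    alpha[0] = probs[0][ext[0]]
    if S > 1:
        alpha[1] = probs[0][ext[1]]

    for t in range(1, T):
        new = [0.0] * S
        for s in range(S):
            total = alpha[s]                      # stay on the same symbol
            if s >= 1:
                total += alpha[s - 1]             # advance by one position
            if s >= 2 and ext[s] != blank and ext[s] != ext[s - 2]:
                total += alpha[s - 2]             # skip the blank in between
            new[s] = total * probs[t][ext[s]]
        alpha = new

    # Valid alignments end on the last label or on the trailing blank.
    likelihood = alpha[S - 1] + (alpha[S - 2] if S > 1 else 0.0)
    return -math.log(likelihood) if likelihood > 0.0 else math.inf


def greedy_collapse(probs, blank=0):
    """Best-path decoding: take the argmax per frame, merge repeats, drop blanks."""
    best = [max(range(len(p)), key=p.__getitem__) for p in probs]
    out, prev = [], None
    for k in best:
        if k != prev and k != blank:
            out.append(k)
        prev = k
    return out


# Toy example (assumed data): 2 frames, vocabulary {0: blank, 1: "a"}.
frames = [[0.5, 0.5], [0.5, 0.5]]
loss = ctc_neg_log_likelihood(frames, [1])
```

In this toy case three alignments (a a, a blank, blank a) each have probability 0.25, so the loss equals -log(0.75). Because every encoder frame is a distribution over a known vocabulary, a decoder written against that vocabulary does not depend on the internals of any particular encoder, which is one way to read the interchangeability claim.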
