English

Complex Transformer: A Framework for Modeling Complex-Valued Sequence

Machine Learning 2021-08-10 v2 Sound Audio and Speech Processing Machine Learning

Abstract

While deep learning has received a surge of interest in a variety of fields in recent years, major deep learning models barely use complex numbers. However, speech, signal and audio data are naturally complex-valued after Fourier Transform, and studies have shown a potentially richer representation of complex nets. In this paper, we propose a Complex Transformer, which incorporates the transformer model as a backbone for sequence modeling; we also develop attention and encoder-decoder network operating for complex input. The model achieves state-of-the-art performance on the MusicNet dataset and an In-phase Quadrature (IQ) signal dataset.

Keywords

Cite

@article{arxiv.1910.10202,
  title  = {Complex Transformer: A Framework for Modeling Complex-Valued Sequence},
  author = {Muqiao Yang and Martin Q. Ma and Dongyu Li and Yao-Hung Hubert Tsai and Ruslan Salakhutdinov},
  journal= {arXiv preprint arXiv:1910.10202},
  year   = {2021}
}