English

Topological Deep Learning for Speech Data

Machine Learning 2025-05-28 v1 Computer Vision and Pattern Recognition Sound Audio and Speech Processing

Abstract

Topological data analysis (TDA) offers novel mathematical tools for deep learning. Inspired by Carlsson et al., this study designs topology-aware convolutional kernels that significantly improve speech recognition networks. Theoretically, by investigating orthogonal group actions on kernels, we establish a fiber-bundle decomposition of matrix spaces, enabling new filter generation methods. Practically, our proposed Orthogonal Feature (OF) layer achieves superior performance in phoneme recognition, particularly in low-noise scenarios, while demonstrating cross-domain adaptability. This work reveals TDA's potential in neural network optimization, opening new avenues for mathematics-deep learning interdisciplinary studies.

Keywords

Cite

@article{arxiv.2505.21173,
  title  = {Topological Deep Learning for Speech Data},
  author = {Zhiwang Yu},
  journal= {arXiv preprint arXiv:2505.21173},
  year   = {2025}
}

Comments

21 pages, 15 figures