Topological Deep Learning for Speech Data

Zhiwang Yu

Topological Deep Learning for Speech Data

Machine Learning 2025-05-28 v1 Computer Vision and Pattern Recognition Sound Audio and Speech Processing

Authors: Zhiwang Yu

Abstract

Topological data analysis (TDA) offers novel mathematical tools for deep learning. Inspired by Carlsson et al., this study designs topology-aware convolutional kernels that significantly improve speech recognition networks. Theoretically, by investigating orthogonal group actions on kernels, we establish a fiber-bundle decomposition of matrix spaces, enabling new filter generation methods. Practically, our proposed Orthogonal Feature (OF) layer achieves superior performance in phoneme recognition, particularly in low-noise scenarios, while demonstrating cross-domain adaptability. This work reveals TDA's potential in neural network optimization, opening new avenues for mathematics-deep learning interdisciplinary studies.

Topological Deep Learning for Speech Data

Abstract

Keywords

Cite

Comments

Related papers