Topological Deep Learning for Speech Data
Machine Learning
2025-05-28 v1 Computer Vision and Pattern Recognition
Sound
Audio and Speech Processing
Abstract
Topological data analysis (TDA) offers novel mathematical tools for deep learning. Inspired by Carlsson et al., this study designs topology-aware convolutional kernels that significantly improve speech recognition networks. Theoretically, by investigating orthogonal group actions on kernels, we establish a fiber-bundle decomposition of matrix spaces, enabling new filter generation methods. Practically, our proposed Orthogonal Feature (OF) layer achieves superior performance in phoneme recognition, particularly in low-noise scenarios, while demonstrating cross-domain adaptability. This work reveals TDA's potential in neural network optimization, opening new avenues for mathematics-deep learning interdisciplinary studies.
Cite
@article{arxiv.2505.21173,
title = {Topological Deep Learning for Speech Data},
author = {Zhiwang Yu},
journal= {arXiv preprint arXiv:2505.21173},
year = {2025}
}
Comments
21 pages, 15 figures