Related papers: Hyperbolic Distance-Based Speech Separation

200 papers

We introduce a framework for audio source separation using embeddings on a hyperbolic manifold that compactly represent the hierarchical relationship between sound sources and time-frequency features. Inspired by recent successes modeling…

Audio and Speech Processing · Electrical Eng. & Systems 2022-12-12 Darius Petermann, Gordon Wichern, Aswin Subramanian, Jonathan Le Roux

Finding meaningful representations and distances of hierarchical data is important in many fields. This paper presents a new method for hierarchical data embedding and distance. Our method relies on combining diffusion geometry, a central…

Machine Learning · Computer Science 2023-05-31 Ya-Wei Eileen Lin, Ronald R. Coifman, Gal Mishne, Ronen Talmon

We propose the novel task of distance-based sound separation, where sounds are separated based only on their distance from a single microphone. In the context of assisted listening devices, proximity provides a simple criterion for sound…

Sound · Computer Science 2022-07-04 Katharine Patterson, Kevin Wilson, Scott Wisdom, John R. Hershey

Hierarchy is a natural representation of semantic taxonomies, including the ones routinely used in image segmentation. Indeed, recent work on semantic segmentation reports improved accuracy from supervised training leveraging hierarchical…

Computer Vision and Pattern Recognition · Computer Science 2024-04-16 Simon Weber, Barış Zöngür, Nikita Araslanov, Daniel Cremers

Speaker embedding learning based on Euclidean space has achieved significant progress, but it is still insufficient in modeling hierarchical information within speaker features. Hyperbolic space, with its negative curvature geometric…

Sound · Computer Science 2026-04-29 Zhihua Fang, Liang He

Hyperbolic manifolds for visual representation learning allow for effective learning of semantic class hierarchies by naturally embedding tree-like structures with low distortion within a low-dimensional representation space. The highly…

Computer Vision and Pattern Recognition · Computer Science 2023-05-19 Aiden Durrant, Georgios Leontidis

Open-vocabulary semantic segmentation requires adapting image-level vision-language models such as CLIP to dense pixel-level prediction, which is challenging due to the mismatch between hierarchical structure and semantic alignment in the…

Computer Vision and Pattern Recognition · Computer Science 2026-05-12 Hoang M. Truong, Hai Nguyen-Truong, Dang Huynh

Learning in hyperbolic spaces has attracted increasing attention due to its superior ability to model hierarchical structures of data. Most existing hyperbolic learning methods use fixed distance measures for all data, assuming a uniform…

Computer Vision and Pattern Recognition · Computer Science 2025-06-24 Pengxiang Li, Yuwei Wu, Zhi Gao, Xiaomeng Fan, Wei Wu, Zhipeng Lu, Yunde Jia, Mehrtash Harandi

We propose a hyperbolic set-to-set distance measure for computing dissimilarity between sets in hyperbolic space. While point-to-point distances in hyperbolic space effectively capture hierarchical relationships between data points, many…

Computer Vision and Pattern Recognition · Computer Science 2025-06-24 Pengxiang Li, Wei Wu, Zhi Gao, Xiaomeng Fan, Peilin Yu, Yuwei Wu, Zhipeng Lu, Yunde Jia, Mehrtash Harandi

Classroom environments are particularly challenging for children with hearing impairments, where background noise, multiple talkers, and reverberation degrade speech perception. These difficulties are greater for children than adults, yet…

Hyperbolic spaces have proven to be suitable for modeling data of hierarchical nature. As such, we use the Poincaré ball to embed sentences with the goal of proving how hyperbolic spaces can be used for solving Textual Entailment. To this…

Computation and Language · Computer Science 2024-06-25 Igor Petrovski

Hyperbolic spaces, which have the capacity to embed tree structures without distortion owing to their exponential volume growth, have recently been applied to machine learning to better capture the hierarchical nature of data. In this…

Machine Learning · Computer Science 2021-03-18 Ryohei Shimizu, Yusuke Mukuta, Tatsuya Harada

Speech separation with several speakers is a challenging task because of the non-stationarity of the speech and the strong signal similarity between interferent sources. Current state-of-the-art solutions can separate well the different…

Signal Processing · Electrical Eng. & Systems 2021-02-09 Nicolas Furnon, Romain Serizel, Irina Illina, Slim Essid

We introduce a number of tools for finding and studying hierarchically hyperbolic spaces (HHS), a rich class of spaces including mapping class groups of surfaces, Teichmüller space with either the Teichmüller or…

Group Theory · Mathematics 2019-06-05 Jason Behrstock, Mark F. Hagen, Alessandro Sisto

Target speaker extraction aims to isolate a specific speaker's voice from a composite of multiple sound sources, guided by an enrollment utterance, also called an anchor. Current methods predominantly derive speaker embeddings from the anchor and…

Sound · Computer Science 2024-01-08 Shulin He, Huaiwen Zhang, Wei Rao, Kanghao Zhang, Yukai Ju, Yang Yang, Xueliang Zhang

Speech separation is the task of separating target speech from background interference. Traditionally, speech separation is studied as a signal processing problem. A more recent approach formulates speech separation as a supervised learning…

Computation and Language · Computer Science 2018-06-18 DeLiang Wang, Jitong Chen

Disentangling uncorrelated information in speech utterances is a crucial research topic within the speech community. Different speech-related tasks focus on extracting distinct speech representations while minimizing the effects of other…

Computation and Language · Computer Science 2023-09-26 Siqi Zheng, Luyao Cheng, Yafeng Chen, Hui Wang, Qian Chen

The problem of speech separation, also known as the cocktail party problem, refers to the task of isolating a single speech signal from a mixture of speech signals. Previous work on source separation derived an upper bound for the source…

Audio and Speech Processing · Electrical Eng. & Systems 2023-06-27 Shahar Lutati, Eliya Nachmani, Lior Wolf

The field of speech separation, addressing the "cocktail party problem", has seen revolutionary advances with DNNs. Speech separation enhances clarity in complex acoustic environments and serves as crucial pre-processing for speech…

Sound · Computer Science 2025-08-15 Kai Li, Guo Chen, Wendi Sang, Yi Luo, Zhuo Chen, Shuai Wang, Shulin He, Zhong-Qiu Wang, Andong Li, Zhiyong Wu, Xiaolin Hu

This work presents a reformulation of the recently proposed Wasserstein autoencoder framework on a non-Euclidean manifold, the Poincaré ball model of the hyperbolic space. By assuming the latent space to be hyperbolic, we can use its…

Machine Learning · Computer Science 2020-03-18 Ivan Ovinnikov