
Related papers: PyNeuralFx: A Python Package for Neural Audio Effe…

200 papers

We introduce PyPhonPlan, a Python toolkit for implementing dynamical models of phonetic planning using coupled dynamic neural fields and task dynamic simulations. The toolkit provides modular components for defining planning, perception and…

Computation and Language · Computer Science 2026-03-18 Sam Kirkham

Neuron analysis provides insights into how knowledge is structured in representations and discovers the role of neurons in the network. In addition to developing an understanding of our models, neuron analysis enables various applications…

Computation and Language · Computer Science 2023-05-29 Fahim Dalvi, Hassan Sajjad, Nadir Durrani

We introduce pyannote.audio, an open-source toolkit written in Python for speaker diarization. Based on PyTorch machine learning framework, it provides a set of trainable end-to-end neural building blocks that can be combined and jointly…

Audio and Speech Processing · Electrical Eng. & Systems 2019-11-05 Hervé Bredin, Ruiqing Yin, Juan Manuel Coria, Gregory Gelly, Pavel Korshunov, Marvin Lavechin, Diego Fustes, Hadrien Titeux, Wassim Bouaziz, Marie-Philippe Gill

psifx is a plug-and-play multi-modal feature extraction toolkit, aiming to facilitate and democratize the use of state-of-the-art machine learning techniques for human sciences research. It is motivated by a need (a) to automate and…

Computation and Language · Computer Science 2026-05-06 Guillaume Rochette, Mathieu Rochat, Nizar Michaud, Matthew J. Vowels

Recently, the computational neuroscience community has pushed for more transparent and reproducible methods across the field. In the interest of unifying the domain of auditory neuroscience, naplib-python provides an intuitive and general…

Neurons and Cognition · Quantitative Biology 2023-09-20 Gavin Mischler, Vinay Raghavan, Menoua Keshishian, Nima Mesgarani

We present NablAFx, an open-source framework developed to support research in differentiable black-box and gray-box modeling of audio effects. Built in PyTorch, NablAFx offers a versatile ecosystem to configure, train, evaluate, and compare…

Sound · Computer Science 2025-02-26 Marco Comunità, Christian J. Steinmetz, Joshua D. Reiss

Studying facial expressions is a notoriously difficult endeavor. Recent advances in the field of affective computing have yielded impressive progress in automatically detecting facial expressions from pictures and videos. However, much of…

Computer Vision and Pattern Recognition · Computer Science 2023-03-09 Jin Hyun Cheong, Eshin Jolly, Tiankang Xie, Sophie Byrne, Matthew Kenney, Luke J. Chang

We present pyroomacoustics, a software package aimed at the rapid development and testing of audio array processing algorithms. The content of the package can be divided into three main components: an intuitive Python object-oriented…

Sound · Computer Science 2019-05-08 Robin Scheibler, Eric Bezzam, Ivan Dokmanić

Speech audio in the wild is often processed by post-production effects, but existing speech datasets rarely provide precise annotations of effects and parameters, limiting systematic study. We introduce VoxEffects, a speech audio effects…

Audio and Speech Processing · Electrical Eng. & Systems 2026-04-15 Zhe Zhang, Yigitcan Özer, Junichi Yamagishi

Speech deepfake detection is a well-established research field with different models, datasets, and training strategies. However, the lack of standardized implementations and evaluation protocols limits reproducibility, benchmarking, and…

As neuroimaging databases grow in size and complexity, the time researchers spend investigating and managing the data increases at the expense of data analysis. As a result, investigators rely more and more heavily on scripting using…

Artificial intelligence (AI) is increasingly central to understanding how the brain processes information. However, the integration of neuroscience and modern AI is bottlenecked by a fragmented software ecosystem. Current tools are siloed…

NeurST is an open-source toolkit for neural speech translation. The toolkit mainly focuses on end-to-end speech translation, which is easy to use, modify, and extend to advanced speech translation research and products. NeurST aims at…

Computation and Language · Computer Science 2021-06-16 Chengqi Zhao, Mingxuan Wang, Qianqian Dong, Rong Ye, Lei Li

pyAMPACT (Python-based Automatic Music Performance Analysis and Comparison Toolkit) links symbolic and audio music representations to facilitate score-informed estimation of performance data in audio as well as general linking of symbolic…

Sound · Computer Science 2026-01-06 Johanna Devaney, Daniel McKemie, Alex Morgan

The open-source PyNX toolkit [Favre-Nicolin et al (2011) arXiv:1010.2641, Mandula et al (2016)] has been extended to provide tools for coherent X-ray imaging data analysis and simulation. All calculations can be executed on graphical…

Deep Audio Analyzer is an open source speech framework that aims to simplify the research and the development process of neural speech processing pipelines, allowing users to conceive, compare and share results in a fast and reproducible…

Sound · Computer Science 2023-10-31 Valerio Francesco Puglisi, Oliver Giudice, Sebastiano Battiato

auDeep is a Python toolkit for deep unsupervised representation learning from acoustic data. It is based on a recurrent sequence to sequence autoencoder approach which can learn representations of time series data by taking into account…

Audio effects are extensively used at every stage of audio and music content creation. The majority of differentiable audio effects modeling approaches fall into the black-box or gray-box paradigms; and most models have been proposed and…

Sound · Computer Science 2025-02-21 Marco Comunità, Christian J. Steinmetz, Joshua D. Reiss

SpeechBrain is an open-source and all-in-one speech toolkit. It is designed to facilitate the research and development of neural speech processing technologies by being simple, flexible, user-friendly, and well-documented. This paper…

1. Natural sounds have been recorded for millions of hours over the previous decades using passive acoustic monitoring. Improvements in deep learning models have vastly accelerated the analysis of large portions of this data. While new…

Machine Learning · Computer Science 2026-04-14 Vincent S. Kather, Sylvain Haupert, Burooj Ghani, Dan Stowell