Related papers: PyNeuralFx: A Python Package for Neural Audio Effe…

PyPhonPlan: Simulating phonetic planning with dynamic neural fields and task dynamics

We introduce PyPhonPlan, a Python toolkit for implementing dynamical models of phonetic planning using coupled dynamic neural fields and task dynamic simulations. The toolkit provides modular components for defining planning, perception and…

Computation and Language · Computer Science 2026-03-18 Sam Kirkham

NeuroX Library for Neuron Analysis of Deep NLP Models

Neuron analysis provides insights into how knowledge is structured in representations and discovers the role of neurons in the network. In addition to developing an understanding of our models, neuron analysis enables various applications…

Computation and Language · Computer Science 2023-05-29 Fahim Dalvi, Hassan Sajjad, Nadir Durrani

pyannote.audio: neural building blocks for speaker diarization

We introduce pyannote.audio, an open-source toolkit written in Python for speaker diarization. Based on PyTorch machine learning framework, it provides a set of trainable end-to-end neural building blocks that can be combined and jointly…

Audio and Speech Processing · Electrical Eng. & Systems 2019-11-05 Hervé Bredin, Ruiqing Yin, Juan Manuel Coria, Gregory Gelly, Pavel Korshunov, Marvin Lavechin, Diego Fustes, Hadrien Titeux, Wassim Bouaziz, Marie-Philippe Gill

psifx -- Psychological and Social Interactions Feature Extraction Package

psifx is a plug-and-play multi-modal feature extraction toolkit, aiming to facilitate and democratize the use of state-of-the-art machine learning techniques for human sciences research. It is motivated by a need (a) to automate and…

Computation and Language · Computer Science 2026-05-06 Guillaume Rochette, Mathieu Rochat, Nizar Michaud, Matthew J. Vowels

naplib-python: Neural Acoustic Data Processing and Analysis Tools in Python

Recently, the computational neuroscience community has pushed for more transparent and reproducible methods across the field. In the interest of unifying the domain of auditory neuroscience, naplib-python provides an intuitive and general…

Neurons and Cognition · Quantitative Biology 2023-09-20 Gavin Mischler, Vinay Raghavan, Menoua Keshishian, Nima Mesgarani

NablAFx: A Framework for Differentiable Black-box and Gray-box Modeling of Audio Effects

We present NablAFx, an open-source framework developed to support research in differentiable black-box and gray-box modeling of audio effects. Built in PyTorch, NablAFx offers a versatile ecosystem to configure, train, evaluate, and compare…

Sound · Computer Science 2025-02-26 Marco Comunità, Christian J. Steinmetz, Joshua D. Reiss

Py-Feat: Python Facial Expression Analysis Toolbox

Studying facial expressions is a notoriously difficult endeavor. Recent advances in the field of affective computing have yielded impressive progress in automatically detecting facial expressions from pictures and videos. However, much of…

Computer Vision and Pattern Recognition · Computer Science 2023-03-09 Jin Hyun Cheong, Eshin Jolly, Tiankang Xie, Sophie Byrne, Matthew Kenney, Luke J. Chang

Pyroomacoustics: A Python package for audio room simulations and array processing algorithms

We present pyroomacoustics, a software package aimed at the rapid development and testing of audio array processing algorithms. The content of the package can be divided into three main components: an intuitive Python object-oriented…

Sound · Computer Science 2019-05-08 Robin Scheibler, Eric Bezzam, Ivan Dokmanić

VoxEffects: A Speech-Oriented Audio Effects Dataset and Benchmark

Speech audio in the wild is often processed by post-production effects, but existing speech datasets rarely provide precise annotations of effects and parameters, limiting systematic study. We introduce VoxEffects, a speech audio effects…

Audio and Speech Processing · Electrical Eng. & Systems 2026-04-15 Zhe Zhang, Yigitcan Özer, Junichi Yamagishi

DeepFense: A Unified, Modular, and Extensible Framework for Robust Deepfake Audio Detection

Speech deepfake detection is a well-established research field with different models, datasets, and training strategies. However, the lack of standardized implementations and evaluation protocols limits reproducibility, benchmarking, and…

Sound · Computer Science 2026-04-10 Yassine El Kheir, Arnab Das, Yixuan Xiao, Xin Wang, Feidi Kallel, Enes Erdem Erdogan, Ngoc Thang Vu, Tim Polzehl, Sebastian Moeller

PyXNAT: XNAT in Python

As neuroimaging databases grow in size and complexity, the time researchers spend investigating and managing the data increases to the expense of data analysis. As a result, investigators rely more and more heavily on scripting using…

Databases · Computer Science 2013-01-30 Yannick Schwartz, Alexis Barbot, Benjamin Thyreau, Vincent Frouin, Gaël Varoquaux, Aditya Siram, Daniel Marcus, Jean-Baptiste Poline

NeuralSet: A High-Performing Python Package for Neuro-AI

Artificial intelligence (AI) is increasingly central to understanding how the brain processes information. However, the integration of neuroscience and modern AI is bottlenecked by a fragmented software ecosystem. Current tools are siloed…

Neurons and Cognition · Quantitative Biology 2026-05-11 Jean-Rémi King, Corentin Bel, Linnea Evanson, Julien Gadonneix, Sophia Houhamdi, Jarod Lévy, Josephine Raugel, Andrea Santos Revilla, Mingfang Zhang, Julie Bonnaire, Charlotte Caucheteux, Alexandre Défossez, Théo Desbordes, Pablo Diego-Simón, Shubh Khanna, Juliette Millet, Pierre Orhan, Saarang Panchavati, Antoine Ratouchniak, Alexis Thual, Teon L. Brooks, Katelyn Begany, Yohann Benchetrit, Marlène Careil, Hubert Banville, Stéphane d'Ascoli, Simon Dahan, Jérémy Rapin

NeurST: Neural Speech Translation Toolkit

NeurST is an open-source toolkit for neural speech translation. The toolkit mainly focuses on end-to-end speech translation, which is easy to use, modify, and extend to advanced speech translation research and products. NeurST aims at…

Computation and Language · Computer Science 2021-06-16 Chengqi Zhao, Mingxuan Wang, Qianqian Dong, Rong Ye, Lei Li

pyAMPACT: A Score-Audio Alignment Toolkit for Performance Data Estimation and Multi-modal Processing

pyAMPACT (Python-based Automatic Music Performance Analysis and Comparison Toolkit) links symbolic and audio music representations to facilitate score-informed estimation of performance data in audio as well as general linking of symbolic…

Sound · Computer Science 2026-01-06 Johanna Devaney, Daniel McKemie, Alex Morgan

PyNX: high performance computing toolkit for coherent X-ray imaging based on operators

The open-source PyNX toolkit [Favre-Nicolin et al (2011) arXiv:1010.2641, Mandula et al (2016)] has been extended to provide tools for coherent X-ray imaging data analysis and simulation. All calculations can be executed on graphical…

Materials Science · Physics 2020-10-01 Vincent Favre-Nicolin, Gaétan Girard, Steven Leake, Jérôme Carnis, Yuriy Chushkin, Jérôme Kieffer, Pierre Paléo, Marie-Ingrid Richard

Deep Audio Analyzer: a Framework to Industrialize the Research on Audio Forensics

Deep Audio Analyzer is an open source speech framework that aims to simplify the research and the development process of neural speech processing pipelines, allowing users to conceive, compare and share results in a fast and reproducible…

Sound · Computer Science 2023-10-31 Valerio Francesco Puglisi, Oliver Giudice, Sebastiano Battiato

auDeep: Unsupervised Learning of Representations from Audio with Deep Recurrent Neural Networks

auDeep is a Python toolkit for deep unsupervised representation learning from acoustic data. It is based on a recurrent sequence to sequence autoencoder approach which can learn representations of time series data by taking into account…

Sound · Computer Science 2017-12-25 Michael Freitag, Shahin Amiriparian, Sergey Pugachevskiy, Nicholas Cummins, Björn Schuller

Differentiable Black-box and Gray-box Modeling of Nonlinear Audio Effects

Audio effects are extensively used at every stage of audio and music content creation. The majority of differentiable audio effects modeling approaches fall into the black-box or gray-box paradigms; and most models have been proposed and…

Sound · Computer Science 2025-02-21 Marco Comunità, Christian J. Steinmetz, Joshua D. Reiss

SpeechBrain: A General-Purpose Speech Toolkit

SpeechBrain is an open-source and all-in-one speech toolkit. It is designed to facilitate the research and development of neural speech processing technologies by being simple, flexible, user-friendly, and well-documented. This paper…

Audio and Speech Processing · Electrical Eng. & Systems 2021-06-10 Mirco Ravanelli, Titouan Parcollet, Peter Plantinga, Aku Rouhe, Samuele Cornell, Loren Lugosch, Cem Subakan, Nauman Dawalatabad, Abdelwahab Heba, Jianyuan Zhong, Ju-Chieh Chou, Sung-Lin Yeh, Szu-Wei Fu, Chien-Feng Liao, Elena Rastorgueva, François Grondin, William Aris, Hwidong Na, Yan Gao, Renato De Mori, Yoshua Bengio

bacpipe: a Python package to make bioacoustic deep learning models accessible

1. Natural sounds have been recorded for millions of hours over the previous decades using passive acoustic monitoring. Improvements in deep learning models have vastly accelerated the analysis of large portions of this data. While new…

Machine Learning · Computer Science 2026-04-14 Vincent S. Kather, Sylvain Haupert, Burooj Ghani, Dan Stowell