Related papers: Constructing Composite Features for Interpretable …

Perceptual Musical Features for Interpretable Audio Tagging

In the age of music streaming platforms, the task of automatically tagging music audio has garnered significant attention, driving researchers to devise methods aimed at enhancing performance metrics on standard datasets. Most recent…

Sound · Computer Science 2024-02-26 Vassilis Lyberatos , Spyridon Kantarelis , Edmund Dervakos , Giorgos Stamou

ICGAN: An implicit conditioning method for interpretable feature control of neural audio synthesis

Neural audio synthesis methods can achieve high-fidelity and realistic sound generation by utilizing deep generative models. Such models typically rely on external labels which are often discrete as conditioning information to achieve…

Sound · Computer Science 2024-06-12 Yunyi Liu , Craig Jin

Consistent Feature Construction with Constrained Genetic Programming for Experimental Physics

A good feature representation is a determinant factor to achieve high performance for many machine learning algorithms in terms of classification. This is especially true for techniques that do not build complex internal representations of…

Neural and Evolutionary Computing · Computer Science 2019-08-22 Noëlie Cherrier , Jean-Philippe Poli , Maxime Defurne , Franck Sabatié

On Explaining Machine Learning Models by Evolving Crucial and Compact Features

Feature construction can substantially improve the accuracy of Machine Learning (ML) algorithms. Genetic Programming (GP) has been proven to be effective at this task by evolving non-linear combinations of input features. GP additionally…

Neural and Evolutionary Computing · Computer Science 2020-01-13 Marco Virgolin , Tanja Alderliesten , Peter A. N. Bosman

Extended pipeline for content-based feature engineering in music genre recognition

We present a feature engineering pipeline for the construction of musical signal characteristics, to be used for the design of a supervised model for musical genre identification. The key idea is to extend the traditional two-step process…

Sound · Computer Science 2021-04-08 Tina Raissi , Alessandro Tibo , Paolo Bientinesi

Benchmarking and Improving Compositional Generalization of Multi-aspect Controllable Text Generation

Compositional generalization, representing the model's ability to generate text with new attribute combinations obtained by recombining single attributes from the training data, is a crucial property for multi-aspect controllable text…

Computation and Language · Computer Science 2024-06-04 Tianqi Zhong , Zhaoyi Li , Quan Wang , Linqi Song , Ying Wei , Defu Lian , Zhendong Mao

Semantic-Aware Interpretable Multimodal Music Auto-Tagging

Music auto-tagging is essential for organizing and discovering music in extensive digital libraries. While foundation models achieve exceptional performance in this domain, their outputs often lack interpretability, limiting trust and…

Machine Learning · Computer Science 2026-05-28 Andreas Patakis , Vassilis Lyberatos , Spyridon Kantarelis , Edmund Dervakos , Giorgos Stamou

Compositional Program Generation for Few-Shot Systematic Generalization

Compositional generalization is a key ability of humans that enables us to learn new concepts from only a handful examples. Neural machine learning models, including the now ubiquitous Transformers, struggle to generalize in this way, and…

Machine Learning · Computer Science 2024-01-19 Tim Klinger , Luke Liu , Soham Dan , Maxwell Crouse , Parikshit Ram , Alexander Gray

When Audio Generators Become Good Listeners: Generative Features for Understanding Tasks

This work pioneers the utilization of generative features in enhancing audio understanding. Unlike conventional discriminative features that directly optimize posterior and thus emphasize semantic abstraction while losing fine grained…

Sound · Computer Science 2025-09-30 Zeyu Xie , Chenxing Li , Xuenan Xu , Mengyue Wu , Wenfu Wang , Ruibo Fu , Meng Yu , Dong Yu , Yuexian Zou

Genetic Programming for Evolving a Front of Interpretable Models for Data Visualisation

Data visualisation is a key tool in data mining for understanding big datasets. Many visualisation methods have been proposed, including the well-regarded state-of-the-art method t-Distributed Stochastic Neighbour Embedding. However, the…

Neural and Evolutionary Computing · Computer Science 2020-01-29 Andrew Lensen , Bing Xue , Mengjie Zhang

Genetic Programming is Naturally Suited to Evolve Bagging Ensembles

Learning ensembles by bagging can substantially improve the generalization performance of low-bias, high-variance estimators, including those evolved by Genetic Programming (GP). To be efficient, modern GP algorithms for evolving (bagging)…

Neural and Evolutionary Computing · Computer Science 2021-02-08 Marco Virgolin

Generative-based Fusion Mechanism for Multi-Modal Tracking

Generative models (GMs) have received increasing research interest for their remarkable capacity to achieve comprehensive understanding. However, their potential application in the domain of multi-modal tracking has remained relatively…

Computer Vision and Pattern Recognition · Computer Science 2023-12-01 Zhangyong Tang , Tianyang Xu , Xuefeng Zhu , Xiao-Jun Wu , Josef Kittler

Towards Deep Representation Learning with Genetic Programming

Genetic Programming (GP) is an evolutionary algorithm commonly used for machine learning tasks. In this paper we present a method that allows GP to transform the representation of a large-scale machine learning dataset into a more compact…

Neural and Evolutionary Computing · Computer Science 2018-02-21 Lino Rodriguez-Coayahuitl , Alicia Morales-Reyes , Hugo Jair Escalante

Feature-informed Embedding Space Regularization For Audio Classification

Feature representations derived from models pre-trained on large-scale datasets have shown their generalizability on a variety of audio analysis tasks. Despite this generalizability, however, task-specific features can outperform if…

Audio and Speech Processing · Electrical Eng. & Systems 2022-06-13 Yun-Ning Hung , Alexander Lerch

DrumGAN: Synthesis of Drum Sounds With Timbral Feature Conditioning Using Generative Adversarial Networks

Synthetic creation of drum sounds (e.g., in drum machines) is commonly performed using analog or digital synthesis, allowing a musician to sculpt the desired timbre modifying various parameters. Typically, such parameters control low-level…

Audio and Speech Processing · Electrical Eng. & Systems 2022-06-29 J. Nistal , S. Lattner , G. Richard

Improving Compositional Generalization Using Iterated Learning and Simplicial Embeddings

Compositional generalization, the ability of an agent to generalize to unseen combinations of latent factors, is easy for humans but hard for deep neural networks. A line of research in cognitive science has hypothesized a process,…

Machine Learning · Computer Science 2023-10-31 Yi Ren , Samuel Lavoie , Mikhail Galkin , Danica J. Sutherland , Aaron Courville

A Controllable Perceptual Feature Generative Model for Melody Harmonization via Conditional Variational Autoencoder

While Large Language Models (LLMs) make symbolic music generation increasingly accessible, producing music with distinctive composition and rich expressiveness remains a significant challenge. Many studies have introduced emotion models to…

Sound · Computer Science 2025-11-19 Dengyun Huang , Yonghua Zhu

Toward the Identifiability of Comparative Deep Generative Models

Deep Generative Models (DGMs) are versatile tools for learning data representations while adequately incorporating domain knowledge such as the specification of conditional probability distributions. Recently proposed DGMs tackle the…

Machine Learning · Computer Science 2024-01-30 Romain Lopez , Jan-Christian Huetter , Ehsan Hajiramezanali , Jonathan Pritchard , Aviv Regev

Learning Compositional Visual Concepts with Mutual Consistency

Compositionality of semantic concepts in image synthesis and analysis is appealing as it can help in decomposing known and generatively recomposing unknown data. For instance, we may learn concepts of changing illumination, geometry or…

Computer Vision and Pattern Recognition · Computer Science 2018-03-29 Yunye Gong , Srikrishna Karanam , Ziyan Wu , Kuan-Chuan Peng , Jan Ernst , Peter C. Doerschuk

Fantastic Features and Where to Find Them: A Probing Method to combine Features from Multiple Foundation Models

Foundation models (FMs) trained with different objectives and data learn diverse representations, making some more effective than others for specific downstream tasks. Existing adaptation strategies, such as parameter-efficient fine-tuning,…

Machine Learning · Computer Science 2025-12-02 Benjamin Ramtoula , Pierre-Yves Lajoie , Paul Newman , Daniele De Martini