Related papers: Vector Quantized Contrastive Predictive Coding for…

Learning source-aware representations of music in a discrete latent space

In recent years, neural network based methods have been proposed as a method that cangenerate representations from music, but they are not human readable and hardly analyzable oreditable by a human. To address this issue, we propose a novel…

Audio and Speech Processing · Electrical Eng. & Systems 2021-11-29 Jinsung Kim , Yeong-Seok Jeong , Woosung Choi , Jaehwa Chung , Soonyoung Jung

Vector-Quantized Prompt Learning for Paraphrase Generation

Deep generative modeling of natural languages has achieved many successes, such as producing fluent sentences and translating from one language into another. However, the development of generative modeling techniques for paraphrase…

Computation and Language · Computer Science 2023-11-28 Haotian Luo , Yixin Liu , Peidong Liu , Xianggen Liu

Rethinking Recurrent Latent Variable Model for Music Composition

We present a model for capturing musical features and creating novel sequences of music, called the Convolutional Variational Recurrent Neural Network. To generate sequential data, the model uses an encoder-decoder architecture with latent…

Sound · Computer Science 2018-10-09 Eunjeong Stella Koh , Shlomo Dubnov , Dustin Wright

Improved Variational Inference in Discrete VAEs using Error Correcting Codes

Despite advances in deep probabilistic models, learning discrete latent representations remains challenging. This work introduces a novel method to improve inference in discrete Variational Autoencoders by reframing the inference problem…

Machine Learning · Computer Science 2025-06-11 María Martínez-García , Grace Villacrés , David Mitchell , Pablo M. Olmos

Variational Template Machine for Data-to-Text Generation

How to generate descriptions from structured data organized in tables? Existing approaches using neural encoder-decoder models often suffer from lacking diversity. We claim that an open set of templates is crucial for enriching the phrase…

Computation and Language · Computer Science 2020-02-14 Rong Ye , Wenxian Shi , Hao Zhou , Zhongyu Wei , Lei Li

Sampling Variations of Lead Sheets

Machine-learning techniques have been recently used with spectacular results to generate artefacts such as music or text. However, these techniques are still unable to capture and generate artefacts that are convincingly structured. In this…

Artificial Intelligence · Computer Science 2017-03-03 Pierre Roy , Alexandre Papadopoulos , François Pachet

Vector Quantized Wasserstein Auto-Encoder

Learning deep discrete latent presentations offers a promise of better symbolic and summarized abstractions that are more useful to subsequent downstream tasks. Inspired by the seminal Vector Quantized Variational Auto-Encoder (VQ-VAE),…

Machine Learning · Computer Science 2023-06-21 Tung-Long Vuong , Trung Le , He Zhao , Chuanxia Zheng , Mehrtash Harandi , Jianfei Cai , Dinh Phung

Early Quantization Shrinks Codebook: A Simple Fix for Diversity-Preserving Tokenization

Vector quantization is a technique in machine learning that discretizes continuous representations into a set of discrete vectors. It is widely employed in tokenizing data representations for large language models, diffusion models, and…

Machine Learning · Computer Science 2026-03-19 Wenhao Zhao , Qiran Zou , Rushi Shah , Yudi Wu , Zhouhan Lin , Dianbo Liu

Descriminative-Generative Custom Tokens for Vision-Language Models

This paper explores the possibility of learning custom tokens for representing new concepts in Vision-Language Models (VLMs). Our aim is to learn tokens that can be effective for both discriminative and generative tasks while composing well…

Computer Vision and Pattern Recognition · Computer Science 2025-02-18 Pramuditha Perera , Matthew Trager , Luca Zancato , Alessandro Achille , Stefano Soatto

Robust Training of Vector Quantized Bottleneck Models

In this paper we demonstrate methods for reliable and efficient training of discrete representation using Vector-Quantized Variational Auto-Encoder models (VQ-VAEs). Discrete latent variable models have been shown to learn nontrivial…

Machine Learning · Computer Science 2024-09-13 Adrian Łańcucki , Jan Chorowski , Guillaume Sanchez , Ricard Marxer , Nanxin Chen , Hans J. G. A. Dolfing , Sameer Khurana , Tanel Alumäe , Antoine Laurent

Grammar Variational Autoencoder

Deep generative models have been wildly successful at learning coherent latent representations for continuous data such as video and audio. However, generative modeling of discrete data such as arithmetic expressions and molecular…

Machine Learning · Statistics 2017-03-07 Matt J. Kusner , Brooks Paige , José Miguel Hernández-Lobato

Compound Word Transformer: Learning to Compose Full-Song Music over Dynamic Directed Hypergraphs

To apply neural sequence models such as the Transformers to music generation tasks, one has to represent a piece of music by a sequence of tokens drawn from a finite set of pre-defined vocabulary. Such a vocabulary usually involves tokens…

Sound · Computer Science 2021-01-08 Wen-Yi Hsiao , Jen-Yu Liu , Yin-Cheng Yeh , Yi-Hsuan Yang

Semi-supervised Disentanglement with Independent Vector Variational Autoencoders

We aim to separate the generative factors of data into two latent vectors in a variational autoencoder. One vector captures class factors relevant to target classification tasks, while the other vector captures style factors relevant to the…

Machine Learning · Computer Science 2020-03-17 Bo-Kyeong Kim , Sungjin Park , Geonmin Kim , Soo-Young Lee

A Classifying Variational Autoencoder with Application to Polyphonic Music Generation

The variational autoencoder (VAE) is a popular probabilistic generative model. However, one shortcoming of VAEs is that the latent variables cannot be discrete, which makes it difficult to generate data from different modes of a…

Machine Learning · Statistics 2017-11-21 Jay A. Hennig , Akash Umakantha , Ryan C. Williamson

Variational Quantum Latent Encoding for Topology Optimization

A variational framework for structural topology optimization is developed, integrating quantum and classical latent encoding strategies within a coordinate-based neural decoding architecture. In this approach, a low-dimensional latent…

Computational Engineering, Finance, and Science · Computer Science 2025-06-24 Alireza Tabarraei

Trajectory Forecasting through Low-Rank Adaptation of Discrete Latent Codes

Trajectory forecasting is crucial for video surveillance analytics, as it enables the anticipation of future movements for a set of agents, e.g. basketball players engaged in intricate interactions with long-term intentions. Deep generative…

Computer Vision and Pattern Recognition · Computer Science 2024-08-30 Riccardo Benaglia , Angelo Porrello , Pietro Buzzega , Simone Calderara , Rita Cucchiara

Object-level Visual Prompts for Compositional Image Generation

We introduce a method for composing object-level visual prompts within a text-to-image diffusion model. Our approach addresses the task of generating semantically coherent compositions across diverse scenes and styles, similar to the…

Computer Vision and Pattern Recognition · Computer Science 2025-01-03 Gaurav Parmar , Or Patashnik , Kuan-Chieh Wang , Daniil Ostashev , Srinivasa Narasimhan , Jun-Yan Zhu , Daniel Cohen-Or , Kfir Aberman

Neural Discrete Representation Learning

Learning useful representations without supervision remains a key challenge in machine learning. In this paper, we propose a simple yet powerful generative model that learns such discrete representations. Our model, the Vector…

Machine Learning · Computer Science 2018-05-31 Aaron van den Oord , Oriol Vinyals , Koray Kavukcuoglu

Unsupervised Controllable Generation with Self-Training

Recent generative adversarial networks (GANs) are able to generate impressive photo-realistic images. However, controllable generation with GANs remains a challenging research problem. Achieving controllable generation requires semantically…

Machine Learning · Computer Science 2021-05-04 Grigorios G Chrysos , Jean Kossaifi , Zhiding Yu , Anima Anandkumar

Training-Free Vector Quantization via Gaussian VAEs

Vector-quantized variational autoencoders (VQ-VAEs) are discrete autoencoders that compress images into discrete tokens. However, they are difficult to train due to discretization. In this paper, we propose a simple yet effective technique…

Machine Learning · Computer Science 2026-05-27 Tongda Xu , Wendi Zheng , Jiajun He , Jose Miguel Hernandez-Lobato , Yan Wang , Ya-Qin Zhang , Jie Tang