Related papers: KERMIT: Generative Insertion-Based Modeling for Se…

Learning the joint distribution of two sequences using little or no paired data

We present a noisy channel generative model of two sequences, for example text and speech, which enables uncovering the association between the two modalities when limited paired data is available. To address the intractability of the exact…

Machine Learning · Computer Science 2022-12-07 Soroosh Mariooryad, Matt Shannon, Siyuan Ma, Tom Bagby, David Kao, Daisy Stanton, Eric Battenberg, RJ Skerry-Ryan

GAMMT: Generative Ambiguity Modeling Using Multiple Transformers

We introduce a novel model called GAMMT (Generative Ambiguity Models using Multiple Transformers) for sequential data that is based on sets of probabilities. Unlike conventional models, our approach acknowledges that the data generation…

Machine Learning · Computer Science 2023-04-05 Xingcheng Xu

A Generalized Framework of Sequence Generation with Application to Undirected Sequence Models

Undirected neural sequence models such as BERT (Devlin et al., 2019) have received renewed interest due to their success on discriminative natural language understanding tasks such as question-answering and natural language inference. The…

Machine Learning · Computer Science 2020-02-10 Elman Mansimov, Alex Wang, Sean Welleck, Kyunghyun Cho

Generative forecasting with joint probability models

Chaotic dynamical systems exhibit strong sensitivity to initial conditions and often contain unresolved multiscale processes, making deterministic forecasting fundamentally limited. Generative models offer an appealing alternative by…

Machine Learning · Computer Science 2026-01-01 Patrick Wyrod, Ashesh Chattopadhyay, Daniele Venturi

GenMed: A Pairwise Generative Reformulation of Medical Diagnostic Tasks

Data-driven medical AI is traditionally formulated as a discriminative mapping from input $X$ to output $Y$ via a learned function $f$, which does not generalize well across heterogeneous data and modalities encountered in real-world…

Computer Vision and Pattern Recognition · Computer Science 2026-05-12 Hantao Zhang, Weidong Guo, Yuhe Liu, Jiancheng Yang, Sathvik Bhagavan, Danli Shi, Mingda Xu, Pascal Fua

A Universal Marginalizer for Amortized Inference in Generative Models

We consider the problem of inference in a causal generative model where the set of available observations differs between data instances. We show how combining samples drawn from the graphical model with an appropriate masking function…

Machine Learning · Computer Science 2017-11-03 Laura Douglas, Iliyan Zarov, Konstantinos Gourgoulias, Chris Lucas, Chris Hart, Adam Baker, Maneesh Sahani, Yura Perov, Saurabh Johri

CropMix: Sampling a Rich Input Distribution via Multi-Scale Cropping

We present a simple method, CropMix, for the purpose of producing a rich input distribution from the original dataset distribution. Unlike single random cropping, which may inadvertently capture only limited information, or irrelevant…

Computer Vision and Pattern Recognition · Computer Science 2022-06-01 Junlin Han, Lars Petersson, Hongdong Li, Ian Reid

Unsupervised Generative Modeling Using Matrix Product States

Generative modeling, which learns joint probability distribution from data and generates samples according to it, is an important task in machine learning and artificial intelligence. Inspired by probabilistic interpretation of quantum…

Statistical Mechanics · Physics 2018-07-20 Zhao-Yu Han, Jun Wang, Heng Fan, Lei Wang, Pan Zhang

Conditional Inference for Multivariate Generalised Linear Mixed Models

We propose a method for inference in generalised linear mixed models (GLMMs) and several extensions of these models. First, we extend the GLMM by allowing the distribution of the random components to be non-Gaussian, that is, assuming an…

Methodology · Statistics 2021-07-27 Jeanett S. Pelck, Rodrigo Labouriau

Distributive Pre-Training of Generative Modeling Using Matrix-Product States

Tensor networks have recently found applications in machine learning for both supervised learning and unsupervised learning. The most common approaches for training these models are gradient descent methods. In this work, we consider an…

Machine Learning · Computer Science 2023-06-27 Sheng-Hsuan Lin, Olivier Kuijpers, Sebastian Peterhansl, Frank Pollmann

Query Training: Learning a Worse Model to Infer Better Marginals in Undirected Graphical Models with Hidden Variables

Probabilistic graphical models (PGMs) provide a compact representation of knowledge that can be queried in a flexible way: after learning the parameters of a graphical model once, new probabilistic queries can be answered at test time…

Machine Learning · Statistics 2021-03-01 Miguel Lázaro-Gredilla, Wolfgang Lehrach, Nishad Gothoskar, Guangyao Zhou, Antoine Dedieu, Dileep George

Neuro-SERKET: Development of Integrative Cognitive System through the Composition of Deep Probabilistic Generative Models

This paper describes a framework for the development of an integrative cognitive system based on probabilistic generative models (PGMs) called Neuro-SERKET. Neuro-SERKET is an extension of SERKET, which can compose elemental PGMs developed…

Machine Learning · Computer Science 2023-01-18 Tadahiro Taniguchi, Tomoaki Nakamura, Masahiro Suzuki, Ryo Kuniyasu, Kaede Hayashi, Akira Taniguchi, Takato Horii, Takayuki Nagai

A Generative Model of Words and Relationships from Multiple Sources

Neural language models are a powerful tool to embed words into semantic vector spaces. However, learning such models generally relies on the availability of abundant and diverse training examples. In highly specialised domains this…

Computation and Language · Computer Science 2015-12-04 Stephanie L. Hyland, Theofanis Karaletsos, Gunnar Rätsch

Generative modeling of conditional probability distributions on the level-sets of collective variables

Given a probability distribution $\mu$ in $\mathbb{R}^d$ represented by data, we study in this paper the generative modeling of the corresponding conditional probability distributions on the level-sets of a collective variable…

Machine Learning · Statistics 2026-03-30 Fatima-Zahrae Akhyar, Wei Zhang, Gabriel Stoltz, Christof Schütte

Implicit Modeling -- A Generalization of Discriminative and Generative Approaches

We propose a new modeling approach that is a generalization of generative and discriminative models. The core idea is to use an implicit parameterization of a joint probability distribution by specifying only the conditional distributions.…

Machine Learning · Computer Science 2016-12-06 Dmitrij Schlesinger, Carsten Rother

Unifying Autoregressive and Diffusion-Based Sequence Generation

We present significant extensions to diffusion-based sequence generation models, blurring the line with autoregressive language models. We introduce hyperschedules, which assign distinct noise schedules to individual token positions,…

Machine Learning · Computer Science 2025-10-08 Nima Fathi, Torsten Scholak, Pierre-André Noël

InsNet: An Efficient, Flexible, and Performant Insertion-based Text Generation Model

We propose InsNet, an expressive insertion-based text generator with efficient training and flexible decoding (parallel or sequential). Unlike most existing insertion-based text generation works that require re-encoding of the context after…

Computation and Language · Computer Science 2022-10-18 Sidi Lu, Tao Meng, Nanyun Peng

Deep Generative Model for Joint Alignment and Word Representation

This work exploits translation data as a source of semantically relevant learning signal for models of word representation. In particular, we exploit equivalence through translation as a form of distributed context and jointly learn how to…

Computation and Language · Computer Science 2018-04-24 Miguel Rios, Wilker Aziz, Khalil Sima'an

Learning a Generative Motion Model from Image Sequences based on a Latent Motion Matrix

We propose to learn a probabilistic motion model from a sequence of images for spatio-temporal registration. Our model encodes motion in a low-dimensional probabilistic space - the motion matrix - which enables various motion analysis tasks…

Computer Vision and Pattern Recognition · Computer Science 2021-02-02 Julian Krebs, Hervé Delingette, Nicholas Ayache, Tommaso Mansi

SeDyT: A General Framework for Multi-Step Event Forecasting via Sequence Modeling on Dynamic Entity Embeddings

Temporal Knowledge Graphs store events in the form of subjects, relations, objects, and timestamps which are often represented by dynamic heterogeneous graphs. Event forecasting is a critical and challenging task in Temporal Knowledge Graph…

Machine Learning · Computer Science 2021-09-13 Hongkuan Zhou, James Orme-Rogers, Rajgopal Kannan, Viktor Prasanna