English
Related papers

Related papers: KERMIT: Generative Insertion-Based Modeling for Se…

200 papers

We present a noisy channel generative model of two sequences, for example text and speech, which enables uncovering the association between the two modalities when limited paired data is available. To address the intractability of the exact…

Machine Learning · Computer Science 2022-12-07 Soroosh Mariooryad , Matt Shannon , Siyuan Ma , Tom Bagby , David Kao , Daisy Stanton , Eric Battenberg , RJ Skerry-Ryan

We introduce a novel model called GAMMT (Generative Ambiguity Models using Multiple Transformers) for sequential data that is based on sets of probabilities. Unlike conventional models, our approach acknowledges that the data generation…

Machine Learning · Computer Science 2023-04-05 Xingcheng Xu

Undirected neural sequence models such as BERT (Devlin et al., 2019) have received renewed interest due to their success on discriminative natural language understanding tasks such as question-answering and natural language inference. The…

Machine Learning · Computer Science 2020-02-10 Elman Mansimov , Alex Wang , Sean Welleck , Kyunghyun Cho

Chaotic dynamical systems exhibit strong sensitivity to initial conditions and often contain unresolved multiscale processes, making deterministic forecasting fundamentally limited. Generative models offer an appealing alternative by…

Machine Learning · Computer Science 2026-01-01 Patrick Wyrod , Ashesh Chattopadhyay , Daniele Venturi

Data-driven medical AI is traditionally formulated as a discriminative mapping from input $X$ to output $Y$ via a learned function $f$, which does not generalize well across heterogeneous data and modalities encountered in real-world…

Computer Vision and Pattern Recognition · Computer Science 2026-05-12 Hantao Zhang , Weidong Guo , Yuhe Liu , Jiancheng Yang , Sathvik Bhagavan , Danli Shi , Mingda Xu , Pascal Fua

We consider the problem of inference in a causal generative model where the set of available observations differs between data instances. We show how combining samples drawn from the graphical model with an appropriate masking function…

We present a simple method, CropMix, for the purpose of producing a rich input distribution from the original dataset distribution. Unlike single random cropping, which may inadvertently capture only limited information, or irrelevant…

Computer Vision and Pattern Recognition · Computer Science 2022-06-01 Junlin Han , Lars Petersson , Hongdong Li , Ian Reid

Generative modeling, which learns joint probability distribution from data and generates samples according to it, is an important task in machine learning and artificial intelligence. Inspired by probabilistic interpretation of quantum…

Statistical Mechanics · Physics 2018-07-20 Zhao-Yu Han , Jun Wang , Heng Fan , Lei Wang , Pan Zhang

We propose a method for inference in generalised linear mixed models (GLMMs) and several extensions of these models. First, we extend the GLMM by allowing the distribution of the random components to be non-Gaussian, that is, assuming an…

Methodology · Statistics 2021-07-27 Jeanett S. Pelck , Rodrigo Labouriau

Tensor networks have recently found applications in machine learning for both supervised learning and unsupervised learning. The most common approaches for training these models are gradient descent methods. In this work, we consider an…

Machine Learning · Computer Science 2023-06-27 Sheng-Hsuan Lin , Olivier Kuijpers , Sebastian Peterhansl , Frank Pollmann

Probabilistic graphical models (PGMs) provide a compact representation of knowledge that can be queried in a flexible way: after learning the parameters of a graphical model once, new probabilistic queries can be answered at test time…

This paper describes a framework for the development of an integrative cognitive system based on probabilistic generative models (PGMs) called Neuro-SERKET. Neuro-SERKET is an extension of SERKET, which can compose elemental PGMs developed…

Neural language models are a powerful tool to embed words into semantic vector spaces. However, learning such models generally relies on the availability of abundant and diverse training examples. In highly specialised domains this…

Computation and Language · Computer Science 2015-12-04 Stephanie L. Hyland , Theofanis Karaletsos , Gunnar Rätsch

Given a probability distribution $\mu$ in $\mathbb{R}^d$ represented by data, we study in this paper the generative modeling of the corresponding conditional probability distributions on the level-sets of a collective variable…

Machine Learning · Statistics 2026-03-30 Fatima-Zahrae Akhyar , Wei Zhang , Gabriel Stoltz , Christof Schütte

We propose a new modeling approach that is a generalization of generative and discriminative models. The core idea is to use an implicit parameterization of a joint probability distribution by specifying only the conditional distributions.…

Machine Learning · Computer Science 2016-12-06 Dmitrij Schlesinger , Carsten Rother

We present significant extensions to diffusion-based sequence generation models, blurring the line with autoregressive language models. We introduce hyperschedules, which assign distinct noise schedules to individual token positions,…

Machine Learning · Computer Science 2025-10-08 Nima Fathi , Torsten Scholak , Pierre-André Noël

We propose InsNet, an expressive insertion-based text generator with efficient training and flexible decoding (parallel or sequential). Unlike most existing insertion-based text generation works that require re-encoding of the context after…

Computation and Language · Computer Science 2022-10-18 Sidi Lu , Tao Meng , Nanyun Peng

This work exploits translation data as a source of semantically relevant learning signal for models of word representation. In particular, we exploit equivalence through translation as a form of distributed context and jointly learn how to…

Computation and Language · Computer Science 2018-04-24 Miguel Rios , Wilker Aziz , Khalil Sima'an

We propose to learn a probabilistic motion model from a sequence of images for spatio-temporal registration. Our model encodes motion in a low-dimensional probabilistic space - the motion matrix - which enables various motion analysis tasks…

Computer Vision and Pattern Recognition · Computer Science 2021-02-02 Julian Krebs , Hervé Delingette , Nicholas Ayache , Tommaso Mansi

Temporal Knowledge Graphs store events in the form of subjects, relations, objects, and timestamps which are often represented by dynamic heterogeneous graphs. Event forecasting is a critical and challenging task in Temporal Knowledge Graph…

Machine Learning · Computer Science 2021-09-13 Hongkuan Zhou , James Orme-Rogers , Rajgopal Kannan , Viktor Prasanna
‹ Prev 1 2 3 10 Next ›