Related papers: Context-aware learning for generative models

Generative Context-aware Fine-tuning of Self-supervised Speech Models

When performing tasks like automatic speech recognition or spoken language understanding for a given utterance, access to preceding text or audio provides contextual information that can improve performance. Considering the recent advances in…

Computation and Language · Computer Science 2023-12-18 Suwon Shon, Kwangyoun Kim, Prashant Sridhar, Yi-Te Hsu, Shinji Watanabe, Karen Livescu

Accuracy of Latent-Variable Estimation in Bayesian Semi-Supervised Learning

Hierarchical probabilistic models, such as Gaussian mixture models, are widely used for unsupervised learning tasks. These models consist of observable and latent variables, which represent the observable data and the underlying…

Machine Learning · Statistics 2015-03-26 Keisuke Yamazaki

Beyond MLE: Convex Learning for Text Generation

Maximum likelihood estimation (MLE) is a statistical method used to estimate the parameters of a probability distribution that best explain the observed data. In the context of text generation, MLE is often used to train generative language…

Computation and Language · Computer Science 2023-10-27 Chenze Shao, Zhengrui Ma, Min Zhang, Yang Feng

Variational Autoencoders for Collaborative Filtering

We extend variational autoencoders (VAEs) to collaborative filtering for implicit feedback. This non-linear probabilistic model enables us to go beyond the limited modeling capacity of linear factor models which still largely dominate…

Machine Learning · Statistics 2018-02-19 Dawen Liang, Rahul G. Krishnan, Matthew D. Hoffman, Tony Jebara

GAN-EM: GAN based EM learning framework

The expectation maximization (EM) algorithm finds maximum likelihood solutions for models with latent variables. A typical example is the Gaussian Mixture Model (GMM), which requires a Gaussian assumption; however, natural images are highly…

Machine Learning · Computer Science 2018-12-04 Wentian Zhao, Shaojie Wang, Zhihuai Xie, Jing Shi, Chenliang Xu

Context Aware Machine Learning

We propose a principle for exploring context in machine learning models. Starting with a simple assumption that each observation may or may not depend on its context, a conditional probability distribution is decomposed into two parts:…

Machine Learning · Computer Science 2019-01-23 Yun Zeng

On the Optimization Landscape of Maximum Mean Discrepancy

Generative models have been successfully used for generating realistic signals. Because the likelihood function is typically intractable in most of these models, the common practice is to use "implicit" models that avoid likelihood…

Machine Learning · Computer Science 2024-05-07 Itai Alon, Amir Globerson, Ami Wiesel

Estimation of Classification Rules from Partially Classified Data

We consider the situation where the observed sample contains some observations whose class of origin is known (that is, they are classified with respect to the g underlying classes of interest), and where the remaining observations in the…

Machine Learning · Statistics 2020-04-15 Geoffrey J. McLachlan, Daniel Ahfock

CMVAE: Causal Meta VAE for Unsupervised Meta-Learning

Unsupervised meta-learning aims to learn the meta knowledge from unlabeled data and rapidly adapt to novel tasks. However, existing approaches may be misled by the context-bias (e.g. background) from the training data. In this paper, we…

Machine Learning · Computer Science 2023-02-21 Guodong Qi, Huimin Yu

Unsupervised Attention-based Sentence-Level Meta-Embeddings from Contextualised Language Models

A variety of contextualised language models have been proposed in the NLP community, which are trained on diverse corpora to produce numerous Neural Language Models (NLMs). However, different NLMs have reported different levels of…

Computation and Language · Computer Science 2022-04-19 Keigo Takahashi, Danushka Bollegala

Quantum Expectation-Maximization for Gaussian Mixture Models

The Expectation-Maximization (EM) algorithm is a fundamental tool in unsupervised machine learning. It is often used as an efficient way to solve Maximum Likelihood (ML) estimation problems, especially for models with latent variables. It…

Quantum Physics · Physics 2020-07-08 Iordanis Kerenidis, Alessandro Luongo, Anupam Prakash

Learning Representations by Maximizing Mutual Information Across Views

We propose an approach to self-supervised representation learning based on maximizing mutual information between features extracted from multiple views of a shared context. For example, one could produce multiple views of a local…

Machine Learning · Computer Science 2019-07-09 Philip Bachman, R Devon Hjelm, William Buchwalter

Unsupervised Meta-Learning via In-Context Learning

Unsupervised meta-learning aims to learn feature representations from unsupervised datasets that can transfer to downstream tasks with limited labeled data. In this paper, we propose a novel approach to unsupervised meta-learning that…

Machine Learning · Computer Science 2025-02-11 Anna Vettoruzzo, Lorenzo Braccaioli, Joaquin Vanschoren, Marlena Nowaczyk

Memory-Based Meta-Learning on Non-Stationary Distributions

Memory-based meta-learning is a technique for approximating Bayes-optimal predictors. Under fairly general conditions, minimizing sequential prediction error, measured by the log loss, leads to implicit meta-learning. The goal of this work…

Machine Learning · Computer Science 2023-05-26 Tim Genewein, Grégoire Delétang, Anian Ruoss, Li Kevin Wenliang, Elliot Catt, Vincent Dutordoir, Jordi Grau-Moya, Laurent Orseau, Marcus Hutter, Joel Veness

Contextualized Machine Learning

We examine Contextualized Machine Learning (ML), a paradigm for learning heterogeneous and context-dependent effects. Contextualized ML estimates heterogeneous functions by applying deep learning to the meta-relationship between contextual…

Machine Learning · Statistics 2023-10-18 Benjamin Lengerich, Caleb N. Ellington, Andrea Rubbi, Manolis Kellis, Eric P. Xing

Provable Mixed-Noise Learning with Flow-Matching

We study Bayesian inverse problems with mixed noise, modeled as a combination of additive and multiplicative Gaussian components. While traditional inference methods often assume fixed or known noise characteristics, real-world…

Machine Learning · Computer Science 2025-10-17 Paul Hagemann, Robert Gruhlke, Bernhard Stankewitz, Claudia Schillings, Gabriele Steidl

Inference and Evaluation of the Multinomial Mixture Model for Text Clustering

In this article, we investigate the use of a probabilistic model for unsupervised clustering in text collections. Unsupervised clustering has become a basic module for many intelligent text processing applications, such as information…

Information Retrieval · Computer Science 2016-08-16 Loïs Rigouste, Olivier Cappé, François Yvon

Context-Aware Neural Gradient Mapping for Fine-Grained Instruction Processing

The integration of contextual embeddings into the optimization processes of large language models is an advancement in natural language processing. The Context-Aware Neural Gradient Mapping framework introduces a dynamic gradient adjustment…

Computation and Language · Computer Science 2025-04-25 David Boldo, Lily Pemberton, Gabriel Thistledown, Jacob Fairchild, Felix Kowalski

Semi-Supervised Anomaly Detection - Towards Model-Independent Searches of New Physics

Most classification algorithms used in high energy physics fall under the category of supervised machine learning. Such methods require a training set containing both signal and background events and are prone to classification errors…

Data Analysis, Statistics and Probability · Physics 2015-06-03 Mikael Kuusela, Tommi Vatanen, Eric Malmi, Tapani Raiko, Timo Aaltonen, Yoshikazu Nagai

Adversarially Learned Mixture Model

The Adversarially Learned Mixture Model (AMM) is a generative model for unsupervised or semi-supervised data clustering. The AMM is the first adversarially optimized method to model the conditional dependence between inferred continuous and…

Machine Learning · Statistics 2022-04-26 Andrew Jesson, Cécile Low-Kam, Tanya Nair, Florian Soudan, Florent Chandelier, Nicolas Chapados