Related papers: Representation Learning with Conditional Informati…

An Information-theoretic Multi-task Representation Learning Framework for Natural Language Understanding

This paper proposes a new principled multi-task representation learning framework (InfoMTL) to extract noise-invariant sufficient representations for all tasks. It ensures sufficiency of shared representations for all tasks and mitigates…

Computation and Language · Computer Science 2025-03-07 Dou Hu , Lingwei Wei , Wei Zhou , Songlin Hu

Multimodal Representation Learning via Maximization of Local Mutual Information

We propose and demonstrate a representation learning approach by maximizing the mutual information between local features of images and text. The goal of this approach is to learn useful image representations by taking advantage of the rich…

Image and Video Processing · Electrical Eng. & Systems 2021-12-16 Ruizhi Liao , Daniel Moyer , Miriam Cha , Keegan Quigley , Seth Berkowitz , Steven Horng , Polina Golland , William M. Wells

Learning Representations in Reinforcement Learning:An Information Bottleneck Approach

The information bottleneck principle is an elegant and useful approach to representation learning. In this paper, we investigate the problem of representation learning in the context of reinforcement learning using the information…

Machine Learning · Computer Science 2019-11-14 Pei Yingjun , Hou Xinwen

A Mutual Information Maximization Perspective of Language Representation Learning

We show state-of-the-art word representation learning methods maximize an objective function that is a lower bound on the mutual information between different parts of a word sequence (i.e., a sentence). Our formulation provides an…

Computation and Language · Computer Science 2019-11-27 Lingpeng Kong , Cyprien de Masson d'Autume , Wang Ling , Lei Yu , Zihang Dai , Dani Yogatama

Semantic Noise Modeling for Better Representation Learning

Latent representation learned from multi-layered neural networks via hierarchical feature abstraction enables recent success of deep learning. Under the deep learning framework, generalization performance highly depends on the learned…

Machine Learning · Computer Science 2016-11-07 Hyo-Eun Kim , Sangheum Hwang , Kyunghyun Cho

InfoDiffusion: Representation Learning Using Information Maximizing Diffusion Models

While diffusion models excel at generating high-quality samples, their latent variables typically lack semantic meaning and are not suitable for representation learning. Here, we propose InfoDiffusion, an algorithm that augments diffusion…

Machine Learning · Computer Science 2023-06-16 Yingheng Wang , Yair Schiff , Aaron Gokaslan , Weishen Pan , Fei Wang , Christopher De Sa , Volodymyr Kuleshov

Conditional Meta-Learning of Linear Representations

Standard meta-learning for representation learning aims to find a common representation to be shared across multiple tasks. The effectiveness of these methods is often limited when the nuances of the tasks' distribution cannot be captured…

Machine Learning · Computer Science 2021-03-31 Giulia Denevi , Massimiliano Pontil , Carlo Ciliberto

Which Mutual-Information Representation Learning Objectives are Sufficient for Control?

Mutual information maximization provides an appealing formalism for learning representations of data. In the context of reinforcement learning (RL), such representations can accelerate learning by discarding irrelevant and redundant…

Machine Learning · Computer Science 2021-06-15 Kate Rakelly , Abhishek Gupta , Carlos Florensa , Sergey Levine

Theoretical Understanding of the Information Flow on Continual Learning Performance

Continual learning (CL) is a setting in which an agent has to learn from an incoming stream of data sequentially. CL performance evaluates the model's ability to continually learn and solve new problems with incremental available…

Machine Learning · Computer Science 2022-05-04 Josh Andle , Salimeh Yasaei Sekeh

Information-Theoretic Representation Learning for Positive-Unlabeled Classification

Recent advances in weakly supervised classification allow us to train a classifier only from positive and unlabeled (PU) data. However, existing PU classification methods typically require an accurate estimate of the class-prior…

Machine Learning · Statistics 2022-06-22 Tomoya Sakai , Gang Niu , Masashi Sugiyama

Learning Representations by Maximizing Mutual Information Across Views

We propose an approach to self-supervised representation learning based on maximizing mutual information between features extracted from multiple views of a shared context. For example, one could produce multiple views of a local…

Machine Learning · Computer Science 2019-07-09 Philip Bachman , R Devon Hjelm , William Buchwalter

Robust Representation Learning through Explicit Environment Modeling

We consider learning from labeled data collected across multiple environments, where the data distribution may vary across these environments. This problem is commonly approached from a causal perspective, seeking invariant representations…

Machine Learning · Statistics 2026-04-30 Yuli Slavutsky , David M. Blei

Learning to Find Correlated Features by Maximizing Information Flow in Convolutional Neural Networks

Training convolutional neural networks for image classification tasks usually causes information loss. Although most of the time the information lost is redundant with respect to the target task, there are still cases where discriminative…

Computer Vision and Pattern Recognition · Computer Science 2019-07-02 Wei Shen , Fei Li , Rujie Liu

Maximizing Mutual Information Across Feature and Topology Views for Learning Graph Representations

Recently, maximizing mutual information has emerged as a powerful method for unsupervised graph representation learning. The existing methods are typically effective to capture information from the topology view but ignore the feature view.…

Machine Learning · Computer Science 2022-10-12 Xiaolong Fan , Maoguo Gong , Yue Wu , Hao Li

Towards Optimal Learning of Language Models

This work studies the general principles of improving the learning of language models (LMs), which aims at reducing the necessary training steps for achieving superior performance. Specifically, we present a theory for the optimal learning…

Computation and Language · Computer Science 2024-03-05 Yuxian Gu , Li Dong , Yaru Hao , Qingxiu Dong , Minlie Huang , Furu Wei

Disentangled Representation Learning via Flow Matching

Disentangled representation learning aims to capture the underlying explanatory factors of observed data, enabling a principled understanding of the data-generating process. Recent advances in generative modeling have introduced new…

Machine Learning · Computer Science 2026-05-12 Jinjin Chi , Taoping Liu , Mengtao Yin , Ximing Li , Yongcheng Jing , Jialie Shen , Leszek Rutkowski , Dacheng Tao

Learning Robust Representations via Multi-View Information Bottleneck

The information bottleneck principle provides an information-theoretic method for representation learning, by training an encoder to retain all information which is relevant for predicting the label while minimizing the amount of other,…

Machine Learning · Computer Science 2020-02-19 Marco Federici , Anjan Dutta , Patrick Forré , Nate Kushman , Zeynep Akata

Towards Informative Few-Shot Prompt with Maximum Information Gain for In-Context Learning

Large Language models (LLMs) possess the capability to engage In-context Learning (ICL) by leveraging a few demonstrations pertaining to a new downstream task as conditions. However, this particular learning paradigm suffers from high…

Computation and Language · Computer Science 2023-10-16 Hongfu Liu , Ye Wang

Learning Robust Data Representation: A Knowledge Flow Perspective

It is always demanding to learn robust visual representation for various learning problems; however, this learning and maintenance process usually suffers from noise, incompleteness or knowledge domain mismatch. Thus, robust representation…

Machine Learning · Computer Science 2020-04-28 Zhengming Ding , Ming Shao , Handong Zhao , Sheng Li

Robust Speech Representation Learning via Flow-based Embedding Regularization

Over the recent years, various deep learning-based methods were proposed for extracting a fixed-dimensional embedding vector from speech signals. Although the deep learning-based embedding extraction methods have shown good performance in…

Audio and Speech Processing · Electrical Eng. & Systems 2021-12-08 Woo Hyun Kang , Jahangir Alam , Abderrahim Fathan