English
Related papers

Related papers: Information Theoretic-Learning Auto-Encoder

200 papers

Deep Neural Networks inherit spurious correlations embedded in training data and hence may fail to predict desired labels on unseen domains (or environments), which have different distributions from the domain used in training. Invariance…

Machine Learning · Statistics 2022-03-30 Shoji Toyota , Kenji Fukumizu

Reinforcement learning (RL) provides a powerful framework for decision-making, but its application in practice often requires a carefully designed reward function. Adversarial Imitation Learning (AIL) sheds light on automatic policy…

Machine Learning · Computer Science 2024-02-05 Kaifeng Zhang , Rui Zhao , Ziming Zhang , Yang Gao

This work develops problem statements related to encoders and autoencoders with the goal of elucidating variational formulations and establishing clear connections to information-theoretic concepts. Specifically, four problems with varying…

Information Theory · Computer Science 2021-07-15 Karthik Duraisamy

As AI systems become increasingly autonomous, aligning their decision-making to human preferences is essential. In domains like autonomous driving or robotics, it is impossible to write down the reward function representing these…

Machine Learning · Computer Science 2025-01-03 Ondrej Bajgar , Sid William Gould , Rohan Narayan Langford Mitta , Jonathon Liu , Oliver Newcombe , Jack Golden

In disentangled representation learning, a model is asked to tease apart a dataset's underlying sources of variation and represent them independently of one another. Since the model is provided with no ground truth information about these…

Machine Learning · Computer Science 2023-10-24 Kyle Hsu , Will Dorrell , James C. R. Whittington , Jiajun Wu , Chelsea Finn

Incremental Task learning (ITL) is a category of continual learning that seeks to train a single network for multiple tasks (one after another), where training data for each task is only available during the training of that task. Neural…

Computer Vision and Pattern Recognition · Computer Science 2022-07-20 Rakib Hyder , Ken Shao , Boyu Hou , Panos Markopoulos , Ashley Prater-Bennette , M. Salman Asif

Contrastive Learning (CL) has been proved to be a powerful self-supervised approach for a wide range of domains, including computer vision and graph representation learning. However, the incremental learning issue of CL has rarely been…

Machine Learning · Computer Science 2023-01-31 Cheng Ji , Jianxin Li , Hao Peng , Jia Wu , Xingcheng Fu , Qingyun Sun , Phillip S. Yu

Inverse Reinforcement Learning (IRL) aims to facilitate a learner's ability to imitate expert behavior by acquiring reward functions that explain the expert's decisions. Regularized IRL applies strongly convex regularizers to the learner's…

Machine Learning · Computer Science 2020-12-04 Wonseok Jeon , Chen-Yang Su , Paul Barde , Thang Doan , Derek Nowrouzezahrai , Joelle Pineau

Inverse Reinforcement Learning (IRL) learns an optimal policy, given some expert demonstrations, thus avoiding the need for the tedious process of specifying a suitable reward function. However, current methods are constrained by at least…

Machine Learning · Computer Science 2023-11-16 Pierre Le Pelletier de Woillemont , Rémi Labory , Vincent Corruble

Text generation is a crucial task in NLP. Recently, several adversarial generative models have been proposed to improve the exposure bias problem in text generation. Though these models gain great success, they still suffer from the…

Computation and Language · Computer Science 2018-06-08 Zhan Shi , Xinchi Chen , Xipeng Qiu , Xuanjing Huang

We present a novel framework that can combine multi-domain learning (MDL), data imputation (DI) and multi-task learning (MTL) to improve performance for classification and regression tasks in different domains. The core of our method is an…

Machine Learning · Computer Science 2020-03-18 Andre Mendes , Julian Togelius , Leandro dos Santos Coelho

Continual learning has become increasingly important as it enables NLP models to constantly learn and gain knowledge over time. Previous continual learning methods are mainly designed to preserve knowledge from previous tasks, without much…

Computation and Language · Computer Science 2021-06-14 Yufan Huang , Yanzhe Zhang , Jiaao Chen , Xuezhi Wang , Diyi Yang

Auto-encoding generative adversarial networks (GANs) combine the standard GAN algorithm, which discriminates between real and model-generated data, with a reconstruction loss given by an auto-encoder. Such models aim to prevent mode…

Machine Learning · Statistics 2017-10-24 Mihaela Rosca , Balaji Lakshminarayanan , David Warde-Farley , Shakir Mohamed

Information divergence that measures the difference between two nonnegative matrices or tensors has found its use in a variety of machine learning problems. Examples are Nonnegative Matrix/Tensor Factorization, Stochastic Neighbor…

Machine Learning · Computer Science 2014-06-06 Onur Dikmen , Zhirong Yang , Erkki Oja

Many medical datasets have recently been created for medical image segmentation tasks, and it is natural to question whether we can use them to sequentially train a single model that (1) performs better on all these datasets, and (2)…

Computer Vision and Pattern Recognition · Computer Science 2022-08-02 Chenyu You , Jinlin Xiang , Kun Su , Xiaoran Zhang , Siyuan Dong , John Onofrey , Lawrence Staib , James S. Duncan

Many imitation learning (IL) algorithms use inverse reinforcement learning (IRL) to infer a reward function that aligns with the demonstration. However, the inferred reward functions often fail to capture the underlying task objectives. In…

Machine Learning · Computer Science 2024-11-01 Weichao Zhou , Wenchao Li

We study the question: How can we select the right data for fine-tuning to a specific task? We call this data selection problem active fine-tuning and show that it is an instance of transductive active learning, a novel generalization of…

Machine Learning · Computer Science 2024-06-24 Jonas Hübotter , Bhavya Sukhija , Lenart Treven , Yarden As , Andreas Krause

A new form of variational autoencoder (VAE) is developed, in which the joint distribution of data and codes is considered in two (symmetric) forms: ($i$) from observed data fed through the encoder to yield codes, and ($ii$) from latent…

Machine Learning · Computer Science 2017-11-21 Yunchen Pu , Weiyao Wang , Ricardo Henao , Liqun Chen , Zhe Gan , Chunyuan Li , Lawrence Carin

Machine unlearning offers a promising solution to privacy and safety concerns in large language models (LLMs) by selectively removing targeted knowledge while preserving utility. However, current methods are highly sensitive to downstream…

Learning invariant representations is a critical first step in a number of machine learning tasks. A common approach corresponds to the so-called information bottleneck principle in which an application dependent function of mutual…

Machine Learning · Computer Science 2021-02-17 Aditya Kumar Akash , Vishnu Suresh Lokhande , Sathya N. Ravi , Vikas Singh
‹ Prev 1 2 3 10 Next ›