English
Related papers

Related papers: Understanding Encoder-Decoder Structures in Machin…

200 papers

We introduce the Mutual Information Machine (MIM), a novel formulation of representation learning, using a joint distribution over the observations and latent state in an encoder/decoder framework. Our key principles are symmetry and mutual…

Machine Learning · Statistics 2019-10-10 Micha Livne , Kevin Swersky , David J. Fleet

Split learning is a privacy-preserving distributed learning paradigm in which an ML model (e.g., a neural network) is split into two parts (i.e., an encoder and a decoder). The encoder shares so-called latent representation, rather than raw…

Machine Learning · Computer Science 2023-09-07 Omar Alhussein , Moshi Wei , Arashmid Akhavain

We introduce the Mutual Information Machine (MIM), a probabilistic auto-encoder for learning joint distributions over observations and latent variables. MIM reflects three design principles: 1) low divergence, to encourage the encoder and…

Machine Learning · Computer Science 2020-02-24 Micha Livne , Kevin Swersky , David J. Fleet

A major challenge in designing efficient statistical supervised learning algorithms is finding representations that perform well not only on available training samples but also on unseen data. While the study of representation learning has…

Machine Learning · Statistics 2024-02-06 Milad Sefidgaran , Abdellatif Zaidi , Piotr Krasnowski

A grand challenge in representation learning is to learn the different explanatory factors of variation behind the high dimen- sional data. Encoder models are often determined to optimize performance on training data when the real objective…

Machine Learning · Statistics 2018-02-16 Matías Vera , Pablo Piantanida , Leonardo Rey Vega

The traditional SegNet architecture commonly encounters significant information loss during the sampling process, which detrimentally affects its accuracy in image semantic segmentation tasks. To counter this challenge, we introduce an…

Image and Video Processing · Electrical Eng. & Systems 2024-06-05 Zijun Gao , Qi Wang , Taiyuan Mei , Xiaohan Cheng , Yun Zi , Haowei Yang

Learning good representations is of crucial importance in deep learning. Mutual Information (MI) or similar measures of statistical dependence are promising tools for learning these representations in an unsupervised way. Even though the…

Audio and Speech Processing · Electrical Eng. & Systems 2019-04-09 Mirco Ravanelli , Yoshua Bengio

Deep learning systems have been reported to acheive state-of-the-art performances in many applications, and one of the keys for achieving this is the existence of well trained classifiers on benchmark datasets which can be used as backbone…

Machine Learning · Computer Science 2022-10-04 Jirong Yi , Qiaosheng Zhang , Zhen Chen , Qiao Liu , Wei Shao

Information-theoretic measures have been widely adopted in the design of features for learning and decision problems. Inspired by this, we look at the relationship between i) a weak form of information loss in the Shannon sense and ii) the…

Machine Learning · Computer Science 2022-01-03 Jorge F. Silva , Felipe Tobar , Mario Vicuña , Felipe Cordova

Deep learning systems have been reported to achieve state-of-the-art performances in many applications, and a key is the existence of well trained classifiers on benchmark datasets. As a main-stream loss function, the cross entropy can…

Machine Learning · Computer Science 2022-09-22 Jirong Yi , Qiaosheng Zhang , Zhen Chen , Qiao Liu , Wei Shao

The development of optimal and efficient machine learning-based communication systems is likely to be a key enabler of beyond 5G communication technologies. In this direction, physical layer design has been recently reformulated under a…

Information Theory · Computer Science 2021-11-16 Nunzio A. Letizia , Andrea M. Tonello

The ability of machine learning models to store input information in hidden layer vector embeddings, analogous to the concept of `memory', is widely employed but not well characterized. We find that language model embeddings typically…

Computation and Language · Computer Science 2026-05-20 Benjamin L. Badger

Lossy image compression is one of the most commonly used operators for digital images. Most recently proposed deep-learning-based image compression methods leverage the auto-encoder structure, and reach a series of promising results in this…

Computer Vision and Pattern Recognition · Computer Science 2020-07-09 Yaolong Wang , Mingqing Xiao , Chang Liu , Shuxin Zheng , Tie-Yan Liu

Despite the remarkable success of large large-scale neural networks, we still lack unified notation for thinking about and describing their representational spaces. We lack methods to reliably describe how their representations are…

Machine Learning · Computer Science 2025-06-02 Henry Conklin

We introduce an information-theoretic framework that views learning as universal prediction under log loss, characterized through regret bounds. Central to the framework is an effective notion of architecture-based model complexity, defined…

Machine Learning · Computer Science 2025-11-04 Meir Feder , Ruediger Urbanke , Yaniv Fogel

Recent advances in large language models (LLMs) have revolutionized natural language processing, yet evaluating their intrinsic linguistic understanding remains challenging. Moving beyond specialized evaluation tasks, we propose an…

Computation and Language · Computer Science 2025-06-02 Shaojie Wang , Sirui Ding , Na Zou

The body morphology plays an important role in the way information is perceived and processed by an agent. We address an information theory (IT) account on how the precision of sensors, the accuracy of motors, their placement, the body…

Robotics · Computer Science 2024-08-26 Alexandre Pitti , Kohei Nakajima , Yasuo Kuniyoshi

Recently, the Multilinear Compressive Learning (MCL) framework was proposed to efficiently optimize the sensing and learning steps when working with multidimensional signals, i.e. tensors. In Compressive Learning in general, and in MCL in…

Computer Vision and Pattern Recognition · Computer Science 2020-09-23 Dat Thanh Tran , Moncef Gabbouj , Alexandros Iosifidis

Accurately matching visual and textual data in cross-modal retrieval has been widely studied in the multimedia community. To address these challenges posited by the heterogeneity gap and the semantic gap, we propose integrating Shannon…

Computer Vision and Pattern Recognition · Computer Science 2021-04-13 Wei Chen , Yu Liu , Erwin M. Bakker , Michael S. Lew

Despite the increasing prevalence of large language models (LLMs), we still have a limited understanding of how their representational spaces are structured. This limits our ability to interpret how and what they learn or relate them to…

‹ Prev 1 2 3 10 Next ›