Related papers: Compositional Distributed Learning for Multi-View …

Federated Representation Learning via Maximal Coding Rate Reduction

We propose a federated methodology to learn low-dimensional representations from a dataset that is distributed among several clients. In particular, we move away from the commonly-used cross-entropy loss in federated learning, and seek to…

Machine Learning · Computer Science 2022-10-04 Juan Cervino , Navid NaderiAlizadeh , Alejandro Ribeiro

Efficient Decoding of Compositional Structure in Holistic Representations

We investigate the task of retrieving information from compositional distributed representations formed by Hyperdimensional Computing/Vector Symbolic Architectures and present novel techniques which achieve new information rate bounds.…

Neural and Evolutionary Computing · Computer Science 2023-05-29 Denis Kleyko , Connor Bybee , Ping-Chen Huang , Christopher J. Kymn , Bruno A. Olshausen , E. Paxon Frady , Friedrich T. Sommer

Unlocking Compositional Generalization in Continual Few-Shot Learning

Object-centric representations promise a key property for few-shot learning: Rather than treating a scene as a single unit, a model can decompose it into individual object-level parts that can be matched and compared across different…

Machine Learning · Computer Science 2026-05-19 Phu-Quy Nguyen-Lam , Phu-Hoa Pham , Dao Sy Duy Minh , Chi-Nguyen Tran , Huynh Trung Kiet , Long Tran-Thanh

Enhancing the Rate-Distortion-Perception Flexibility of Learned Image Codecs with Conditional Diffusion Decoders

Learned image compression codecs have recently achieved impressive compression performances surpassing the most efficient image coding architectures. However, most approaches are trained to minimize rate and distortion which often leads to…

Computer Vision and Pattern Recognition · Computer Science 2024-03-06 Daniele Mari , Simone Milani

Learned Disentangled Latent Representations for Scalable Image Coding for Humans and Machines

As an increasing amount of image and video content will be analyzed by machines, there is demand for a new codec paradigm that is capable of compressing visual input primarily for the purpose of computer vision inference, while secondarily…

Image and Video Processing · Electrical Eng. & Systems 2023-01-12 Ezgi Ozyilkan , Mateen Ulhaq , Hyomin Choi , Fabien Racape

Learning to Compose: Improving Object Centric Learning by Injecting Compositionality

Learning compositional representation is a key aspect of object-centric learning as it enables flexible systematic generalization and supports complex visual reasoning. However, most of the existing approaches rely on auto-encoding…

Computer Vision and Pattern Recognition · Computer Science 2025-11-11 Whie Jung , Jaehoon Yoo , Sungjin Ahn , Seunghoon Hong

Learning Compositional Representations for Few-Shot Recognition

One of the key limitations of modern deep learning approaches lies in the amount of data required to train them. Humans, by contrast, can learn to recognize novel categories from just a few examples. Instrumental to this rapid learning…

Computer Vision and Pattern Recognition · Computer Science 2019-08-20 Pavel Tokmakov , Yu-Xiong Wang , Martial Hebert

Learning Diverse and Discriminative Representations via the Principle of Maximal Coding Rate Reduction

To learn intrinsic low-dimensional structures from high-dimensional data that most discriminate between classes, we propose the principle of Maximal Coding Rate Reduction ($\text{MCR}^2$), an information-theoretic measure that maximizes the…

Machine Learning · Computer Science 2020-06-16 Yaodong Yu , Kwan Ho Ryan Chan , Chong You , Chaobing Song , Yi Ma

Incomplete Multi-View Multi-Label Classification via Shared Codebook and Fused-Teacher Self-Distillation

Although multi-view multi-label learning has been extensively studied, research on the dual-missing scenario, where both views and labels are incomplete, remains largely unexplored. Existing methods mainly rely on contrastive learning or…

Computer Vision and Pattern Recognition · Computer Science 2026-04-07 Xu Yan , Jun Yin , Shiliang Sun , Minghua Wan

Compositional Scene Representation Learning via Reconstruction: A Survey

Visual scenes are composed of visual concepts and have the property of combinatorial explosion. An important reason for humans to efficiently learn from diverse visual scenes is the ability of compositional perception, and it is desirable…

Machine Learning · Computer Science 2023-06-16 Jinyang Yuan , Tonglin Chen , Bin Li , Xiangyang Xue

Multimodal Skeleton-Based Action Representation Learning via Decomposition and Composition

Multimodal human action understanding is a significant problem in computer vision, with the central challenge being the effective utilization of the complementarity among diverse modalities while maintaining model efficiency. However, most…

Computer Vision and Pattern Recognition · Computer Science 2026-03-11 Hongsong Wang , Heng Fei , Bingxuan Dai , Jie Gui

Learned Image Compression for Machine Perception

Recent work has shown that learned image compression strategies can outperform standard hand-crafted compression algorithms that have been developed over decades of intensive research on the rate-distortion trade-off. With growing…

Image and Video Processing · Electrical Eng. & Systems 2021-11-04 Felipe Codevilla , Jean Gabriel Simard , Ross Goroshin , Chris Pal

When A Conventional Filter Meets Deep Learning: Basis Composition Learning on Image Filters

Image filters are fast, lightweight and effective, which make these conventional wisdoms preferable as basic tools in vision tasks. In practical scenarios, users have to tweak parameters multiple times to obtain satisfied results. This…

Computer Vision and Pattern Recognition · Computer Science 2022-03-02 Fu Lee Wang , Yidan Feng , Haoran Xie , Gary Cheng , Mingqiang Wei

Self-Supervised Representation Learning With MUlti-Segmental Informational Coding (MUSIC)

Self-supervised representation learning maps high-dimensional data into a meaningful embedding space, where samples of similar semantic contents are close to each other. Most of the recent representation learning methods maximize cosine…

Computer Vision and Pattern Recognition · Computer Science 2022-06-15 Chuang Niu , Ge Wang

Multi-View Task-Driven Recognition in Visual Sensor Networks

Nowadays, distributed smart cameras are deployed for a wide set of tasks in several application scenarios, ranging from object recognition, image retrieval, and forensic applications. Due to limited bandwidth in distributed systems,…

Computer Vision and Pattern Recognition · Computer Science 2017-06-02 Ali Taalimi , Alireza Rahimpour , Liu Liu , Hairong Qi

Learning a Compressive Sensing Matrix with Structural Constraints via Maximum Mean Discrepancy Optimization

We introduce a learning-based algorithm to obtain a measurement matrix for compressive sensing related recovery problems. The focus lies on matrices with a constant modulus constraint which typically represent a network of analog phase…

Signal Processing · Electrical Eng. & Systems 2021-10-15 Michael Koller , Wolfgang Utschick

MIXER: A Principled Framework for Multimodal, Multiway Data Association

A fundamental problem in robotic perception is matching identical objects or data, with applications such as loop closure detection, place recognition, object tracking, and map fusion. While the problem becomes considerably more challenging…

Robotics · Computer Science 2021-12-01 Parker C. Lusk , Ronak Roy , Kaveh Fathian , Jonathan P. How

Limited Memory Prediction for Linear Systems with Different types of Observation

This paper is concerned with distributed limited memory prediction for continuous-time linear stochastic systems with multiple sensors. A distributed fusion with the weighted sum structure is applied to the optimal local limited memory…

Other Computer Science · Computer Science 2010-02-18 Ha-ryong Song , Vladimir Shin

CodeBind: Decoupled Representation Learning for Multimodal Alignment with Unified Compositional Codebook

Multimodal representation alignment is pivotal for large language models and robotics. Traditional methods are often hindered by cross-modal information discrepancies and data scarcity, leading to suboptimal alignment spaces that overlook…

Computer Vision and Pattern Recognition · Computer Science 2026-05-19 Zeyu Chen , Jie Li , Kai Han

Compositional Mixture Representations for Vision and Text

Learning a common representation space between vision and language allows deep networks to relate objects in the image to the corresponding semantic meaning. We present a model that learns a shared Gaussian mixture representation imposing…

Computer Vision and Pattern Recognition · Computer Science 2022-06-15 Stephan Alaniz , Marco Federici , Zeynep Akata