Related papers: Compressed Set Representations based on Set Differ…

Compact Representations of Event Sequences

We introduce a new technique for the efficient management of large sequences of multidimensional data, which takes advantage of regularities that arise in real-world datasets and supports different types of aggregation queries. More…

Data Structures and Algorithms · Computer Science 2018-03-08 Nieves R. Brisaboa , Guillermo de Bernardo , Gonzalo Navarro , Tirso V. Rodeiro , Diego Seco

Rep the Set: Neural Networks for Learning Set Representations

In several domains, data objects can be decomposed into sets of simpler objects. It is then natural to represent each object as the set of its components or parts. Many conventional machine learning algorithms are unable to process this…

Machine Learning · Computer Science 2020-03-03 Konstantinos Skianis , Giannis Nikolentzos , Stratis Limnios , Michalis Vazirgiannis

Compressed representation of Learning Spaces

Learning Spaces are certain set systems that are applied in the mathematical modeling of education. We propose a suitable compression (without loss of information) of such set systems to facilitate their logical and statistical analysis.…

Data Structures and Algorithms · Computer Science 2017-08-14 Marcel Wild

Learning to Represent and Predict Sets with Deep Neural Networks

In this thesis, we develop various techniques for working with sets in machine learning. Each input or output is not an image or a sequence, but a set: an unordered collection of multiple objects, each object described by a feature vector.…

Machine Learning · Computer Science 2021-03-09 Yan Zhang

Disentangled Mode-Specific Representations for Tensor Time Series via Contrastive Learning

Multi-mode tensor time series (TTS) can be found in many domains, such as search engines and environmental monitoring systems. Learning representations of a TTS benefits various applications, but it is also challenging since the…

Machine Learning · Computer Science 2026-03-02 Kohei Obata , Taichi Murayama , Zheng Chen , Yasuko Matsubara , Yasushi Sakurai

Learning Representations of Sets through Optimized Permutations

Representations of sets are challenging to learn because operations on sets should be permutation-invariant. To this end, we propose a Permutation-Optimisation module that learns how to permute a set end-to-end. The permuted set can be…

Machine Learning · Computer Science 2019-01-16 Yan Zhang , Jonathon Hare , Adam Prügel-Bennett

Compressed Representations of Conjunctive Query Results

Relational queries, and in particular join queries, often generate large output results when executed over a huge dataset. In such cases, it is often infeasible to store the whole materialized output if we plan to reuse it further down a…

Databases · Computer Science 2018-03-28 Shaleen Deep , Paraschos Koutris

Benefiting from Duplicates of Compressed Data: Shift-Based Holographic Compression of Images

Storage systems often rely on multiple copies of the same compressed data, enabling recovery in case of binary data errors, of course, at the expense of a higher storage cost. In this paper we show that a wiser method of duplication entails…

Multimedia · Computer Science 2019-02-08 Yehuda Dar , Alfred M. Bruckstein

Learned Image Compression for Machine Perception

Recent work has shown that learned image compression strategies can outperform standard hand-crafted compression algorithms that have been developed over decades of intensive research on the rate-distortion trade-off. With growing…

Image and Video Processing · Electrical Eng. & Systems 2021-11-04 Felipe Codevilla , Jean Gabriel Simard , Ross Goroshin , Chris Pal

Compressing Sets and Multisets of Sequences

This article describes lossless compression algorithms for multisets of sequences, taking advantage of the multiset's unordered structure. Multisets are a generalisation of sets where members are allowed to occur multiple times. A multiset…

Information Theory · Computer Science 2014-01-27 Christian Steinruecken

Cluster Specific Representation Learning

Representation learning aims to extract meaningful lower-dimensional embeddings from data, known as representations. Despite its widespread application, there is no established definition of a ``good'' representation. Typically, the…

Machine Learning · Computer Science 2024-12-05 Mahalakshmi Sabanayagam , Omar Al-Dabooni , Pascal Esser

Representation Learning with Multisets

We study the problem of learning permutation invariant representations that can capture "flexible" notions of containment. We formalize this problem via a measure theoretic definition of multisets, and obtain a theoretically-motivated…

Machine Learning · Computer Science 2019-11-21 Vasco Portilheiro

Constrained latent state modeling: A unifying perspective on representation learning under competing constraints

Learning latent representations from complex data is central to modern machine learning, spanning temporal, multimodal, and partially observed systems. In such settings, representations are better understood as latent states capturing…

Machine Learning · Computer Science 2026-05-18 Gwenolé Quellec

Learning Representations by Maximizing Compression

We give an algorithm that learns a representation of data through compression. The algorithm 1) predicts bits sequentially from those previously seen and 2) has a structure and a number of computations similar to an autoencoder. The…

Computer Vision and Pattern Recognition · Computer Science 2011-08-05 Karol Gregor , Yann LeCun

Transformers from Compressed Representations

Compressed file formats are the corner stone of efficient data storage and transmission, yet their potential for representation learning remains largely underexplored. We introduce TEMPEST (TransformErs froM comPressed rEpreSenTations), a…

Machine Learning · Computer Science 2025-10-30 Juan C. Leon Alcazar , Mattia Soldan , Mohammad Saatialsoruji , Alejandro Pardo , Hani Itani , Juan Camilo Perez , Bernard Ghanem

Comprehensive and Efficient Workload Compression

This work studies the problem of constructing a representative workload from a given input analytical query workload where the former serves as an approximation with guarantees of the latter. We discuss our work in the context of workload…

Databases · Computer Science 2021-02-04 Shaleen Deep , Anja Gruenheid , Paraschos Koutris , Jeffrey Naughton , Stratis Viglas

Learning to compress and search visual data in large-scale systems

The problem of high-dimensional and large-scale representation of visual data is addressed from an unsupervised learning perspective. The emphasis is put on discrete representations, where the description length can be measured in bits and…

Machine Learning · Computer Science 2019-01-25 Sohrab Ferdowsi

Maximal Matching Matters: Preventing Representation Collapse for Robust Cross-Modal Retrieval

Cross-modal image-text retrieval is challenging because of the diverse possible associations between content from different modalities. Traditional methods learn a single-vector embedding to represent semantics of each sample, but struggle…

Computer Vision and Pattern Recognition · Computer Science 2025-06-27 Hani Alomari , Anushka Sivakumar , Andrew Zhang , Chris Thomas

Grammar Compressed Sequences with Rank/Select Support

Sequence representations supporting not only direct access to their symbols, but also rank/select operations, are a fundamental building block in many compressed data structures. Several recent applications need to represent highly…

Data Structures and Algorithms · Computer Science 2019-11-25 Alberto Ordóñez , Gonzalo Navarro , Nieves R. Brisaboa

Mixed-integer convex representability

We consider the question of which nonconvex sets can be represented exactly as the feasible sets of mixed-integer convex optimization problems. We state the first complete characterization for the case when the number of possible integer…

Optimization and Control · Mathematics 2017-06-20 Miles Lubin , Ilias Zadik , Juan Pablo Vielma