Related papers: Learned Static Function Data Structures

Learned Data Compression: Challenges and Opportunities for the Future

Compressing integer keys is a fundamental operation among multiple communities, such as database management (DB), information retrieval (IR), and high-performance computing (HPC). Recent advances in \emph{learned indexes} have inspired the…

Databases · Computer Science 2024-12-17 Qiyu Liu , Siyuan Han , Jianwei Liao , Jin Li , Jingshu Peng , Jun Du , Lei Chen

Data Compression with Stochastic Codes

Machine learning has had a major impact on data compression over the last decade and inspired many new, exciting theoretical and applied questions. This paper describes one such direction -- relative entropy coding -- which focuses on…

Information Theory · Computer Science 2026-02-10 Gergely Flamich , Deniz Gündüz

Recognizing Variables from their Data via Deep Embeddings of Distributions

A key obstacle in automated analytics and meta-learning is the inability to recognize when different datasets contain measurements of the same variable. Because provided attribute labels are often uninformative in practice, this task may be…

Machine Learning · Computer Science 2019-09-12 Jonas Mueller , Alex Smola

Learning Barrier Functions for Constrained Motion Planning with Dynamical Systems

Stable dynamical systems are a flexible tool to plan robotic motions in real-time. In the robotic literature, dynamical system motions are typically planned without considering possible limitations in the robot's workspace. This work…

Robotics · Computer Science 2020-03-26 Matteo Saveriano , Dongheui Lee

The Potential of Learned Index Structures for Index Compression

Inverted indexes are vital in providing fast key-word-based search. For every term in the document collection, a list of identifiers of documents in which the term appears is stored, along with auxiliary information such as term frequency,…

Information Retrieval · Computer Science 2019-01-30 Harrie Oosterhuis , J. Shane Culpepper , Maarten de Rijke

Privacy-Preserving Learning-Augmented Data Structures

Learning-augmented data structures use predicted frequency estimates to retrieve frequently occurring database elements faster than standard data structures. Recent work has developed data structures that optimally exploit these frequency…

Information Retrieval · Computer Science 2025-10-02 Prabhav Goyal , Vinesh Sridhar , Wilson Zheng

Neural Statistical Functions

Classical deep learning typically operates on individual cases. Despite its success, real-world usage often requires repeated inference to estimate statistical quantities for complex decision-making tasks involving uncertainty or…

Machine Learning · Computer Science 2026-05-13 Daniel Xu , Yuxin Xie , Minghao Guo , Haixu Wu , Wojciech Matusik

DeepMapping: Learned Data Mapping for Lossless Compression and Efficient Lookup

Storing tabular data to balance storage and query efficiency is a long-standing research question in the database community. In this work, we argue and show that a novel DeepMapping abstraction, which relies on the impressive memorization…

Databases · Computer Science 2024-09-27 Lixi Zhou , K. Selçuk Candan , Jia Zou

Structured Functional Principal Component Analysis

Motivated by modern observational studies, we introduce a class of functional models that expands nested and crossed designs. These models account for the natural inheritance of correlation structure from sampling design in studies where…

Applications · Statistics 2013-04-26 Haochang Shou , Vadim Zipunnikov , Ciprian M. Crainiceanu , Sonja Greven

The Stochastic Replica Approach to Machine Learning: Stability and Parameter Optimization

We introduce a statistical physics inspired supervised machine learning algorithm for classification and regression problems. The method is based on the invariances or stability of predicted results when known data is represented as…

Machine Learning · Statistics 2018-11-19 Patrick Chao , Tahereh Mazaheri , Bo Sun , Nicholas B. Weingartner , Zohar Nussinov

Measuring the Stability of Learned Features

Many modern datasets don't fit neatly into $n \times p$ matrices, but most techniques for measuring statistical stability expect rectangular data. We study methods for stability assessment on non-rectangular data, using statistical learning…

Computation · Statistics 2021-02-23 Kris Sankaran

Structured Learning via Logistic Regression

A successful approach to structured learning is to write the learning objective as a joint function of linear parameters and inference messages, and iterate between updates to each. This paper observes that if the inference problem is…

Machine Learning · Computer Science 2014-07-04 Justin Domke

A Semantic Matching Energy Function for Learning with Multi-relational Data

Large-scale relational learning becomes crucial for handling the huge amounts of structured data generated daily in many application domains ranging from computational biology or information retrieval, to natural language processing. In…

Machine Learning · Computer Science 2013-03-22 Xavier Glorot , Antoine Bordes , Jason Weston , Yoshua Bengio

Holographic Embeddings of Knowledge Graphs

Learning embeddings of entities and relations is an efficient and versatile method to perform machine learning on relational data such as knowledge graphs. In this work, we propose holographic embeddings (HolE) to learn compositional vector…

Artificial Intelligence · Computer Science 2015-12-08 Maximilian Nickel , Lorenzo Rosasco , Tomaso Poggio

Querying in Constant Expected Time with Learned Indexes

Learned indexes leverage machine learning models to accelerate query answering in databases, showing impressive practical performance. However, theoretical understanding of these methods remains incomplete. Existing research suggests that…

Databases · Computer Science 2024-10-23 Luis Croquevielle , Guang Yang , Liang Liang , Ali Hadian , Thomas Heinis

Learning a Static Analyzer from Data

To be practically useful, modern static analyzers must precisely model the effect of both, statements in the programming language as well as frameworks used by the program under analysis. While important, manually addressing these…

Programming Languages · Computer Science 2017-06-27 Pavol Bielik , Veselin Raychev , Martin Vechev

Learning Ordered Representations with Nested Dropout

In this paper, we study ordered representations of data in which different dimensions have different degrees of importance. To learn these representations we introduce nested dropout, a procedure for stochastically removing coherent nested…

Machine Learning · Statistics 2014-02-06 Oren Rippel , Michael A. Gelbart , Ryan P. Adams

Statistical physics of directional, stochastic chains with memory

Stochastic chains represent a wide and key variety of phenomena in many branches of science within the context of Information Theory and Thermodynamics. They are typically approached by a sequence of independent events or by a memoryless…

Statistical Mechanics · Physics 2017-03-06 J. Ricardo Arias-Gonzalez

Dynamic Tree Databases in Automated Planning

A central challenge in scaling up explicit state-space search for large tasks is compactly representing the set of generated states. Tree databases, a data structure from model checking, require constant space per generated state in the…

Artificial Intelligence · Computer Science 2025-11-18 Oliver Joergensen , Dominik Drexler , Jendrik Seipp

Practical Concurrent Priority Queues

Priority queues are abstract data structures which store a set of key/value pairs and allow efficient access to the item with the minimal (maximal) key. Such queues are an important element in various areas of computer science such as…

Data Structures and Algorithms · Computer Science 2015-09-24 Jakob Gruber