Related papers: Fast Factorized Learning: Powered by In-Memory Dat…

200 papers

Factorised databases are relational databases that use compact factorised representations at the physical layer to reduce data redundancy and boost query performance. This paper introduces FDB, an in-memory query engine for…

Databases · Computer Science 2012-03-14 Nurzhan Bakibayev, Dan Olteanu, Jakub Závodný

A common approach to data analysis involves understanding and manipulating succinct representations of data. In earlier work, we put forward a succinct representation system for relational data called factorised databases and reported on…

Databases · Computer Science 2013-07-02 Nurzhan Bakibayev, Tomáš Kočiský, Dan Olteanu, Jakub Závodný

Disentanglement of constituent factors of a sensory signal is central to perception and cognition and hence is a critical task for future artificial intelligence systems. In this paper, we present a compute engine capable of efficiently…

Emerging Technologies · Computer Science 2023-06-07 Jovin Langenegger, Geethan Karunaratne, Michael Hersche, Luca Benini, Abu Sebastian, Abbas Rahimi

In-memory join is an essential operator in any database engine. It has been extensively investigated in the database literature. In this paper, we study whether exploiting the CDF-based learned models to boost the join performance is…

Databases · Computer Science 2022-03-10 Ibrahim Sabek, Tim Kraska

Database management systems (DBMSs) carefully optimize complex multi-join queries to avoid expensive disk I/O. As servers today feature tens or hundreds of gigabytes of RAM, a significant fraction of many analytic databases becomes…

Databases · Computer Science 2015-07-22 Feilong Liu, Spyros Blanas

Real-world AI/ML workflows often apply inference computations to feature vectors joined from multiple datasets. To avoid the redundant AI/ML computations caused by repeated data records in the join's output, factorized ML has been proposed…

Databases · Computer Science 2025-11-26 Kanchan Chowdhury, Lixi Zhou, Lulu Xie, Xinwei Fu, Jia Zou

Joins are among the most time-consuming and data-intensive operations in relational query processing. Much research effort has been applied to the optimization of join processing due to their frequent execution. Recent studies have shown…

Databases · Computer Science 2025-05-26 Yuvaraj Chesetti, Prashant Pandey

Index structures are important for efficient data access, which have been widely used to improve the performance in many in-memory systems. Due to high in-memory overheads, traditional index structures become difficult to process the…

Databases · Computer Science 2019-05-16 Pengfei Li, Yu Hua, Pengfei Zuo, Jingnan Jia

Federated learning is an emerging distributed machine learning framework aiming at protecting data privacy. Data heterogeneity is one of the core challenges in federated learning, which could severely degrade the convergence rate and…

Machine Learning · Statistics 2025-11-27 Feifei Wang, Huiyun Tang, Yang Li

Federated Learning is an emerging distributed collaborative learning paradigm used by many applications nowadays. The effectiveness of federated learning relies on clients' collective efforts and their willingness to contribute local…

Computer Science and Game Theory · Computer Science 2022-05-24 Shuyu Kong, You Li, Hai Zhou

The emerging paradigm of federated learning strives to enable collaborative training of machine learning models on the network edge without centrally aggregating raw data and hence, improving data privacy. This sharply deviates from…

Machine Learning · Computer Science 2019-12-03 Manoj Ghuhan Arivazhagan, Vinay Aggarwal, Aaditya Kumar Singh, Sunav Choudhary

While Deep Learning has demonstrated impressive results in applications on various data types, it continues to lag behind tree-based methods when applied to tabular data, often referred to as the last "unconquered castle" for neural…

Machine Learning · Computer Science 2026-02-27 Marius Dragoi, Florin Gogianu, Elena Burceanu

In the era of big data, the explosive growth of multi-source heterogeneous data offers many exciting challenges and opportunities for improving the inference of conditional average treatment effects. In this paper, we investigate…

Machine Learning · Statistics 2022-11-02 Xinyu Li, Yilin Li, Qing Cui, Longfei Li, Jun Zhou

Federated Learning is a novel paradigm that involves learning from data samples distributed across a large network of clients while the data remains local. It is, however, known that federated learning is prone to multiple system challenges…

Machine Learning · Computer Science 2021-01-01 Amirhossein Reisizadeh, Isidoros Tziotis, Hamed Hassani, Aryan Mokhtari, Ramtin Pedarsani

Federated Learning is a distributed machine learning approach that enables geographically distributed data silos to collaboratively learn a joint machine learning model without sharing data. Most of the existing work operates on…

Machine Learning · Computer Science 2023-05-17 Dimitris Stripelis, Jose Luis Ambite

Despite the outstanding performance of deep neural networks in different applications, they are still computationally expensive and require a large amount of memory. This motivates more research on reducing the resources required for…

Machine Learning · Computer Science 2023-01-09 Alireza Bordbar, Mohammad Hossein Kahaei

While deep learning has achieved phenomenal successes in many AI applications, its enormous model size and intensive computation requirements pose a formidable challenge to the deployment in resource-limited nodes. There has recently been…

Machine Learning · Computer Science 2020-12-01 Sen Lin, Li Yang, Zhezhi He, Deliang Fan, Junshan Zhang

Join order selection plays a significant role in query performance. However, modern query optimizers typically employ static join enumeration algorithms that do not receive any feedback about the quality of the resulting plan. Hence,…

Databases · Computer Science 2018-09-28 Ryan Marcus, Olga Papaemmanouil

Training large-scale recommendation models under a single global objective implicitly assumes homogeneity across user populations. However, real-world data are composites of heterogeneous cohorts with distinct conditional distributions. As…

Integrating deep learning with latent state space models has the potential to yield temporal models that are powerful, yet tractable and interpretable. Unfortunately, current models are not designed to handle missing data or multiple data…

Machine Learning · Computer Science 2019-11-25 Tan Zhi-Xuan, Harold Soh, Desmond C. Ong