Popularity Driven Data Integration
Abstract
With the growing focus on large-scale analytics, we are increasingly confronted with the need to integrate data from multiple sources. The problem is that these data are impossible to reuse as-is. The net result is high cost, with the further drawback that the resulting integrated data will again be hard to reuse as-is. iTelos is a general-purpose methodology that aims to minimize the effects of this process. The intuition is that data will be treated differently based on their popularity: the more a given set of data has been reused, the more it will be reused and the less it will be changed across reuses, thus decreasing overall data preprocessing costs while increasing backward compatibility and future sharing.
Cite
@article{arxiv.2209.14049,
title = {Popularity Driven Data Integration},
author = {Fausto Giunchiglia and Simone Bocca and Mattia Fumagalli and Mayukh Bagchi and Alessio Zamboni},
journal = {arXiv preprint arXiv:2209.14049},
year = {2022}
}
Comments
KGSWC 2022. Fourth Ibero-American Knowledge Graph and Semantic Web Conference, joint with the Third Indo-American Knowledge Graph and Semantic Web Conference, 21-23 November 2022, Universidad Camilo José Cela, Madrid, Spain. arXiv admin note: substantial text overlap with arXiv:2105.09418