English

Warehousing complex data from the Web

Databases 2017-01-03 v1

Abstract

The data warehousing and OLAP technologies are now moving onto handling complex data that mostly originate from the Web. However, intagrating such data into a decision-support process requires their representation under a form processable by OLAP and/or data mining techniques. We present in this paper a complex data warehousing methodology that exploits XML as a pivot language. Our approach includes the integration of complex data in an ODS, under the form of XML documents; their dimensional modeling and storage in an XML data warehouse; and their analysis with combined OLAP and data mining techniques. We also address the crucial issue of performance in XML warehouses.

Keywords

Cite

@article{arxiv.1701.00398,
  title  = {Warehousing complex data from the Web},
  author = {Omar Boussaid and Jerome Darmont and Fadila Bentayeb and Sabine Loudcher},
  journal= {arXiv preprint arXiv:1701.00398},
  year   = {2017}
}