Related papers: XML Entity Architecture for Efficient Software Int…

Scalable XSLT Evaluation

XSLT is an increasingly popular language for processing XML data. It is widely supported by application platform software. However, little optimization effort has been made inside the current XSLT processing engines. Evaluating a very…

Databases · Computer Science 2007-05-23 Zhimao Guo , Min Li , Xiaoling Wang , Aoying Zhou

Mapping XML Data to Relational Data: A DOM-Based Approach

XML has emerged as the standard for representing and exchanging data on the World Wide Web. It is critical to have efficient mechanisms to store and query XML data to exploit the full power of this new technology. Several researchers have…

Databases · Computer Science 2010-10-11 Mustafa Atay , Yezhou Sun , Dapeng Liu , Shiyong Lu , Farshad Fotouhi

Query Performance Optimization in XML Data Warehouses

XML data warehouses form an interesting basis for decision-support applications that exploit complex data. However, native-XML database management systems (DBMSs) currently bear limited performances and it is necessary to research for ways…

Databases · Computer Science 2017-01-30 Hadj Mahboubi , Jérôme Darmont

Enhancing XML Data Warehouse Query Performance by Fragmentation

XML data warehouses form an interesting basis for decision-support applications that exploit heterogeneous data from multiple sources. However, XML-native database systems currently suffer from limited performances in terms of manageable…

Databases · Computer Science 2009-08-28 Hadj Mahboubi , Jérôme Darmont

Un index de jointure pour les entrep\^ots de donn\'ees XML

XML data warehouses form an interesting basis for decision-support applications that exploit heterogeneous data from multiple sources. However, XML-native database systems currently bear limited performances and it is necessary to research…

Databases · Computer Science 2007-07-10 Hadj Mahboubi , Kamel Aouiche , Jérôme Darmont

Ensuring Query Compatibility with Evolving XML Schemas

During the life cycle of an XML application, both schemas and queries may change from one version to another. Schema evolutions may affect query results and potentially the validity of produced data. Nowadays, a challenge is to assess and…

Programming Languages · Computer Science 2008-11-27 Pierre Genevès , Nabil Layaïda , Vincent Quint

XML Schema-based Minification for Communication of Security Information and Event Management (SIEM) Systems in Cloud Environments

XML-based communication governs most of today's systems communication, due to its capability of representing complex structural and hierarchical data. However, XML document structure is considered a huge and bulky data that can be reduced…

Distributed, Parallel, and Cluster Computing · Computer Science 2014-10-10 Bishoy Moussa , Mahmoud Mostafa , Mahmoud El-Khouly

A general XML-based distributed software architecture for accessing and sharing ressources

This paper presents a general xml-based distributed software architecture in the aim of accessing and sharing resources in an opened client/server environment. The paper is organized as follows : First, we introduce the idea of a "General…

Software Engineering · Computer Science 2009-09-14 Samuel Cruz-Lara , Patrice Bonhomme , Christophe De Saint-Rat , Laurent Romary

XML Multidimensional Modelling and Querying

As XML becomes ubiquitous and XML storage and processing becomes more efficient, the range of use cases for these technologies widens daily. One promising area is the integration of XML and data warehouses, where an XML-native database…

Databases · Computer Science 2009-12-08 Serge Boucher , Boris Verhaegen , Esteban Zimányi

Structured Multi-Step Reasoning for Entity Matching Using Large Language Model

Entity matching is a fundamental task in data cleaning and data integration. With the rapid adoption of large language models (LLMs), recent studies have explored zero-shot and few-shot prompting to improve entity matching accuracy.…

Databases · Computer Science 2025-12-01 Rohan Bopardikar , Jin Wang , Jia Zou

A distributed editing environment for XML documents

XML is based on two essential aspects: the modelization of data in a tree like structure and the separation between the information itself and the way it is displayed. XML structures are easily serializable. The separation between an…

Software Engineering · Computer Science 2009-02-19 Claude Pasquier , Laurent Théry

Towards Scalable Schema Mapping using Large Language Models

The growing need to integrate information from a large number of diverse sources poses significant scalability challenges for data integration systems. These systems often rely on manually written schema mappings, which are complex,…

Databases · Computer Science 2025-06-02 Christopher Buss , Mahdis Safari , Arash Termehchy , Stefan Lee , David Maier

A Join Index for XML Data Warehouses

XML data warehouses form an interesting basis for decision-support applications that exploit complex data. However, native-XML database management systems (DBMSs) currently bear limited performances and it is necessary to research for ways…

Databases · Computer Science 2008-09-12 Hadj Mahboubi , Kamel Aouiche , Jérôme Darmont

Affinity-based XML Fragmentation

In this paper we tackle the fragmentation problem for highly distributed databases. In such an environment, a suitable fragmentation strategy may provide scalability and availability by minimizing distributed transactions. We propose an…

Databases · Computer Science 2013-04-25 Rebeca Schroeder , Ronaldo Santos Mello , Carmem Satie Hara

An Ensemble Embedding Approach for Improving Semantic Caching Performance in LLM-based Systems

Semantic caching enhances the efficiency of large language model (LLM) systems by identifying semantically similar queries, storing responses once, and serving them for subsequent equivalent requests. However, existing semantic caching…

Machine Learning · Computer Science 2025-07-10 Shervin Ghaffari , Zohre Bahranifard , Mohammad Akbari

Fine-tuning Large Language Models for Entity Matching

Generative large language models (LLMs) are a promising alternative to pre-trained language models for entity matching due to their high zero-shot performance and ability to generalize to unseen entities. Existing research on using LLMs for…

Computation and Language · Computer Science 2025-05-22 Aaron Steiner , Ralph Peeters , Christian Bizer

Modeling and visualizing networked multi-core embedded software energy consumption

In this report we present a network-level multi-core energy model and a software development process workflow that allows software developers to estimate the energy consumption of multi-core embedded programs. This work focuses on a high…

Distributed, Parallel, and Cluster Computing · Computer Science 2015-09-10 Steve Kerrison , Kerstin Eder

Data Partitioning for Parallel Entity Matching

Entity matching is an important and difficult step for integrating web data. To reduce the typically high execution time for matching we investigate how we can perform entity matching in parallel on a distributed infrastructure. We propose…

Distributed, Parallel, and Cluster Computing · Computer Science 2010-06-29 Toralf Kirsten , Lars Kolb , Michael Hartung , Anika Groß , Hanna Köpcke , Erhard Rahm

Unlocking the Power of Large Language Models for Multi-table Entity Matching

Multi-table entity matching (MEM) addresses the limitations of dual-table approaches by enabling simultaneous identification of equivalent entities across multiple data sources without unique identifiers. However, existing methods relying…

Computation and Language · Computer Science 2026-04-24 Yingkai Tang , Taoyu Su , Wenyuan Zhang , Xiaoyang Guo , Tingwen Liu

Optimizing ETL Dataflow Using Shared Caching and Parallelization Methods

Extract-Transform-Load (ETL) handles large amount of data and manages workload through dataflows. ETL dataflows are widely regarded as complex and expensive operations in terms of time and system resources. In order to minimize the time and…

Databases · Computer Science 2014-09-08 Xiufeng Liu