Related papers: Matchmaker: Self-Improving Large Language Model Pr…

ReMatch: Retrieval Enhanced Schema Matching with LLMs

Schema matching is a crucial task in data integration, involving the alignment of a source schema with a target schema to establish correspondence between their elements. This task is challenging due to textual and semantic heterogeneity,…

Databases · Computer Science 2024-05-31 Eitam Sheetrit , Menachem Brief , Moshik Mishaeli , Oren Elisha

Schema Matching using Machine Learning

Schema Matching is a method of finding attributes that are either similar to each other linguistically or represent the same information. In this project, we take a hybrid approach at solving this problem by making use of both the provided…

Databases · Computer Science 2020-04-22 Tanvi Sahay , Ankita Mehta , Shruti Jadon

Schemora: schema matching via multi-stage recommendation and metadata enrichment using off-the-shelf llms

Schema matching is essential for integrating heterogeneous data sources and enhancing dataset discovery, yet it remains a complex and resource-intensive problem. We introduce SCHEMORA, a schema matching framework that combines large…

Databases · Computer Science 2025-07-22 Osman Erman Gungor , Derak Paulsen , William Kang

Schema Matching with Large Language Models: an Experimental Study

Large Language Models (LLMs) have shown useful applications in a variety of tasks, including data wrangling. In this paper, we investigate the use of an off-the-shelf LLM for schema matching. Our objective is to identify semantic…

Databases · Computer Science 2024-07-17 Marcel Parciak , Brecht Vandevoort , Frank Neven , Liesbet M. Peeters , Stijn Vansummeren

Prompt-Matcher: Leveraging Large Models to Reduce Uncertainty in Schema Matching Results

Schema matching is the process of identifying correspondences between the elements of two given schemata, essential for database management systems, data integration, and data warehousing. For datasets across different scenarios, the…

Databases · Computer Science 2025-03-07 Longyu Feng , Huahang Li , Chen Jason Zhang

Rule-based Construction of Matching Processes

Mapping complex metadata structures is crucial in a number of domains such as data integration, ontology alignment or model management. To speed up that process automatic matching systems were developed to compute mapping suggestions that…

Databases · Computer Science 2011-08-10 Eric Peukert , Julian Eberius , Erhard Rahm

Magneto: Combining Small and Large Language Models for Schema Matching

Recent advances in language models opened new opportunities to address complex schema matching tasks. Schema matching approaches have been proposed that demonstrate the usefulness of language models, but they have also uncovered important…

Databases · Computer Science 2025-06-18 Yurong Liu , Eduardo Pena , Aecio Santos , Eden Wu , Juliana Freire

LLMATCH: A Unified Schema Matching Framework with Large Language Models

Schema matching is a foundational task in enterprise data integration, aiming to align disparate data sources. While traditional methods handle simple one-to-one table mappings, they often struggle with complex multi-table schema matching…

Databases · Computer Science 2025-07-16 Sha Wang , Yuchen Li , Hanhua Xiao , Bing Tian Dai , Roy Ka-Wei Lee , Yanfei Dong , Lambert Deng

The Role of Schema Matching in Large Enterprises

To date, the principal use case for schema matching research has been as a precursor for code generation, i.e., constructing mappings between schema elements with the end goal of data transfer. In this paper, we argue that schema matching…

Databases · Computer Science 2009-09-15 Ken Smith , Michael Morse , Peter Mork , Maya Li , Arnon Rosenthal , David Allen , Len Seligman , Chris Wolf

Matchmaker: An Open-source Library for Real-time Piano Score Following and Systematic Evaluation

Real-time music alignment, also known as score following, is a fundamental MIR task with a long history and is essential for many interactive applications. Despite its importance, there has not been a unified open framework for comparing…

Sound · Computer Science 2025-10-14 Jiyun Park , Carlos Cancino-Chacón , Suhit Chiruthapudi , Juhan Nam

Zero-Shot Clinical Trial Patient Matching with LLMs

Matching patients to clinical trials is a key unsolved challenge in bringing new drugs to market. Today, identifying patients who meet a trial's eligibility criteria is highly manual, taking up to 1 hour per patient. Automated screening is…

Computation and Language · Computer Science 2024-04-11 Michael Wornow , Alejandro Lozano , Dev Dash , Jenelle Jindal , Kenneth W. Mahaffey , Nigam H. Shah

Improving LLM-based Ontology Matching with fine-tuning on synthetic data

Large Language Models (LLMs) are increasingly being integrated into various components of Ontology Matching pipelines. This paper investigates the capability of LLMs to perform ontology matching directly on ontology modules and generate the…

Computation and Language · Computer Science 2025-12-01 Guilherme Sousa , Rinaldo Lima , Cassia Trojahn

Aligning Black-box Language Models with Human Judgments

Large language models (LLMs) are increasingly used as automated judges to evaluate recommendation systems, search engines, and other subjective tasks, where relying on human evaluators can be costly, time-consuming, and unscalable. LLMs…

Computation and Language · Computer Science 2025-02-10 Gerrit J. J. van den Burg , Gen Suzuki , Wei Liu , Murat Sensoy

LOUC: Leave-One-Out-Calibration Measure for Analyzing Human Matcher Performance

Schema matching is a core data integration task, focusing on identifying correspondences among attributes of multiple schemata. Numerous algorithmic approaches were suggested for schema matching over the years, aiming at solving the task…

Databases · Computer Science 2023-08-04 Matan Solomon , Bar Genossar , Roee Shraga , Avigdor Gal

GRAM: Generative Retrieval Augmented Matching of Data Schemas in the Context of Data Security

Schema matching constitutes a pivotal phase in the data ingestion process for contemporary database systems. Its objective is to discern pairwise similarities between two sets of attributes, each associated with a distinct data table. This…

Databases · Computer Science 2024-06-05 Xuanqing Liu , Luyang Kong , Runhui Wang , Patrick Song , Austin Nevins , Henrik Johnson , Nimish Amlathe , Davor Golac

XML Matchers: approaches and challenges

Schema Matching, i.e. the process of discovering semantic correspondences between concepts adopted in different data source schemas, has been a key topic in Database and Artificial Intelligence research areas for many years. In the past, it…

Databases · Computer Science 2014-07-11 Santa Agreste , Pasquale De Meo , Emilio Ferrara , Domenico Ursino

Towards Scalable Schema Mapping using Large Language Models

The growing need to integrate information from a large number of diverse sources poses significant scalability challenges for data integration systems. These systems often rely on manually written schema mappings, which are complex,…

Databases · Computer Science 2025-06-02 Christopher Buss , Mahdis Safari , Arash Termehchy , Stefan Lee , David Maier

Structured Multi-Step Reasoning for Entity Matching Using Large Language Model

Entity matching is a fundamental task in data cleaning and data integration. With the rapid adoption of large language models (LLMs), recent studies have explored zero-shot and few-shot prompting to improve entity matching accuracy.…

Databases · Computer Science 2025-12-01 Rohan Bopardikar , Jin Wang , Jia Zou

MatchBench: An Evaluation of Feature Matchers

Feature matching is one of the most fundamental and active research areas in computer vision. A comprehensive evaluation of feature matchers is necessary, since it would advance both the development of this field and also high-level…

Computer Vision and Pattern Recognition · Computer Science 2018-08-08 JiaWang Bian , Ruihan Yang , Yun Liu , Le Zhang , Ming-Ming Cheng , Ian Reid , WenHai Wu

PoWareMatch: a Quality-aware Deep Learning Approach to Improve Human Schema Matching

Schema matching is a core task of any data integration process. Being investigated in the fields of databases, AI, Semantic Web and data mining for many years, the main challenge remains the ability to generate quality matches among data…

Databases · Computer Science 2021-09-16 Roee Shraga , Avigdor Gal