Related papers: Entity Matching using Large Language Models

Fine-tuning Large Language Models for Entity Matching

Generative large language models (LLMs) are a promising alternative to pre-trained language models for entity matching due to their high zero-shot performance and ability to generalize to unseen entities. Existing research on using LLMs for…

Computation and Language · Computer Science 2025-05-22 Aaron Steiner, Ralph Peeters, Christian Bizer

Leveraging Large Language Models for Entity Matching

Entity matching (EM) is a critical task in data integration, aiming to identify records across different datasets that refer to the same real-world entities. Traditional methods often rely on manually engineered features and rule-based…

Computation and Language · Computer Science 2024-06-03 Qianyu Huang, Tongfang Zhao

Learning from Natural Language Explanations for Generalizable Entity Matching

Entity matching is the task of linking records from different sources that refer to the same real-world entity. Past work has primarily treated entity linking as a standard supervised learning problem. However, supervised entity matching…

Computation and Language · Computer Science 2024-10-01 Somin Wadhwa, Adit Krishnan, Runhui Wang, Byron C. Wallace, Chris Kong

Match, Compare, or Select? An Investigation of Large Language Models for Entity Matching

Entity matching (EM) is a critical step in entity resolution (ER). Recently, entity matching based on large language models (LLMs) has shown great promise. However, current LLM-based entity matching approaches typically follow a binary…

Computation and Language · Computer Science 2024-12-13 Tianshu Wang, Xiaoyang Chen, Hongyu Lin, Xuanang Chen, Xianpei Han, Hao Wang, Zhenyu Zeng, Le Sun

MapperGPT: Large Language Models for Linking and Mapping Entities

Aligning terminological resources, including ontologies, controlled vocabularies, taxonomies, and value sets is a critical part of data integration in many domains such as healthcare, chemistry, and biomedical research. Entity mapping is…

Computation and Language · Computer Science 2023-10-06 Nicolas Matentzoglu, J. Harry Caufield, Harshad B. Hegde, Justin T. Reese, Sierra Moxon, Hyeongsik Kim, Nomi L. Harris, Melissa A Haendel, Christopher J. Mungall

Structured Multi-Step Reasoning for Entity Matching Using Large Language Model

Entity matching is a fundamental task in data cleaning and data integration. With the rapid adoption of large language models (LLMs), recent studies have explored zero-shot and few-shot prompting to improve entity matching accuracy.…

Databases · Computer Science 2025-12-01 Rohan Bopardikar, Jin Wang, Jia Zou

On the Limitations of Large Language Models for Conceptual Database Modeling

This article analyzes the use of Large Language Models (LLMs) as support for the conceptual modeling of relational databases through the automatic generation of Entity-Relationship (ER) diagrams from natural language requirements. The…

Artificial Intelligence · Computer Science 2026-05-13 Arthur F. Siqueira, Carlos D. S. Nogueira, Eduarda Farias, Claudio E. C. Campelo, Júlia Menezes

APrompt4EM: Augmented Prompt Tuning for Generalized Entity Matching

Generalized Entity Matching (GEM), which aims at judging whether two records represented in different formats refer to the same real-world entity, is an essential task in data management. The prompt tuning paradigm for pre-trained language…

Computation and Language · Computer Science 2024-05-09 Yikuan Xia, Jiazun Chen, Xinchi Li, Jun Gao

BoostER: Leveraging Large Language Models for Enhancing Entity Resolution

Entity resolution, which involves identifying and merging records that refer to the same real-world entity, is a crucial task in areas like Web data integration. This importance is underscored by the presence of numerous duplicated and…

Databases · Computer Science 2024-03-12 Huahang Li, Shuangyin Li, Fei Hao, Chen Jason Zhang, Yuanfeng Song, Lei Chen

On Leveraging Large Language Models for Enhancing Entity Resolution: A Cost-efficient Approach

Entity resolution, the task of identifying and merging records that refer to the same real-world entity, is crucial in sectors like e-commerce, healthcare, and law enforcement. Large Language Models (LLMs) introduce an innovative approach…

Computation and Language · Computer Science 2024-09-13 Huahang Li, Longyu Feng, Shuangyin Li, Fei Hao, Chen Jason Zhang, Yuanfeng Song

Cost-Efficient Prompt Engineering for Unsupervised Entity Resolution

Entity Resolution (ER) is the problem of semi-automatically determining when two entities refer to the same underlying entity, with applications ranging from healthcare to e-commerce. Traditional ER solutions required considerable manual…

Artificial Intelligence · Computer Science 2024-04-09 Navapat Nananukul, Khanin Sisaengsuwanchai, Mayank Kejriwal

Unlocking the Power of Large Language Models for Multi-table Entity Matching

Multi-table entity matching (MEM) addresses the limitations of dual-table approaches by enabling simultaneous identification of equivalent entities across multiple data sources without unique identifiers. However, existing methods relying…

Computation and Language · Computer Science 2026-04-24 Yingkai Tang, Taoyu Su, Wenyuan Zhang, Xiaoyang Guo, Tingwen Liu

EntGPT: Entity Linking with Generative Large Language Models

Entity Linking in natural language processing seeks to match text entities to their corresponding entries in a dictionary or knowledge base. Traditional approaches rely on contextual models, which can be complex, hard to train, and have…

Computation and Language · Computer Science 2025-05-23 Yifan Ding, Amrit Poudel, Qingkai Zeng, Tim Weninger, Balaji Veeramani, Sanmitra Bhattacharya

AdapterEM: Pre-trained Language Model Adaptation for Generalized Entity Matching using Adapter-tuning

Entity Matching (EM) involves identifying different data representations referring to the same entity from multiple data sources and is typically formulated as a binary classification problem. It is a challenging problem in data integration…

Computation and Language · Computer Science 2023-05-31 John Bosco Mugeni, Steven Lynden, Toshiyuki Amagasa, Akiyoshi Matono

Using ChatGPT for Entity Matching

Entity Matching is the task of deciding if two entity descriptions refer to the same real-world entity. State-of-the-art entity matching methods often rely on fine-tuning Transformer models such as BERT or RoBERTa. Two major drawbacks of…

Computation and Language · Computer Science 2023-06-23 Ralph Peeters, Christian Bizer

Cost-Effective In-Context Learning for Entity Resolution: A Design Space Exploration

Entity resolution (ER) is an important data integration task with a wide spectrum of applications. The state-of-the-art solutions on ER rely on pre-trained language models (PLMs), which require fine-tuning on a lot of labeled…

Computation and Language · Computer Science 2023-12-08 Meihao Fan, Xiaoyue Han, Ju Fan, Chengliang Chai, Nan Tang, Guoliang Li, Xiaoyong Du

Assessment of Generative Named Entity Recognition in the Era of Large Language Models

Named entity recognition (NER) is evolving from a sequence labeling task into a generative paradigm with the rise of large language models (LLMs). We conduct a systematic evaluation of open-source LLMs on both flat and nested NER tasks. We…

Computation and Language · Computer Science 2026-01-27 Qi Zhan, Yile Wang, Hui Huang

Liberal Entity Matching as a Compound AI Toolchain

Entity matching (EM), the task of identifying whether two descriptions refer to the same entity, is essential in data management. Traditional methods have evolved from rule-based to AI-driven approaches, yet current techniques using large…

Databases · Computer Science 2024-06-18 Silvery D. Fu, David Wang, Wen Zhang, Kathleen Ge

Do Language Models Learn about Legal Entity Types during Pretraining?

Language Models (LMs) have proven their ability to acquire diverse linguistic knowledge during the pretraining phase, potentially serving as a valuable source of incidental supervision for downstream tasks. However, there has been limited…

Computation and Language · Computer Science 2023-10-23 Claire Barale, Michael Rovatsos, Nehal Bhuta

EvEntS ReaLM: Event Reasoning of Entity States via Language Models

This paper investigates models of event implications. Specifically, how well models predict entity state-changes, by targeting their understanding of physical attributes. Nominally, Large Language models (LLM) have been exposed to…

Computation and Language · Computer Science 2022-11-11 Evangelia Spiliopoulou, Artidoro Pagnoni, Yonatan Bisk, Eduard Hovy