English
Related papers

Related papers: Entity Matching using Large Language Models

200 papers

Generative large language models (LLMs) are a promising alternative to pre-trained language models for entity matching due to their high zero-shot performance and ability to generalize to unseen entities. Existing research on using LLMs for…

Computation and Language · Computer Science 2025-05-22 Aaron Steiner , Ralph Peeters , Christian Bizer

Entity matching (EM) is a critical task in data integration, aiming to identify records across different datasets that refer to the same real-world entities. Traditional methods often rely on manually engineered features and rule-based…

Computation and Language · Computer Science 2024-06-03 Qianyu Huang , Tongfang Zhao

Entity matching is the task of linking records from different sources that refer to the same real-world entity. Past work has primarily treated entity linking as a standard supervised learning problem. However, supervised entity matching…

Computation and Language · Computer Science 2024-10-01 Somin Wadhwa , Adit Krishnan , Runhui Wang , Byron C. Wallace , Chris Kong

Entity matching (EM) is a critical step in entity resolution (ER). Recently, entity matching based on large language models (LLMs) has shown great promise. However, current LLM-based entity matching approaches typically follow a binary…

Computation and Language · Computer Science 2024-12-13 Tianshu Wang , Xiaoyang Chen , Hongyu Lin , Xuanang Chen , Xianpei Han , Hao Wang , Zhenyu Zeng , Le Sun

Aligning terminological resources, including ontologies, controlled vocabularies, taxonomies, and value sets is a critical part of data integration in many domains such as healthcare, chemistry, and biomedical research. Entity mapping is…

Entity matching is a fundamental task in data cleaning and data integration. With the rapid adoption of large language models (LLMs), recent studies have explored zero-shot and few-shot prompting to improve entity matching accuracy.…

Databases · Computer Science 2025-12-01 Rohan Bopardikar , Jin Wang , Jia Zou

This article analyzes the use of Large Language Models (LLMs) as support for the conceptual modeling of relational databases through the automatic generation of Entity-Relationship (ER) diagrams from natural language requirements. The…

Artificial Intelligence · Computer Science 2026-05-13 Arthur F. Siqueira , Carlos D. S. Nogueira , Eduarda Farias , Claudio E. C. Campelo , Júlia Menezes

Generalized Entity Matching (GEM), which aims at judging whether two records represented in different formats refer to the same real-world entity, is an essential task in data management. The prompt tuning paradigm for pre-trained language…

Computation and Language · Computer Science 2024-05-09 Yikuan Xia , Jiazun Chen , Xinchi Li , Jun Gao

Entity resolution, which involves identifying and merging records that refer to the same real-world entity, is a crucial task in areas like Web data integration. This importance is underscored by the presence of numerous duplicated and…

Databases · Computer Science 2024-03-12 Huahang Li , Shuangyin Li , Fei Hao , Chen Jason Zhang , Yuanfeng Song , Lei Chen

Entity resolution, the task of identifying and merging records that refer to the same real-world entity, is crucial in sectors like e-commerce, healthcare, and law enforcement. Large Language Models (LLMs) introduce an innovative approach…

Computation and Language · Computer Science 2024-09-13 Huahang Li , Longyu Feng , Shuangyin Li , Fei Hao , Chen Jason Zhang , Yuanfeng Song

Entity Resolution (ER) is the problem of semi-automatically determining when two entities refer to the same underlying entity, with applications ranging from healthcare to e-commerce. Traditional ER solutions required considerable manual…

Artificial Intelligence · Computer Science 2024-04-09 Navapat Nananukul , Khanin Sisaengsuwanchai , Mayank Kejriwal

Multi-table entity matching (MEM) addresses the limitations of dual-table approaches by enabling simultaneous identification of equivalent entities across multiple data sources without unique identifiers. However, existing methods relying…

Computation and Language · Computer Science 2026-04-24 Yingkai Tang , Taoyu Su , Wenyuan Zhang , Xiaoyang Guo , Tingwen Liu

Entity Linking in natural language processing seeks to match text entities to their corresponding entries in a dictionary or knowledge base. Traditional approaches rely on contextual models, which can be complex, hard to train, and have…

Computation and Language · Computer Science 2025-05-23 Yifan Ding , Amrit Poudel , Qingkai Zeng , Tim Weninger , Balaji Veeramani , Sanmitra Bhattacharya

Entity Matching (EM) involves identifying different data representations referring to the same entity from multiple data sources and is typically formulated as a binary classification problem. It is a challenging problem in data integration…

Computation and Language · Computer Science 2023-05-31 John Bosco Mugeni , Steven Lynden , Toshiyuki Amagasa , Akiyoshi Matono

Entity Matching is the task of deciding if two entity descriptions refer to the same real-world entity. State-of-the-art entity matching methods often rely on fine-tuning Transformer models such as BERT or RoBERTa. Two major drawbacks of…

Computation and Language · Computer Science 2023-06-23 Ralph Peeters , Christian Bizer

Entity resolution (ER) is an important data integration task with a wide spectrum of applications. The state-of-the-art solutions on ER rely on pre-trained language models (PLMs), which require fine-tuning on a lot of labeled…

Computation and Language · Computer Science 2023-12-08 Meihao Fan , Xiaoyue Han , Ju Fan , Chengliang Chai , Nan Tang , Guoliang Li , Xiaoyong Du

Named entity recognition (NER) is evolving from a sequence labeling task into a generative paradigm with the rise of large language models (LLMs). We conduct a systematic evaluation of open-source LLMs on both flat and nested NER tasks. We…

Computation and Language · Computer Science 2026-01-27 Qi Zhan , Yile Wang , Hui Huang

Entity matching (EM), the task of identifying whether two descriptions refer to the same entity, is essential in data management. Traditional methods have evolved from rule-based to AI-driven approaches, yet current techniques using large…

Databases · Computer Science 2024-06-18 Silvery D. Fu , David Wang , Wen Zhang , Kathleen Ge

Language Models (LMs) have proven their ability to acquire diverse linguistic knowledge during the pretraining phase, potentially serving as a valuable source of incidental supervision for downstream tasks. However, there has been limited…

Computation and Language · Computer Science 2023-10-23 Claire Barale , Michael Rovatsos , Nehal Bhuta

This paper investigates models of event implications. Specifically, how well models predict entity state-changes, by targeting their understanding of physical attributes. Nominally, Large Language models (LLM) have been exposed to…

Computation and Language · Computer Science 2022-11-11 Evangelia Spiliopoulou , Artidoro Pagnoni , Yonatan Bisk , Eduard Hovy
‹ Prev 1 2 3 10 Next ›