Related papers: Benchmarking Blocking Algorithms for Web Entities

200 papers

Entity Resolution, also called record linkage or deduplication, refers to the process of identifying and merging duplicate versions of the same entity into a unified representation. The standard practice is to use a Rule based or Machine…

Artificial Intelligence · Computer Science 2016-09-22 Janani Balaji, Faizan Javed, Mayank Kejriwal, Chris Min, Sam Sander, Ozgur Ozturk

Entity Matching (EM) is crucial for identifying equivalent data entities across different sources, a task that becomes increasingly challenging with the growth and heterogeneity of data. Blocking techniques, which reduce the computational…

Machine Learning · Computer Science 2024-09-26 Mohammad Hossein Moslemi, Harini Balamurugan, Mostafa Milani

One of the most important tasks for improving data quality and the reliability of data analytics results is Entity Resolution (ER). ER aims to identify different descriptions that refer to the same real-world entity, and remains a…

Entity Resolution (ER) aims to identify different descriptions in various Knowledge Bases (KBs) that refer to the same entity. ER is challenged by the Variety, Volume and Veracity of entity descriptions published in the Web of Data. To…

Databases · Computer Science 2019-05-16 Vasilis Efthymiou, George Papadakis, Kostas Stefanidis, Vassilis Christophides

Entity resolution (ER) is the process of identifying records that refer to the same entities within one or across multiple databases. Numerous techniques have been developed to tackle ER challenges over the years, with recent emphasis…

Databases · Computer Science 2023-11-14 George Papadakis, Nishadi Kirielle, Peter Christen, Themis Palpanas

Entity resolution (ER), an important and common data cleaning problem, is about detecting data duplicate representations for the same external entities, and merging them into single representations. Relatively recently, declarative rules…

Databases · Computer Science 2017-01-19 Zeinab Bahmani, Leopoldo Bertossi, Nikolaos Vasiloglou

Entity Resolution suffers from quadratic time complexity. To increase its time efficiency, three kinds of filtering techniques are typically used for restricting its search space: (i) blocking workflows, which group together entity profiles…

Entity matching (EM) is the most critical step for entity resolution (ER). While current deep learning-based methods achieve very impressive performance on standard EM benchmarks, their real-world application performance is much frustrating.…

Computation and Language · Computer Science 2022-05-13 Tianshu Wang, Hongyu Lin, Cheng Fu, Xianpei Han, Le Sun, Feiyu Xiong, Hui Chen, Minlong Lu, Xiuwen Zhu

Efficiency techniques are an integral part of Entity Resolution, since its infancy. In this survey, we organized the bulk of works in the field into Blocking, Filtering and hybrid techniques, facilitating their understanding and use. We…

Databases · Computer Science 2020-08-24 George Papadakis, Dimitrios Skoutas, Emmanouil Thanos, Themis Palpanas

Entity resolution (ER), an important and common data cleaning problem, is about detecting data duplicate representations for the same external entities, and merging them into single representations. Relatively recently, declarative rules…

Databases · Computer Science 2016-02-09 Zeinab Bahmani, Leopoldo Bertossi, Nikolaos Vasiloglou

The same real-world entity (e.g., a movie, a restaurant, a person) may be described in various ways on different datasets. Entity Resolution (ER) aims to find such different descriptions of the same entity, this way improving data quality…

Databases · Computer Science 2025-03-18 Konstantinos Nikoletos, Vasilis Efthymiou, George Papadakis, Kostas Stefanidis

Entity Resolution (ER) is the task of finding entity profiles that correspond to the same real-world entity. Progressive ER aims to efficiently resolve large datasets when limited time and/or computational resources are available. In…

Databases · Computer Science 2019-05-17 Giovanni Simonini, George Papadakis, Themis Palpanas, Sonia Bergamaschi

Entity resolution (ER) is the task of identifying records belonging to the same entity (e.g. individual, group) across one or multiple databases. Ironically, it has multiple names: deduplication and record linkage, among others. In this…

Databases · Computer Science 2015-09-15 Matt Barnes

Many recent works on Entity Resolution (ER) leverage Deep Learning techniques involving language models to improve effectiveness. This is applied to both main steps of ER, i.e., blocking and matching. Several pre-trained embeddings have…

Databases · Computer Science 2023-04-26 Alexandros Zeakis, George Papadakis, Dimitrios Skoutas, Manolis Koubarakis

Entity resolution (ER) is the problem of identifying and merging records that refer to the same real-world entity. In many scenarios, raw records are stored under heterogeneous environment. Specifically, the schemas of records may differ…

Databases · Computer Science 2016-11-01 Yiming Lin, Hongzhi Wang, Jianzhong Li, Hong Gao

Entity Resolution (ER) is typically implemented as a batch task that processes all available data before identifying duplicate records. However, applications with time or computational constraints, e.g., those running in the cloud, require…

Databases · Computer Science 2025-03-12 Jakub Maciejewski, Konstantinos Nikoletos, George Papadakis, Yannis Velegrakis

Entity matching seeks to identify data records over one or multiple data sources that refer to the same real-world entity. Virtually every entity matching task on large datasets requires blocking, a step that reduces the number of record…

Databases · Computer Science 2019-12-10 Wei Zhang, Hao Wei, Bunyamin Sisman, Xin Luna Dong, Christos Faloutsos, David Page

Entity resolution (probabilistic record linkage, deduplication) is a key step in scientific analysis and data science pipelines involving multiple data sources. The objective of entity resolution is to link records without common unique…

Applications · Statistics 2026-01-22 Tymoteusz Strojny, Maciej Beręsewicz

Entity matching (EM) is a critical step in entity resolution (ER). Recently, entity matching based on large language models (LLMs) has shown great promise. However, current LLM-based entity matching approaches typically follow a binary…

Computation and Language · Computer Science 2024-12-13 Tianshu Wang, Xiaoyang Chen, Hongyu Lin, Xuanang Chen, Xianpei Han, Hao Wang, Zhenyu Zeng, Le Sun

Entity linking (EL) is the task of automatically identifying entity mentions in text and resolving them to a corresponding entity in a reference knowledge base like Wikipedia. Throughout the past decade, a plethora of EL systems and…

Computation and Language · Computer Science 2021-01-15 Renato Stoffalette João, Pavlos Fafalios, Stefan Dietze