English
Related papers

Related papers: Query-Driven Sampling for Collective Entity Resolu…

200 papers

Entity resolution (ER) is the task of identifying all records in a database that refer to the same underlying entity, and are therefore duplicates of each other. Due to inherent ambiguity of data representation and poor data quality, ER is…

Databases · Computer Science 2017-02-07 Arya Mazumdar , Barna Saha

Knowledge bases (KBs) store rich yet heterogeneous entities and facts. Entity resolution (ER) aims to identify entities in KBs which refer to the same real-world object. Recent studies have shown significant benefits of involving humans in…

Databases · Computer Science 2020-02-24 Jiacheng Huang , Wei Hu , Zhifeng Bao , Yuzhong Qu

Entity Resolution (ER) is typically implemented as a batch task that processes all available data before identifying duplicate records. However, applications with time or computational constraints, e.g., those running in the cloud, require…

Databases · Computer Science 2025-03-12 Jakub Maciejewski , Konstantinos Nikoletos , George Papadakis , Yannis Velegrakis

Entity Resolution (ER) is a critical data cleaning task for identifying records that refer to the same real-world entity. In the era of Big Data, traditional batch ER is often infeasible due to volume and velocity constraints, necessitating…

Databases · Computer Science 2026-01-05 Dimitrios Karapiperis , George Papadakis , Vassilios Verykios

Entity resolution is the problem of reconciling database references corresponding to the same real-world entities. Given the abundance of publicly available databases that have unresolved entities, we motivate the problem of query-time…

Databases · Computer Science 2011-11-02 I. Bhattacharya , L. Getoor

Crowdsourcing is becoming increasingly important in entity resolution tasks due to their inherent complexity such as clustering of images and natural language processing. Humans can provide more insightful information for these difficult…

Databases · Computer Science 2017-08-28 Vijaya Krishna Yalavarthi , Xiangyu Ke , Arijit Khan

One of the most important tasks for improving data quality and the reliability of data analytics results is Entity Resolution (ER). ER aims to identify different descriptions that refer to the same real-world entity, and remains a…

Entity resolution (ER) is the problem of identifying and linking database records that refer to the same real-world entity. Traditional ER methods use batch processing, which becomes impractical with growing data volumes due to high…

Databases · Computer Science 2025-10-09 Shujing Wang , Sibo Zhao , Shiqi Miao , Selasi Kwashie , Michael Bewong , Junwei Hu , Vincent M. Nofong , Zaiwen Feng

Entity Resolution (ER) is a fundamental data quality improvement task that identifies and links records referring to the same real-world entity. Traditional ER approaches often rely on pairwise comparisons, which can be costly in terms of…

Databases · Computer Science 2025-06-04 Jiajie Fu , Haitong Tang , Arijit Khan , Sharad Mehrotra , Xiangyu Ke , Yunjun Gao

Entity resolution (ER) is a key data integration problem. Despite the efforts in 70+ years in all aspects of ER, there is still a high demand for democratizing ER - humans are heavily involved in labeling data, performing feature…

Databases · Computer Science 2019-11-20 Muhammad Ebraheem , Saravanan Thirumuruganathan , Shafiq Joty , Mourad Ouzzani , Nan Tang

Entity resolution (ER) is the process of identifying records that refer to the same entities within one or across multiple databases. Numerous techniques have been developed to tackle ER challenges over the years, with recent emphasis…

Databases · Computer Science 2023-11-14 George Papadakis , Nishadi Kirielle , Peter Christen , Themis Palpanas

Entity resolution, the task of identifying and merging records that refer to the same real-world entity, is crucial in sectors like e-commerce, healthcare, and law enforcement. Large Language Models (LLMs) introduce an innovative approach…

Computation and Language · Computer Science 2024-09-13 Huahang Li , Longyu Feng , Shuangyin Li , Fei Hao , Chen Jason Zhang , Yuanfeng Song

Entity resolution (ER) is the task of identifying records belonging to the same entity (e.g. individual, group) across one or multiple databases. Ironically, it has multiple names: deduplication and record linkage, among others. In this…

Databases · Computer Science 2015-09-15 Matt Barnes

Entity resolution is central to data integration and data cleaning. Algorithmic approaches have been improving in quality, but remain far from perfect. Crowdsourcing platforms offer a more accurate but expensive (and slow) way to bring…

Databases · Computer Science 2012-08-10 Jiannan Wang , Tim Kraska , Michael J. Franklin , Jianhua Feng

Entity resolution (ER) is the task of identifying different representations of the same real-world entities across databases. It is a key step for knowledge base creation and text mining. Recent adaptation of deep learning methods for ER…

Databases · Computer Science 2019-06-20 Jungo Kasai , Kun Qian , Sairam Gurajada , Yunyao Li , Lucian Popa

Entity resolution (ER) is a fundamental task in data integration that enables insights from heterogeneous data sources. The primary challenge of ER lies in classifying record pairs as matches or nonmatches, which in multi-source ER (MS-ER)…

Databases · Computer Science 2026-04-10 Victor Christen , Peter Christen

Entity resolution (ER) is about identifying and merging records in a database that represent the same real-world entity. Matching dependencies (MDs) have been introduced and investigated as declarative rules that specify ER policies. An ER…

Databases · Computer Science 2017-02-28 Zeinab Bahmani , Leopoldo Bertossi

In recent years, crowdsourcing is increasingly applied as a means to enhance data quality. Although the crowd generates insightful information especially for complex problems such as entity resolution (ER), the output quality of crowd…

Databases · Computer Science 2015-12-03 Anja Gruenheid , Besmira Nushi , Tim Kraska , Wolfgang Gatterbauer , Donald Kossmann

Entity resolution (ER) refers to the problem of matching records in one or more relations that refer to the same real-world entity. While supervised machine learning (ML) approaches achieve the state-of-the-art results, they require a large…

Databases · Computer Science 2020-04-07 Renzhi Wu , Sanya Chaba , Saurabh Sawlani , Xu Chu , Saravanan Thirumuruganathan

Usually considered as a classification problem, entity resolution (ER) can be very challenging on real data due to the prevalence of dirty values. The state-of-the-art solutions for ER were built on a variety of learning models (most…

Databases · Computer Science 2019-06-17 Boyi Hou , Qun Chen , Yanyan Wang , Youcef Nafa , Zhanhuai Li
‹ Prev 1 2 3 10 Next ›