English
Related papers

Related papers: HyperBlocker: Accelerating Rule-based Blocking in …

200 papers

Blocking is a mechanism to improve the efficiency of Entity Resolution (ER) which aims to quickly prune out all non-matching record pairs. However, depending on the distributions of entity cluster sizes, existing techniques can be either…

Databases · Computer Science 2021-03-17 Sainyam Galhotra , Donatella Firmani , Barna Saha , Divesh Srivastava

Blocking is a critical step in entity resolution, and the emergence of neural network-based representation models has led to the development of dense blocking as a promising approach for exploring deep semantics in blocking. However,…

Databases · Computer Science 2024-04-26 Tianshu Wang , Hongyu Lin , Xianpei Han , Xiaoyang Chen , Boxi Cao , Le Sun

Entity resolution (ER) is a key data integration problem. Despite the efforts in 70+ years in all aspects of ER, there is still a high demand for democratizing ER - humans are heavily involved in labeling data, performing feature…

Databases · Computer Science 2019-11-20 Muhammad Ebraheem , Saravanan Thirumuruganathan , Shafiq Joty , Mourad Ouzzani , Nan Tang

The goal of entity resolution is to identify records in multiple datasets that represent the same real-world entity. However, comparing all records across datasets can be computationally intensive, leading to long runtimes. To reduce these…

Databases · Computer Science 2023-06-26 Alexander Brinkmann , Roee Shraga , Christian Bizer

This paper introduces an innovative approach to boost the efficiency and scalability of Evolutionary Rule-based machine Learning (ERL), a key technique in explainable AI. While traditional ERL systems can distribute processes across…

Neural and Evolutionary Computing · Computer Science 2025-05-27 Hormoz Shahrzad , Risto Miikkulainen

Hypergraph partitioning is a recurring NP-hard problem in engineering; its efficient solution at scale hinges on parallelism. This work proposes a GPU-centric algorithm for multi-level hypergraph partitioning aimed at a specific set of…

Distributed, Parallel, and Cluster Computing · Computer Science 2026-05-21 Marco Ronzani , Cristina Silvano

Recent years have witnessed impressive progress in super-resolution (SR) processing. However, its real-time inference requirement sets a challenge not only for the model design but also for the on-chip implementation. In this paper, we…

Hardware Architecture · Computer Science 2023-04-04 Wenqian Zhao , Qi Sun , Yang Bai , Wenbo Li , Haisheng Zheng , Bei Yu , Martin D. F. Wong

It has been widely accepted that Graphics Processing Units (GPU) is one of promising schemes for encryption acceleration, in particular, the support of complex mathematical calculations such as integer and logical operations makes the…

Distributed, Parallel, and Cluster Computing · Computer Science 2019-02-15 Canhui Wang , Xiaowen Chu

Entity Resolution, also called record linkage or deduplication, refers to the process of identifying and merging duplicate versions of the same entity into a unified representation. The standard practice is to use a Rule based or Machine…

Artificial Intelligence · Computer Science 2016-09-22 Janani Balaji , Faizan Javed , Mayank Kejriwal , Chris Min , Sam Sander , Ozgur Ozturk

Entity resolution (ER) is the problem of identifying and linking database records that refer to the same real-world entity. Traditional ER methods use batch processing, which becomes impractical with growing data volumes due to high…

Databases · Computer Science 2025-10-09 Shujing Wang , Sibo Zhao , Shiqi Miao , Selasi Kwashie , Michael Bewong , Junwei Hu , Vincent M. Nofong , Zaiwen Feng

Entity Resolution constitutes a core data integration task that relies on Blocking in order to tame its quadratic time complexity. Schema-agnostic blocking achieves very high recall, requires no domain knowledge and applies to data of any…

Databases · Computer Science 2022-04-20 Luca Gagliardelli , George Papadakis , Giovanni Simonini , Sonia Bergamaschi , Themis Palpanas

This paper presents Block, a distributed scheduling framework designed to optimize load balancing and auto-provisioning across instances in large language model serving frameworks by leveraging contextual information from incoming requests.…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-08-14 Wei Da , Evangelia Kalyvianaki

Expert Parallelism (EP) permits Mixture of Experts (MoE) models to scale beyond a single GPU. To address load imbalance across GPUs in EP, existing approaches aim to balance the number of tokens each GPU processes. Surprisingly, we find…

Hypergraph partitioning is a pervasive NP-hard problem, and accelerating its computation on GPU can both slice time-to-solution and raise quality of results. In this work, we implement a multi-level hypergraph partitioning algorithm on GPU…

Distributed, Parallel, and Cluster Computing · Computer Science 2026-04-17 Marco Ronzani , Cristina Silvano

Efficient parallelization of algorithms on general-purpose GPUs is essential in many areas today. However, it is a non-trivial task for software engineers to utilize GPUs to improve the performance of high-level programs in general.…

Programming Languages · Computer Science 2024-07-09 Lars Hummelgren , John Wikman , Oscar Eriksson , Philipp Haller , David Broman

The number of cores on graphical computing units (GPUs) is reaching thousands nowadays, whereas the clock speed of processors stagnates. Unfortunately, constraint programming solvers do not take advantage yet of GPU parallelism. One reason…

Distributed, Parallel, and Cluster Computing · Computer Science 2022-07-26 Pierre Talbot , Frédéric Pinel , Pascal Bouvry

The prediction of a dielectric breakdown in a high-voltage device is based on criteria that evaluate the electric field along field lines. Therefore it is necessary to efficiently compute the electric field at arbitrary points in space. A…

Numerical Analysis · Mathematics 2020-11-03 Cedric Münger , Steffen Börm , Jörg Ostrowski

Evolutionary algorithms (EAs) are increasingly implemented on graphics processing units (GPUs) to leverage parallel processing capabilities for enhanced efficiency. However, existing studies largely emphasize the raw speedup obtained by…

Neural and Evolutionary Computing · Computer Science 2026-01-28 Xinmeng Yu , Tao Jiang , Ran Cheng , Yaochu Jin , Kay Chen Tan

Entity Resolution (ER) is a critical data cleaning task for identifying records that refer to the same real-world entity. In the era of Big Data, traditional batch ER is often infeasible due to volume and velocity constraints, necessitating…

Databases · Computer Science 2026-01-05 Dimitrios Karapiperis , George Papadakis , Vassilios Verykios

In this work, we survey the role of GPUs in real-time systems. Originally designed for parallel graphics workloads, GPUs are now widely used in time-critical applications such as machine learning, autonomous vehicles, and robotics due to…

‹ Prev 1 2 3 10 Next ›