Related papers: ESBM: An Entity Summarization BenchMark

Entity Summarization: State of the Art and Future Challenges

The increasing availability of semantic data has substantially enhanced Web applications. Semantic data such as RDF data is commonly represented as entity-property-value triples. The magnitude of semantic data, in particular the large…

Information Retrieval · Computer Science 2021-05-12 Qingxia Liu, Gong Cheng, Kalpa Gunaratna, Yuzhong Qu

Wiki Entity Summarization Benchmark

Entity summarization aims to compute concise summaries for entities in knowledge graphs. Existing datasets and benchmarks are often limited to a few hundred entities and discard graph structure in source knowledge graphs. This limitation is…

Information Retrieval · Computer Science 2024-06-13 Saeedeh Javadi, Atefeh Moradan, Mohammad Sorkhpar, Klim Zaporojets, Davide Mottin, Ira Assent

ESA: Entity Summarization with Attention

Entity summarization aims at creating brief but informative descriptions of entities from knowledge graphs. While previous work mostly focused on traditional techniques such as clustering algorithms and graph models, we ask how to apply…

Computation and Language · Computer Science 2020-05-27 Dongjun Wei, Yaxin Liu, Fuqing Zhu, Liangjun Zang, Wei Zhou, Jizhong Han, Songlin Hu

MPSUM: Entity Summarization with Predicate-based Matching

With the development of Semantic Web, entity summarization has become an emerging task to generate concrete summaries for real world entities. To solve this problem, we propose an approach named MPSUM that extends a probabilistic topic…

Information Retrieval · Computer Science 2020-05-26 Dongjun Wei, Shiyuan Gao, Yaxin Liu, Zhibing Liu, Longtao Hang

SUMIE: A Synthetic Benchmark for Incremental Entity Summarization

No existing dataset adequately tests how well language models can incrementally update entity summaries - a crucial ability as these models rapidly advance. The Incremental Entity Summarization (IES) task is vital for maintaining accurate,…

Computation and Language · Computer Science 2024-06-10 Eunjeong Hwang, Yichao Zhou, Beliz Gunel, James Bradley Wendt, Sandeep Tata

EntSUM: A Data Set for Entity-Centric Summarization

Controllable summarization aims to provide summaries that take into account user-specified aspects and preferences to better assist them with their information need, as opposed to the standard summarization setup which builds a single…

Computation and Language · Computer Science 2022-04-06 Mounica Maddela, Mayank Kulkarni, Daniel Preotiuc-Pietro

Bridging the Gap between Reality and Ideality of Entity Matching: A Revisiting and Benchmark Re-Construction

Entity matching (EM) is the most critical step for entity resolution (ER). While current deep learning-based methods achieve very impressive performance on standard EM benchmarks, their real-world application performance is much frustrating.…

Computation and Language · Computer Science 2022-05-13 Tianshu Wang, Hongyu Lin, Cheng Fu, Xianpei Han, Le Sun, Feiyu Xiong, Hui Chen, Minlong Lu, Xiuwen Zhu

UserSumBench: A Benchmark Framework for Evaluating User Summarization Approaches

Large language models (LLMs) have shown remarkable capabilities in generating user summaries from a long list of raw user activity data. These summaries capture essential user information such as preferences and interests, and therefore are…

Machine Learning · Computer Science 2024-09-09 Chao Wang, Neo Wu, Lin Ning, Jiaxing Wu, Luyang Liu, Jun Xie, Shawn O'Banion, Bradley Green

Machamp: A Generalized Entity Matching Benchmark

Entity Matching (EM) refers to the problem of determining whether two different data representations refer to the same real-world entity. It has been a long-standing interest of the data management community and many efforts have been paid…

Databases · Computer Science 2021-06-17 Jin Wang, Yuliang Li, Wataru Hirota

Benchmarking Blocking Algorithms for Web Entities

An increasing number of entities are described by interlinked data rather than documents on the Web. Entity Resolution (ER) aims to identify descriptions of the same real-world entity within one or across knowledge bases in the Web of data.…

Databases · Computer Science 2020-05-20 Vasilis Efthymiou, Kostas Stefanidis, Vassilis Christophides

A Comprehensive Benchmark Framework for Active Learning Methods in Entity Matching

Entity Matching (EM) is a core data cleaning task, aiming to identify different mentions of the same real-world entity. Active learning is one way to address the challenge of scarce labeled data in practice, by dynamically collecting the…

Databases · Computer Science 2020-03-31 Venkata Vamsikrishna Meduri, Lucian Popa, Prithviraj Sen, Mohamed Sarwat

Match, Compare, or Select? An Investigation of Large Language Models for Entity Matching

Entity matching (EM) is a critical step in entity resolution (ER). Recently, entity matching based on large language models (LLMs) has shown great promise. However, current LLM-based entity matching approaches typically follow a binary…

Computation and Language · Computer Science 2024-12-13 Tianshu Wang, Xiaoyang Chen, Hongyu Lin, Xuanang Chen, Xianpei Han, Hao Wang, Zhenyu Zeng, Le Sun

EntEval: A Holistic Evaluation Benchmark for Entity Representations

Rich entity representations are useful for a wide class of problems involving entities. Despite their importance, there is no standardized benchmark that evaluates the overall quality of entity representations. In this work, we propose…

Computation and Language · Computer Science 2019-11-12 Mingda Chen, Zewei Chu, Yang Chen, Karl Stratos, Kevin Gimpel

Towards Multi-dimensional Evaluation of LLM Summarization across Domains and Languages

Evaluation frameworks for text summarization have evolved in terms of both domain coverage and metrics. However, existing benchmarks still lack domain-specific assessment criteria, remain predominantly English-centric, and face challenges…

Computation and Language · Computer Science 2025-06-03 Hyangsuk Min, Yuho Lee, Minjeong Ban, Jiaqi Deng, Nicole Hee-Yeon Kim, Taewon Yun, Hang Su, Jason Cai, Hwanjun Song

UMSE: Unified Multi-scenario Summarization Evaluation

Summarization quality evaluation is a non-trivial task in text summarization. Contemporary methods can be mainly categorized into two scenarios: (1) reference-based: evaluating with human-labeled reference summary; (2) reference-free:…

Computation and Language · Computer Science 2023-05-29 Shen Gao, Zhitao Yao, Chongyang Tao, Xiuying Chen, Pengjie Ren, Zhaochun Ren, Zhumin Chen

How to Evaluate Entity Resolution Systems: An Entity-Centric Framework with Application to Inventor Name Disambiguation

Entity resolution (record linkage, microclustering) systems are notoriously difficult to evaluate. Looking for a needle in a haystack, traditional evaluation methods use sophisticated, application-specific sampling schemes to find matching…

Computation and Language · Computer Science 2024-04-09 Olivier Binette, Youngsoo Baek, Siddharth Engineer, Christina Jones, Abel Dasylva, Jerome P. Reiter

MultiEM: Efficient and Effective Unsupervised Multi-Table Entity Matching

Entity Matching (EM), which aims to identify all entity pairs referring to the same real-world entity from relational tables, is one of the most important tasks in real-world data management systems. Due to the labeling process of EM being…

Databases · Computer Science 2023-08-07 Xiaocan Zeng, Pengfei Wang, Yuren Mao, Lu Chen, Xiaoze Liu, Yunjun Gao

UniSumEval: Towards Unified, Fine-Grained, Multi-Dimensional Summarization Evaluation for LLMs

Existing benchmarks for summarization quality evaluation often lack diverse input scenarios, focus on narrowly defined dimensions (e.g., faithfulness), and struggle with subjective and coarse-grained annotation schemes. To address these…

Computation and Language · Computer Science 2024-10-02 Yuho Lee, Taewon Yun, Jason Cai, Hang Su, Hwanjun Song

Benchmarking Summarizability Processing in XML Warehouses with Complex Hierarchies

Business Intelligence plays an important role in decision making. Based on data warehouses and Online Analytical Processing, a business intelligence tool can be used to analyze complex data. Still, summarizability issues in data warehouses…

Databases · Computer Science 2013-09-02 Chantola Kit, Marouane Hachicha, Jérôme Darmont

Investigating Consistency in Query-Based Meeting Summarization: A Comparative Study of Different Embedding Methods

With more and more advanced data analysis techniques emerging, people will expect these techniques to be applied in more complex tasks and solve problems in our daily lives. Text Summarization is one of famous applications in Natural…

Computation and Language · Computer Science 2024-02-13 Chen Jia-Chen, Guillem Senabre, Allane Caron