Related papers: Hidden Entity Detection from GitHub Leveraging Lar…

Leveraging Large Language Models for Entity Matching

Entity matching (EM) is a critical task in data integration, aiming to identify records across different datasets that refer to the same real-world entities. Traditional methods often rely on manually engineered features and rule-based…

Computation and Language · Computer Science 2024-06-03 Qianyu Huang , Tongfang Zhao

Web-Scale Visual Entity Recognition: An LLM-Driven Data Approach

Web-scale visual entity recognition, the task of associating images with their corresponding entities within vast knowledge bases like Wikipedia, presents significant challenges due to the lack of clean, large-scale training data. In this…

Computer Vision and Pattern Recognition · Computer Science 2024-11-01 Mathilde Caron , Alireza Fathi , Cordelia Schmid , Ahmet Iscen

Large Language Model-Informed Feature Discovery Improves Prediction and Interpretation of Credibility Perceptions of Visual Content

In today's visually dominated social media landscape, predicting the perceived credibility of visual content and understanding what drives human judgment are crucial for countering misinformation. However, these tasks are challenging due to…

Computer Vision and Pattern Recognition · Computer Science 2025-04-16 Yilang Peng , Sijia Qian , Yingdan Lu , Cuihua Shen

Large Language Models Are Zero-Shot Text Classifiers

Retrained large language models (LLMs) have become extensively used across various sub-disciplines of natural language processing (NLP). In NLP, text classification problems have garnered considerable focus, but still faced with some…

Computation and Language · Computer Science 2023-12-05 Zhiqiang Wang , Yiran Pang , Yanbin Lin

Zshot: An Open-source Framework for Zero-Shot Named Entity Recognition and Relation Extraction

The Zero-Shot Learning (ZSL) task pertains to the identification of entities or relations in texts that were not seen during training. ZSL has emerged as a critical research area due to the scarcity of labeled data in specific domains, and…

Computation and Language · Computer Science 2023-07-26 Gabriele Picco , Marcos Martínez Galindo , Alberto Purpura , Leopold Fuchs , Vanessa López , Hoang Thanh Lam

Named Entity Recognition of Historical Texts via Large Language Model

Large language models (LLMs) have demonstrated remarkable versatility across a wide range of natural language processing tasks and domains. One such task is Named Entity Recognition (NER), which involves identifying and classifying proper…

Digital Libraries · Computer Science 2026-04-29 Shibingfeng Zhang , Giovanni Colavizza

llmNER: (Zero|Few)-Shot Named Entity Recognition, Exploiting the Power of Large Language Models

Large language models (LLMs) allow us to generate high-quality human-like text. One interesting task in natural language processing (NLP) is named entity recognition (NER), which seeks to detect mentions of relevant information in…

Computation and Language · Computer Science 2024-06-10 Fabián Villena , Luis Miranda , Claudio Aracena

Self-Improving for Zero-Shot Named Entity Recognition with Large Language Models

Exploring the application of powerful large language models (LLMs) on the named entity recognition (NER) task has drawn much attention recently. This work pushes the performance boundary of zero-shot NER with LLMs by proposing a…

Computation and Language · Computer Science 2024-03-22 Tingyu Xie , Qi Li , Yan Zhang , Zuozhu Liu , Hongwei Wang

Entity Matching using Large Language Models

Entity matching is the task of deciding whether two entity descriptions refer to the same real-world entity. Entity matching is a central step in most data integration pipelines. Many state-of-the-art entity matching methods rely on…

Computation and Language · Computer Science 2024-10-21 Ralph Peeters , Aaron Steiner , Christian Bizer

Unveiling Entity-Level Unlearning for Large Language Models: A Comprehensive Analysis

Large language model unlearning has garnered increasing attention due to its potential to address security and privacy concerns, leading to extensive research in the field. However, much of this research has concentrated on instance-level…

Computation and Language · Computer Science 2025-05-20 Weitao Ma , Xiaocheng Feng , Weihong Zhong , Lei Huang , Yangfan Ye , Xiachong Feng , Bing Qin

Leveraging Open-Source Large Language Models for Native Language Identification

Native Language Identification (NLI) - the task of identifying the native language (L1) of a person based on their writing in the second language (L2) - has applications in forensics, marketing, and second language acquisition.…

Computation and Language · Computer Science 2025-01-22 Yee Man Ng , Ilia Markov

Fine-tuning Large Language Models for Entity Matching

Generative large language models (LLMs) are a promising alternative to pre-trained language models for entity matching due to their high zero-shot performance and ability to generalize to unseen entities. Existing research on using LLMs for…

Computation and Language · Computer Science 2025-05-22 Aaron Steiner , Ralph Peeters , Christian Bizer

LLMs in Software Security: A Survey of Vulnerability Detection Techniques and Insights

Large Language Models (LLMs) are emerging as transformative tools for software vulnerability detection, addressing critical challenges in the security domain. Traditional methods, such as static and dynamic analysis, often falter due to…

Cryptography and Security · Computer Science 2025-02-19 Ze Sheng , Zhicheng Chen , Shuning Gu , Heqing Huang , Guofei Gu , Jeff Huang

ProgGen: Generating Named Entity Recognition Datasets Step-by-step with Self-Reflexive Large Language Models

Although Large Language Models (LLMs) exhibit remarkable adaptability across domains, these models often fall short in structured knowledge extraction tasks such as named entity recognition (NER). This paper explores an innovative,…

Computation and Language · Computer Science 2024-06-11 Yuzhao Heng , Chunyuan Deng , Yitong Li , Yue Yu , Yinghao Li , Rongzhi Zhang , Chao Zhang

Can Large Language Models Identify Authorship?

The ability to accurately identify authorship is crucial for verifying content authenticity and mitigating misinformation. Large Language Models (LLMs) have demonstrated an exceptional capacity for reasoning and problem-solving. However,…

Computation and Language · Computer Science 2024-10-23 Baixiang Huang , Canyu Chen , Kai Shu

Harnessing Large Language Models for Software Vulnerability Detection: A Comprehensive Benchmarking Study

Despite various approaches being employed to detect vulnerabilities, the number of reported vulnerabilities shows an upward trend over the years. This suggests the problems are not caught before the code is released, which could be caused…

Cryptography and Security · Computer Science 2025-02-14 Karl Tamberg , Hayretdin Bahsi

Entity Extraction from High-Level Corruption Schemes via Large Language Models

The rise of financial crime that has been observed in recent years has created an increasing concern around the topic and many people, organizations and governments are more and more frequently trying to combat it. Despite the increase of…

Computation and Language · Computer Science 2024-11-12 Panagiotis Koletsis , Panagiotis-Konstantinos Gemos , Christos Chronis , Iraklis Varlamis , Vasilis Efthymiou , Georgios Th. Papadopoulos

A Benchmark and Robustness Study of In-Context-Learning with Large Language Models in Music Entity Detection

Detecting music entities such as song titles or artist names is a useful application to help use cases like processing music search queries or analyzing music consumption on the web. Recent approaches incorporate smaller language models…

Computation and Language · Computer Science 2024-12-17 Simon Hachmeier , Robert Jäschke

Large Language Models Struggle in Token-Level Clinical Named Entity Recognition

Large Language Models (LLMs) have revolutionized various sectors, including healthcare where they are employed in diverse applications. Their utility is particularly significant in the context of rare diseases, where data scarcity,…

Computation and Language · Computer Science 2024-08-20 Qiuhao Lu , Rui Li , Andrew Wen , Jinlian Wang , Liwei Wang , Hongfang Liu

Multimodal Large Language Models for Phishing Webpage Detection and Identification

To address the challenging problem of detecting phishing webpages, researchers have developed numerous solutions, in particular those based on machine learning (ML) algorithms. Among these, brand-based phishing detection that uses models…

Cryptography and Security · Computer Science 2024-08-13 Jehyun Lee , Peiyuan Lim , Bryan Hooi , Dinil Mon Divakaran