Related papers: What's in a Name?

What's in a Name? Evaluating Assembly-Part Semantic Knowledge in Language Models through User-Provided Names in CAD Files

Semantic knowledge of part-part and part-whole relationships in assemblies is useful for a variety of tasks from searching design repositories to the construction of engineering knowledge bases. In this work we propose that the natural…

Computation and Language · Computer Science 2023-04-28 Peter Meltzer , Joseph G. Lambourne , Daniele Grandi

Reproducing, Extending, and Analyzing Naming Experiments

Naming is very important in software development, as names are often the only vehicle of meaning about what the code is intended to do. A recent study on how developers choose names collected the names given by different developers for the…

Software Engineering · Computer Science 2024-02-16 Rachel Alpern , Ido Lazer , Issar Tzachor , Hanit Hakim , Sapir Weissbuch , Dror G. Feitelson

Name Searching and Information Retrieval

The main application of name searching has been name matching in a database of names. This paper discusses a different application: improving information retrieval through name recognition. It investigates name recognition accuracy, and the…

cmp-lg · Computer Science 2008-02-03 Paul Thompson , Christopher C. Dozier

Corpus structure, language models, and ad hoc information retrieval

Most previous work on the recently developed language-modeling approach to information retrieval focuses on document-specific characteristics, and therefore does not take into account the structure of the surrounding corpus. We propose a…

Information Retrieval · Computer Science 2007-05-23 Oren Kurland , Lillian Lee

Les noms propres se traduisent-ils ? \'Etude d'un corpus multilingue

In this paper, we tackle the problem of the translation of proper names. We introduce our hypothesis according to which proper names can be translated more often than most people seem to think. Then, we describe the construction of a…

Computation and Language · Computer Science 2014-07-08 Émeline Lecuit , Denis Maurel , Dusko Vitas

On the Strength of Character Language Models for Multilingual Named Entity Recognition

Character-level patterns have been widely used as features in English Named Entity Recognition (NER) systems. However, to date there has been no direct investigation of the inherent differences between name and non-name tokens in text, nor…

Computation and Language · Computer Science 2018-09-21 Xiaodong Yu , Stephen Mayhew , Mark Sammons , Dan Roth

When Are Names Similar Or the Same? Introducing the Code Names Matcher Library

Program code contains functions, variables, and data structures that are represented by names. To promote human understanding, these names should describe the role and use of the code elements they represent. But the names given by…

Software Engineering · Computer Science 2022-09-08 Moshe Munk , Dror G. Feitelson

What's in a name?

Among the several findings deriving from the application of complex network formalism to the investigation of natural phenomena, the fact that linguistic constructions follow power laws presents special interest for its potential…

Disordered Systems and Neural Networks · Physics 2009-11-10 Luciano da Fontoura Costa

From Isolates to Families: Using Neural Networks for Automated Language Affiliation

In historical linguistics, the affiliation of languages to a common language family is traditionally carried out using a complex workflow that relies on manually comparing individual languages. Large-scale standardized collections of…

Computation and Language · Computer Science 2025-12-09 Frederic Blum , Steffen Herbold , Johann-Mattis List

Uncovering Name-Based Biases in Large Language Models Through Simulated Trust Game

Gender and race inferred from an individual's name are a notable source of stereotypes and biases that subtly influence social interactions. Abundant evidence from human experiments has revealed the preferential treatment that one receives…

Computers and Society · Computer Science 2024-04-24 Yumou Wei , Paulo F. Carvalho , John Stamper

Scholar Name Disambiguation with Search-enhanced LLM Across Language

The task of scholar name disambiguation is crucial in various real-world scenarios, including bibliometric-based candidate evaluation for awards, application material anti-fraud measures, and more. Despite significant advancements, current…

Information Retrieval · Computer Science 2025-03-05 Renyu Zhao , Yunxin Chen

Multilingual person name recognition and transliteration

We present an exploratory tool that extracts person names from multilingual news collections, matches name variants referring to the same person, and infers relationships between people based on the co-occurrence of their names in related…

Computation and Language · Computer Science 2007-05-23 Bruno Pouliquen , Ralf Steinberger , Camelia Ignat , Irina Temnikova , Anna Widiger , Wajdi Zaghouani , Jan Zizka

Corpus Statistics Meet the Noun Compound: Some Empirical Results

A variety of statistical methods for noun compound analysis are implemented and compared. The results support two main conclusions. First, the use of conceptual association not only enables a broad coverage, but also improves the accuracy.…

cmp-lg · Computer Science 2008-02-03 Mark Lauer

Personal Names Popularity Estimation and its Application to Record Linkage

This study deals with a fairly simply formulated problem -- how to estimate the number of people bearing the same full name in a large population. Estimation of name popularity can leverage personal name matching in databases and be of…

Databases · Computer Science 2021-10-14 Ksenia Zhagorina , Pavel Braslavski , Vladimir Gusev

What's in a Name? Auditing Large Language Models for Race and Gender Bias

We employ an audit design to investigate biases in state-of-the-art large language models, including GPT-4. In our study, we prompt the models for advice involving a named individual across a variety of scenarios, such as during car…

Computation and Language · Computer Science 2025-01-27 Alejandro Salinas , Amit Haim , Julian Nyarko

Compositional Approaches for Representing Relations Between Words: A Comparative Study

Identifying the relations that exist between words (or entities) is important for various natural language processing tasks such as, relational search, noun-modifier classification and analogy detection. A popular approach to represent the…

Computation and Language · Computer Science 2017-09-06 Huda Hakami , Danushka Bollegala

Learning Alternative Name Spellings

Name matching is a key component of systems for entity resolution or record linkage. Alternative spellings of the same names are a com- mon occurrence in many applications. We use the largest collection of genealogy person records in the…

Information Retrieval · Computer Science 2014-05-09 Jeffrey Sukharev , Leonid Zhukov , Alexandrin Popescul

What is in a name? Mitigating Name Bias in Text Embeddings via Anonymization

Text-embedding models often exhibit biases arising from the data on which they are trained. In this paper, we examine a hitherto unexplored bias in text-embeddings: bias arising from the presence of $\textit{names}$ such as persons,…

Computation and Language · Computer Science 2025-02-06 Sahil Manchanda , Pannaga Shivaswamy

Building Language Models for Text with Named Entities

Text in many domains involves a significant amount of named entities. Predict- ing the entity names is often challenging for a language model as they appear less frequent on the training corpus. In this paper, we propose a novel and…

Computation and Language · Computer Science 2018-05-15 Md Rizwan Parvez , Saikat Chakraborty , Baishakhi Ray , Kai-Wei Chang

Exploring Language Similarities with Dimensionality Reduction Technique

In recent years several novel models were developed to process natural language, development of accurate language translation systems have helped us overcome geographical barriers and communicate ideas effectively. These models are…

Computation and Language · Computer Science 2019-02-19 Sangarshanan Veeraraghavan