Related papers: Instantiation

Modelling Semantic Categories using Conceptual Neighborhood

While many methods for learning vector space embeddings have been proposed in the field of Natural Language Processing, these methods typically do not distinguish between categories and individuals. Intuitively, if individuals are…

Computation and Language · Computer Science 2019-12-04 Zied Bouraoui , Jose Camacho-Collados , Luis Espinosa-Anke , Steven Schockaert

On Entity Identification in Language Models

We analyze the extent to which internal representations of language models (LMs) identify and distinguish mentions of named entities, focusing on the many-to-many correspondence between entities and their mentions. We first formulate two…

Computation and Language · Computer Science 2025-07-22 Masaki Sakata , Benjamin Heinzerling , Sho Yokoi , Takumi Ito , Kentaro Inui

Joint Embeddings of Hierarchical Categories and Entities

Due to the lack of structured knowledge applied in learning distributed representation of categories, existing work cannot incorporate category hierarchies into entity information.~We propose a framework that embeds entities and categories…

Computation and Language · Computer Science 2016-05-16 Yuezhang Li , Ronghuo Zheng , Tian Tian , Zhiting Hu , Rahul Iyer , Katia Sycara

Category Enhanced Word Embedding

Distributed word representations have been demonstrated to be effective in capturing semantic and syntactic regularities. Unsupervised representation learning from large unlabeled corpora can learn similar representations for those words…

Computation and Language · Computer Science 2015-12-01 Chunting Zhou , Chonglin Sun , Zhiyuan Liu , Francis C. M. Lau

Toward Socially-Infused Information Extraction: Embedding Authors, Mentions, and Entities

Entity linking is the task of identifying mentions of entities in text, and linking them to entries in a knowledge base. This task is especially difficult in microblogs, as there is little additional text to provide disambiguating context;…

Computation and Language · Computer Science 2016-09-27 Yi Yang , Ming-Wei Chang , Jacob Eisenstein

Open Named Entity Modeling from Embedding Distribution

In this paper, we report our discovery on named entity distribution in a general word embedding space, which helps an open definition on multilingual named entity definition rather than previous closed and constraint definition on named…

Computation and Language · Computer Science 2021-02-11 Ying Luo , Hai Zhao , Zhuosheng Zhang , Bingjie Tang

Distributed Entity Disambiguation with Per-Mention Learning

Entity disambiguation, or mapping a phrase to its canonical representation in a knowledge base, is a fundamental step in many natural language processing applications. Existing techniques based on global ranking models fail to capture the…

Computation and Language · Computer Science 2016-04-21 Tiep Mai , Bichen Shi , Patrick K. Nicholson , Deepak Ajwani , Alessandra Sala

Discovering Concepts in Learned Representations using Statistical Inference and Interactive Visualization

Concept discovery is one of the open problems in the interpretability literature that is important for bridging the gap between non-deep learning experts and model end-users. Among current formulations, concepts defines them by as a…

Machine Learning · Computer Science 2022-02-11 Adrianna Janik , Kris Sankaran

CANDLE: Iterative Conceptualization and Instantiation Distillation from Large Language Models for Commonsense Reasoning

The sequential process of conceptualization and instantiation is essential to generalizable commonsense reasoning as it allows the application of existing knowledge to unfamiliar scenarios. However, existing works tend to undervalue the…

Computation and Language · Computer Science 2024-05-24 Weiqi Wang , Tianqing Fang , Chunyang Li , Haochen Shi , Wenxuan Ding , Baixuan Xu , Zhaowei Wang , Jiaxin Bai , Xin Liu , Jiayang Cheng , Chunkit Chan , Yangqiu Song

Definition Modeling: Learning to define word embeddings in natural language

Distributed representations of words have been shown to capture lexical semantics, as demonstrated by their effectiveness in word similarity and analogical relation tasks. But, these tasks only evaluate lexical semantics indirectly. In this…

Computation and Language · Computer Science 2016-12-02 Thanapon Noraset , Chen Liang , Larry Birnbaum , Doug Downey

Building a Fine-Grained Entity Typing System Overnight for a New X (X = Language, Domain, Genre)

Recent research has shown great progress on fine-grained entity typing. Most existing methods require pre-defining a set of types and training a multi-class classifier from a large labeled data set based on multi-level linguistic features.…

Computation and Language · Computer Science 2016-03-11 Lifu Huang , Jonathan May , Xiaoman Pan , Heng Ji

Corpus-level Fine-grained Entity Typing

This paper addresses the problem of corpus-level entity typing, i.e., inferring from a large corpus that an entity is a member of a class such as "food" or "artist". The application of entity typing we are interested in is knowledge base…

Computation and Language · Computer Science 2018-06-11 Yadollah Yaghoobzadeh , Heike Adel , Hinrich Schütze

CALE : Concept-Aligned Embeddings for Both Within-Lemma and Inter-Lemma Sense Differentiation

Lexical semantics is concerned with both the multiple senses a word can adopt in different contexts, and the semantic relations that exist between meanings of different words. To investigate them, Contextualized Language Models are a…

Computation and Language · Computer Science 2026-01-26 Bastien Liétard , Gabriel Loiseau

Propagating Knowledge Updates to LMs Through Distillation

Modern language models have the capacity to store and use immense amounts of knowledge about real-world entities, but it remains unclear how to update such knowledge stored in model parameters. While prior methods for updating knowledge in…

Computation and Language · Computer Science 2023-11-01 Shankar Padmanabhan , Yasumasa Onoe , Michael J. Q. Zhang , Greg Durrett , Eunsol Choi

Clinical Named Entity Recognition using Contextualized Token Representations

The clinical named entity recognition (CNER) task seeks to locate and classify clinical terminologies into predefined categories, such as diagnostic procedure, disease disorder, severity, medication, medication dosage, and sign symptom.…

Computation and Language · Computer Science 2021-06-25 Yichao Zhou , Chelsea Ju , J. Harry Caufield , Kevin Shih , Calvin Chen , Yizhou Sun , Kai-Wei Chang , Peipei Ping , Wei Wang

Interpretable Entity Representations through Large-Scale Typing

In standard methodology for natural language processing, entities in text are typically embedded in dense vector spaces with pre-trained models. The embeddings produced this way are effective when fed into downstream models, but they…

Computation and Language · Computer Science 2020-10-14 Yasumasa Onoe , Greg Durrett

Topics as Entity Clusters: Entity-based Topics from Large Language Models and Graph Neural Networks

Topic models aim to reveal latent structures within a corpus of text, typically through the use of term-frequency statistics over bag-of-words representations from documents. In recent years, conceptual entities -- interpretable,…

Computation and Language · Computer Science 2024-08-27 Manuel V. Loureiro , Steven Derby , Tri Kurniawan Wijaya

Locate and Label: A Two-stage Identifier for Nested Named Entity Recognition

Named entity recognition (NER) is a well-studied task in natural language processing. Traditional NER research only deals with flat entities and ignores nested entities. The span-based methods treat entity recognition as a span…

Computation and Language · Computer Science 2021-07-14 Yongliang Shen , Xinyin Ma , Zeqi Tan , Shuai Zhang , Wen Wang , Weiming Lu

Modeling Named Entity Embedding Distribution into Hypersphere

This work models named entity distribution from a way of visualizing topological structure of embedding space, so that we make an assumption that most, if not all, named entities (NEs) for a language tend to aggregate together to be…

Computation and Language · Computer Science 2019-09-04 Zhuosheng Zhang , Bingjie Tang , Zuchao Li , Hai Zhao

Improving Named Entity Recognition for Chinese Social Media with Word Segmentation Representation Learning

Named entity recognition, and other information extraction tasks, frequently use linguistic features such as part of speech tags or chunkings. For languages where word boundaries are not readily identified in text, word segmentation is a…

Computation and Language · Computer Science 2017-03-30 Nanyun Peng , Mark Dredze