Related papers: Latent Relation Language Models

An Overview on Language Models: Recent Developments and Outlook

Language modeling studies the probability distributions over strings of texts. It is one of the most fundamental tasks in natural language processing (NLP). It has been widely used in text generation, speech recognition, machine…

Computation and Language · Computer Science 2024-07-18 Chengwei Wei , Yun-Cheng Wang , Bin Wang , C. -C. Jay Kuo

Applying Large Language Models in Knowledge Graph-based Enterprise Modeling: Challenges and Opportunities

The role of large language models (LLMs) in enterprise modeling has recently started to shift from academic research to that of industrial applications. Thereby, LLMs represent a further building block for the machine-supported generation…

Multiagent Systems · Computer Science 2025-01-08 Benedikt Reitemeyer , Hans-Georg Fill

Generalizing and Hybridizing Count-based and Neural Language Models

Language models (LMs) are statistical models that calculate probabilities over sequences of words or other discrete symbols. Currently two major paradigms for language modeling exist: count-based n-gram models, which have advantages of…

Computation and Language · Computer Science 2016-09-27 Graham Neubig , Chris Dyer

Tracking the perspectives of interacting language models

Large language models (LLMs) are capable of producing high quality information at unprecedented rates. As these models continue to entrench themselves in society, the content they produce will become increasingly pervasive in databases that…

Artificial Intelligence · Computer Science 2024-06-19 Hayden Helm , Brandon Duderstadt , Youngser Park , Carey E. Priebe

Document Context Language Models

Text documents are structured on multiple levels of detail: individual words are related by syntax, but larger units of text are related by discourse structure. Existing language models generally fail to account for discourse structure, but…

Computation and Language · Computer Science 2016-02-23 Yangfeng Ji , Trevor Cohn , Lingpeng Kong , Chris Dyer , Jacob Eisenstein

Language-Specific Latent Process Hinders Cross-Lingual Performance

Large language models (LLMs) are demonstrably capable of cross-lingual transfer, but can produce inconsistent output when prompted with the same queries written in different languages. To understand how language models are able to…

Computation and Language · Computer Science 2025-09-29 Zheng Wei Lim , Alham Fikri Aji , Trevor Cohn

A Neural Knowledge Language Model

Current language models have a significant limitation in the ability to encode and decode factual knowledge. This is mainly because they acquire such knowledge from statistical co-occurrences although most of the knowledge words are rarely…

Computation and Language · Computer Science 2017-03-03 Sungjin Ahn , Heeyoul Choi , Tanel Pärnamaa , Yoshua Bengio

How Abstract Is Linguistic Generalization in Large Language Models? Experiments with Argument Structure

Language models are typically evaluated on their success at predicting the distribution of specific words in specific contexts. Yet linguistic knowledge also encodes relationships between contexts, allowing inferences between word…

Computation and Language · Computer Science 2023-11-09 Michael Wilson , Jackson Petty , Robert Frank

KELM: Knowledge Enhanced Pre-Trained Language Representations with Message Passing on Hierarchical Relational Graphs

Incorporating factual knowledge into pre-trained language models (PLM) such as BERT is an emerging trend in recent NLP studies. However, most of the existing methods combine the external knowledge integration module with a modified…

Computation and Language · Computer Science 2022-05-06 Yinquan Lu , Haonan Lu , Guirong Fu , Qun Liu

Latent Topic Models for Hypertext

Latent topic models have been successfully applied as an unsupervised topic discovery technique in large document collections. With the proliferation of hypertext document collection such as the Internet, there has also been great interest…

Information Retrieval · Computer Science 2012-06-18 Amit Gruber , Michal Rosen-Zvi , Yair Weiss

A Latent Variable Recurrent Neural Network for Discourse Relation Language Models

This paper presents a novel latent variable recurrent neural network architecture for jointly modeling sequences of words and (possibly latent) discourse relations between adjacent sentences. A recurrent neural network generates individual…

Computation and Language · Computer Science 2016-04-06 Yangfeng Ji , Gholamreza Haffari , Jacob Eisenstein

Graph Language Models

While Language Models (LMs) are the workhorses of NLP, their interplay with structured knowledge graphs (KGs) is still actively researched. Current methods for encoding such graphs typically either (i) linearize them for embedding with LMs…

Computation and Language · Computer Science 2024-06-04 Moritz Plenz , Anette Frank

LaMPP: Language Models as Probabilistic Priors for Perception and Action

Language models trained on large text corpora encode rich distributional information about real-world environments and action sequences. This information plays a crucial role in current approaches to language processing tasks like question…

Machine Learning · Computer Science 2023-02-07 Belinda Z. Li , William Chen , Pratyusha Sharma , Jacob Andreas

On the Limitations of Large Language Models for Conceptual Database Modeling

This article analyzes the use of Large Language Models (LLMs) as support for the conceptual modeling of relational databases through the automatic generation of Entity-Relationship (ER) diagrams from natural language requirements. The…

Artificial Intelligence · Computer Science 2026-05-13 Arthur F. Siqueira , Carlos D. S. Nogueira , Eduarda Farias , Claudio E. C. Campelo , Júlia Menezes

Language Models as Agent Models

Language models (LMs) are trained on collections of documents, written by individual human agents to achieve specific goals in an outside world. During training, LMs have access only to text of these documents, with no direct evidence of…

Computation and Language · Computer Science 2022-12-06 Jacob Andreas

Improving Topic Models with Latent Feature Word Representations

Probabilistic topic models are widely used to discover latent topics in document collections, while latent feature vector representations of words have been used to obtain high performance in many NLP tasks. In this paper, we extend two…

Computation and Language · Computer Science 2018-10-16 Dat Quoc Nguyen , Richard Billingsley , Lan Du , Mark Johnson

A Survey on Neural Network Language Models

As the core component of Natural Language Processing (NLP) system, Language Model (LM) can provide word representation and probability indication of word sequences. Neural Network Language Models (NNLMs) overcome the curse of dimensionality…

Computation and Language · Computer Science 2019-06-14 Kun Jing , Jungang Xu

Let the Pretrained Language Models "Imagine" for Short Texts Topic Modeling

Topic models are one of the compelling methods for discovering latent semantics in a document collection. However, it assumes that a document has sufficient co-occurrence information to be effective. However, in short texts, co-occurrence…

Computation and Language · Computer Science 2023-10-25 Pritom Saha Akash , Jie Huang , Kevin Chen-Chuan Chang

Learning Informative Representations of Biomedical Relations with Latent Variable Models

Extracting biomedical relations from large corpora of scientific documents is a challenging natural language processing task. Existing approaches usually focus on identifying a relation either in a single sentence (mention-level) or across…

Computation and Language · Computer Science 2020-11-23 Harshil Shah , Julien Fauqueur

Large Language Models Are Latent Variable Models: Explaining and Finding Good Demonstrations for In-Context Learning

In recent years, pre-trained large language models (LLMs) have demonstrated remarkable efficiency in achieving an inference-time few-shot learning capability known as in-context learning. However, existing literature has highlighted the…

Computation and Language · Computer Science 2024-02-14 Xinyi Wang , Wanrong Zhu , Michael Saxon , Mark Steyvers , William Yang Wang