Related papers: Two Discourse Driven Language Models for Semantics

On Semantic Cognition, Inductive Generalization, and Language Models

My doctoral research focuses on understanding semantic knowledge in neural network models trained solely to predict natural language (referred to as language models, or LMs), by drawing on insights from the study of concepts and categories…

Computation and Language · Computer Science 2021-11-05 Kanishka Misra

Language Modeling with Sparse Product of Sememe Experts

Most language modeling methods rely on large-scale data to statistically learn the sequential patterns of words. In this paper, we argue that words are atomic language units but not necessarily atomic semantic units. Inspired by HowNet, we…

Computation and Language · Computer Science 2018-10-31 Yihong Gu , Jun Yan , Hao Zhu , Zhiyuan Liu , Ruobing Xie , Maosong Sun , Fen Lin , Leyu Lin

LMMS Reloaded: Transformer-based Sense Embeddings for Disambiguation and Beyond

Distributional semantics based on neural approaches is a cornerstone of Natural Language Processing, with surprising connections to human meaning representation as well. Recent Transformer-based Language Models have proven capable of…

Computation and Language · Computer Science 2022-04-04 Daniel Loureiro , Alípio Mário Jorge , Jose Camacho-Collados

Breaking Down Word Semantics from Pre-trained Language Models through Layer-wise Dimension Selection

Contextual word embeddings obtained from pre-trained language model (PLM) have proven effective for various natural language processing tasks at the word level. However, interpreting the hidden aspects within embeddings, such as syntax and…

Computation and Language · Computer Science 2023-10-10 Nayoung Choi

How Does Pretraining Improve Discourse-Aware Translation?

Pretrained language models (PLMs) have produced substantial improvements in discourse-aware neural machine translation (NMT), for example, improved coherence in spoken language translation. However, the underlying reasons for their strong…

Computation and Language · Computer Science 2023-06-01 Zhihong Huang , Longyue Wang , Siyou Liu , Derek F. Wong

Probing Natural Language Inference Models through Semantic Fragments

Do state-of-the-art models for language understanding already have, or can they easily learn, abilities such as boolean coordination, quantification, conditionals, comparatives, and monotonicity reasoning (i.e., reasoning about word…

Computation and Language · Computer Science 2019-12-03 Kyle Richardson , Hai Hu , Lawrence S. Moss , Ashish Sabharwal

Logical Parsing from Natural Language Based on a Neural Translation Model

Semantic parsing has emerged as a significant and powerful paradigm for natural language interface and question answering systems. Traditional methods of building a semantic parser rely on high-quality lexicons, hand-crafted grammars and…

Computation and Language · Computer Science 2017-05-10 Liang Li , Pengyu Li , Yifan Liu , Tao Wan , Zengchang Qin

Dual Information Speech Language Models for Emotional Conversations

Conversational systems relying on text-based large language models (LLMs) often overlook paralinguistic cues, essential for understanding emotions and intentions. Speech-language models (SLMs), which use speech as input, are emerging as a…

Computation and Language · Computer Science 2025-08-12 Chun Wang , Chenyang Liu , Wenze Xu , Weihong Deng

Semantic Mastery: Enhancing LLMs with Advanced Natural Language Understanding

Large language models (LLMs) have greatly improved their capability in performing NLP tasks. However, deeper semantic understanding, contextual coherence, and more subtle reasoning are still difficult to obtain. The paper discusses…

Computation and Language · Computer Science 2025-12-05 Mohanakrishnan Hariharan

Deriving Word Vectors from Contextualized Language Models using Topic-Aware Mention Selection

One of the long-standing challenges in lexical semantics consists in learning representations of words which reflect their semantic properties. The remarkable success of word embeddings for this purpose suggests that high-quality…

Computation and Language · Computer Science 2021-06-16 Yixiao Wang , Zied Bouraoui , Luis Espinosa Anke , Steven Schockaert

Learning Disentangled Representations for Natural Language Definitions

Disentangling the encodings of neural models is a fundamental aspect for improving interpretability, semantic control and downstream task performance in Natural Language Processing. Currently, most disentanglement methods are unsupervised…

Computation and Language · Computer Science 2023-02-17 Danilo S. Carvalho , Giangiacomo Mercatali , Yingji Zhang , Andre Freitas

Explaining Large Language Model-Based Neural Semantic Parsers (Student Abstract)

While large language models (LLMs) have demonstrated strong capability in structured prediction tasks such as semantic parsing, few amounts of research have explored the underlying mechanisms of their success. Our work studies different…

Computation and Language · Computer Science 2023-02-01 Daking Rai , Yilun Zhou , Bailin Wang , Ziyu Yao

Towards Computationally Verifiable Semantic Grounding for Language Models

The paper presents an approach to semantic grounding of language models (LMs) that conceptualizes the LM as a conditional model generating text given a desired semantic message formalized as a set of entity-relationship triples. It embeds…

Computation and Language · Computer Science 2022-11-17 Chris Alberti , Kuzman Ganchev , Michael Collins , Sebastian Gehrmann , Ciprian Chelba

SLM: Learning a Discourse Language Representation with Sentence Unshuffling

We introduce Sentence-level Language Modeling, a new pre-training objective for learning a discourse language representation in a fully self-supervised manner. Recent pre-training methods in NLP focus on learning either bottom or top-level…

Computation and Language · Computer Science 2020-11-02 Haejun Lee , Drew A. Hudson , Kangwook Lee , Christopher D. Manning

A Survey on Speech Large Language Models for Understanding

Speech understanding is essential for interpreting the diverse forms of information embedded in spoken language, including linguistic, paralinguistic, and non-linguistic cues that are vital for effective human-computer interaction. The…

Audio and Speech Processing · Electrical Eng. & Systems 2025-12-08 Jing Peng , Yucheng Wang , Bohan Li , Yiwei Guo , Hankun Wang , Yangui Fang , Yu Xi , Haoyu Li , Xu Li , Ke Zhang , Shuai Wang , Kai Yu

Semantic-based Pre-training for Dialogue Understanding

Pre-trained language models have made great progress on dialogue tasks. However, these models are typically trained on surface dialogue text, thus are proven to be weak in understanding the main semantic meaning of a dialogue context. We…

Computation and Language · Computer Science 2022-09-20 Xuefeng Bai , Linfeng Song , Yue Zhang

Structured Language Modeling for Speech Recognition

A new language model for speech recognition is presented. The model develops hidden hierarchical syntactic-like structure incrementally and uses it to extract meaningful information from the word history, thus complementing the locality of…

Computation and Language · Computer Science 2007-05-23 Ciprian Chelba , Frederick Jelinek

Convolutional Neural Network Architectures for Matching Natural Language Sentences

Semantic matching is of central importance to many natural language tasks \cite{bordes2014semantic,RetrievalQA}. A successful matching algorithm needs to adequately model the internal structures of language objects and the interaction…

Computation and Language · Computer Science 2015-03-12 Baotian Hu , Zhengdong Lu , Hang Li , Qingcai Chen

Neural Discourse Relation Recognition with Semantic Memory

Humans comprehend the meanings and relations of discourses heavily relying on their semantic memory that encodes general knowledge about concepts and facts. Inspired by this, we propose a neural recognizer for implicit discourse relation…

Computation and Language · Computer Science 2017-12-15 Biao Zhang , Deyi Xiong , Jinsong Su

An Overview on Language Models: Recent Developments and Outlook

Language modeling studies the probability distributions over strings of texts. It is one of the most fundamental tasks in natural language processing (NLP). It has been widely used in text generation, speech recognition, machine…

Computation and Language · Computer Science 2024-07-18 Chengwei Wei , Yun-Cheng Wang , Bin Wang , C. -C. Jay Kuo