Related papers: Knowledge-Augmented Language Models for Cause-Effe…

Improving Commonsense Causal Reasoning by Adversarial Training and Data Augmentation

Determining the plausibility of causal relations between clauses is a commonsense reasoning task that requires complex inference ability. The general approach to this task is to train a large pretrained language model on a specific dataset.…

Computation and Language · Computer Science 2021-01-14 Ieva Staliūnaitė , Philip John Gorinski , Ignacio Iacobacci

Teaching Pretrained Models with Commonsense Reasoning: A Preliminary KB-Based Approach

Recently, pretrained language models (e.g., BERT) have achieved great success on many downstream natural language understanding tasks and exhibit a certain level of commonsense reasoning ability. However, their performance on commonsense…

Artificial Intelligence · Computer Science 2023-02-17 Shiyang Li , Jianshu Chen , Dian Yu

Evaluating Commonsense in Pre-trained Language Models

Contextualized representations trained over large raw text data have given remarkable improvements for NLP tasks including question answering and reading comprehension. There have been works showing that syntactic, semantic and word sense…

Computation and Language · Computer Science 2021-02-12 Xuhui Zhou , Yue Zhang , Leyang Cui , Dandan Huang

Automatic Knowledge Augmentation for Generative Commonsense Reasoning

Generative commonsense reasoning is the capability of a language model to generate a sentence with a given concept-set that is based on commonsense knowledge. However, generative language models still struggle to provide outputs, and the…

Computation and Language · Computer Science 2021-11-02 Jaehyung Seo , Chanjun Park , Sugyeong Eo , Hyeonseok Moon , Heuiseok Lim

A Knowledge-Enhanced Pretraining Model for Commonsense Story Generation

Story generation, namely generating a reasonable story from a leading context, is an important but challenging task. In spite of the success in modeling fluency and local coherence, existing neural language generation models (e.g., GPT-2)…

Computation and Language · Computer Science 2020-01-16 Jian Guan , Fei Huang , Zhihao Zhao , Xiaoyan Zhu , Minlie Huang

Generated Knowledge Prompting for Commonsense Reasoning

It remains an open question whether incorporating external knowledge benefits commonsense reasoning while maintaining the flexibility of pretrained sequence models. To investigate this question, we develop generated knowledge prompting,…

Computation and Language · Computer Science 2022-09-30 Jiacheng Liu , Alisa Liu , Ximing Lu , Sean Welleck , Peter West , Ronan Le Bras , Yejin Choi , Hannaneh Hajishirzi

Go Beyond Plain Fine-tuning: Improving Pretrained Models for Social Commonsense

Pretrained language models have demonstrated outstanding performance in many NLP tasks recently. However, their social intelligence, which requires commonsense reasoning about the current situation and mental states of others, is still…

Computation and Language · Computer Science 2021-05-14 Ting-Yun Chang , Yang Liu , Karthik Gopalakrishnan , Behnam Hedayatnia , Pei Zhou , Dilek Hakkani-Tur

How Additional Knowledge can Improve Natural Language Commonsense Question Answering?

Recently several datasets have been proposed to encourage research in Question Answering domains where commonsense knowledge is expected to play an important role. Recent language models such as ROBERTA, BERT and GPT that have been…

Computation and Language · Computer Science 2020-04-20 Arindam Mitra , Pratyay Banerjee , Kuntal Kumar Pal , Swaroop Mishra , Chitta Baral

Commonsense Knowledge Reasoning and Generation with Pre-trained Language Models: A Survey

While commonsense knowledge acquisition and reasoning has traditionally been a core research topic in the knowledge representation and reasoning community, recent years have seen a surge of interest in the natural language processing…

Computation and Language · Computer Science 2022-02-01 Prajjwal Bhargava , Vincent Ng

Does Pre-training Induce Systematic Inference? How Masked Language Models Acquire Commonsense Knowledge

Transformer models pre-trained with a masked-language-modeling objective (e.g., BERT) encode commonsense knowledge as evidenced by behavioral probes; however, the extent to which this knowledge is acquired by systematic inference over the…

Computation and Language · Computer Science 2021-12-17 Ian Porada , Alessandro Sordoni , Jackie Chi Kit Cheung

CoCoLM: COmplex COmmonsense Enhanced Language Model with Discourse Relations

Large-scale pre-trained language models have demonstrated strong knowledge representation ability. However, recent studies suggest that even though these giant models contains rich simple commonsense knowledge (e.g., bird can fly and fish…

Computation and Language · Computer Science 2022-05-27 Changlong Yu , Hongming Zhang , Yangqiu Song , Wilfred Ng

Probing Commonsense Knowledge in Pre-trained Language Models with Sense-level Precision and Expanded Vocabulary

Progress on commonsense reasoning is usually measured from performance improvements on Question Answering tasks designed to require commonsense knowledge. However, fine-tuning large Language Models (LMs) on these specific tasks does not…

Computation and Language · Computer Science 2022-10-13 Daniel Loureiro , Alípio Mário Jorge

Align, Mask and Select: A Simple Method for Incorporating Commonsense Knowledge into Language Representation Models

The state-of-the-art pre-trained language representation models, such as Bidirectional Encoder Representations from Transformers (BERT), rarely incorporate commonsense knowledge or other knowledge explicitly. We propose a pre-training…

Computation and Language · Computer Science 2020-05-07 Zhi-Xiu Ye , Qian Chen , Wen Wang , Zhen-Hua Ling

Commonsense Knowledge Mining from Pretrained Models

Inferring commonsense knowledge is a key challenge in natural language processing, but due to the sparsity of training data, previous work has shown that supervised methods for commonsense knowledge mining underperform when evaluated on…

Computation and Language · Computer Science 2019-09-15 Joshua Feldman , Joe Davison , Alexander M. Rush

KG-BART: Knowledge Graph-Augmented BART for Generative Commonsense Reasoning

Generative commonsense reasoning which aims to empower machines to generate sentences with the capacity of reasoning over a set of concepts is a critical bottleneck for text generation. Even the state-of-the-art pre-trained language…

Computation and Language · Computer Science 2021-01-22 Ye Liu , Yao Wan , Lifang He , Hao Peng , Philip S. Yu

Advancing Transformer's Capabilities in Commonsense Reasoning

Recent advances in general purpose pre-trained language models have shown great potential in commonsense reasoning. However, current works still perform poorly on standard commonsense reasoning benchmarks including the Com2Sense Dataset. We…

Computation and Language · Computer Science 2023-10-11 Yu Zhou , Yunqiu Han , Hanyu Zhou , Yulun Wu

Cause and Effect: Can Large Language Models Truly Understand Causality?

With the rise of Large Language Models(LLMs), it has become crucial to understand their capabilities and limitations in deciphering and explaining the complex web of causal relationships that language entails. Current methods use either…

Computation and Language · Computer Science 2024-10-01 Swagata Ashwani , Kshiteesh Hegde , Nishith Reddy Mannuru , Mayank Jindal , Dushyant Singh Sengar , Krishna Chaitanya Rao Kathala , Dishant Banga , Vinija Jain , Aman Chadha

When Choosing Plausible Alternatives, Clever Hans can be Clever

Pretrained language models, such as BERT and RoBERTa, have shown large improvements in the commonsense reasoning benchmark COPA. However, recent work found that many improvements in benchmarks of natural language understanding are not due…

Computation and Language · Computer Science 2019-11-04 Pride Kavumba , Naoya Inoue , Benjamin Heinzerling , Keshav Singh , Paul Reisert , Kentaro Inui

LINKED: Eliciting, Filtering and Integrating Knowledge in Large Language Model for Commonsense Reasoning

Large language models (LLMs) sometimes demonstrate poor performance on knowledge-intensive tasks, commonsense reasoning is one of them. Researchers typically address these issues by retrieving related knowledge from knowledge graphs or…

Computation and Language · Computer Science 2024-10-15 Jiachun Li , Pengfei Cao , Chenhao Wang , Zhuoran Jin , Yubo Chen , Kang Liu , Xiaojian Jiang , Jiexin Xu , Jun Zhao

Commonsense-Aware Prompting for Controllable Empathetic Dialogue Generation

Improving the emotional awareness of pre-trained language models is an emerging important problem for dialogue generation tasks. Although prior studies have introduced methods to improve empathetic dialogue generation, few have discussed…

Computation and Language · Computer Science 2023-02-06 Yiren Liu , Halil Kilicoglu