English

Causality for Natural Language Processing

Computation and Language 2025-04-22 v1 Artificial Intelligence Computers and Society Machine Learning

Abstract

Causal reasoning is a cornerstone of human intelligence and a critical capability for artificial systems aiming to achieve advanced understanding and decision-making. This thesis delves into various dimensions of causal reasoning and understanding in large language models (LLMs). It encompasses a series of studies that explore the causal inference skills of LLMs, the mechanisms behind their performance, and the implications of causal and anticausal learning for natural language processing (NLP) tasks. Additionally, it investigates the application of causal reasoning in text-based computational social science, specifically focusing on political decision-making and the evaluation of scientific impact through citations. Through novel datasets, benchmark tasks, and methodological frameworks, this work identifies key challenges and opportunities to improve the causal capabilities of LLMs, providing a comprehensive foundation for future research in this evolving field.

Keywords

Cite

@article{arxiv.2504.14530,
  title  = {Causality for Natural Language Processing},
  author = {Zhijing Jin},
  journal= {arXiv preprint arXiv:2504.14530},
  year   = {2025}
}

Comments

PhD Thesis 2024