Related papers: Constructing a Testbed for Psychometric Natural La…

Undesirable Biases in NLP: Addressing Challenges of Measurement

As Large Language Models and Natural Language Processing (NLP) technology rapidly develop and spread into daily life, it becomes crucial to anticipate how their use could harm people. One problem that has received a lot of attention in…

Computation and Language · Computer Science 2024-01-17 Oskar van der Wal, Dominik Bachmann, Alina Leidinger, Leendert van Maanen, Willem Zuidema, Katrin Schulz

Predicting Human Psychometric Properties Using Computational Language Models

Transformer-based language models (LMs) continue to achieve state-of-the-art performance on natural language processing (NLP) benchmarks, including tasks designed to mimic human-inspired "commonsense" competencies. To better understand the…

Computation and Language · Computer Science 2022-05-13 Antonio Laverghetta, Animesh Nighojkar, Jamshidbek Mirzakhalov, John Licato

Evaluating Large Language Models with Psychometrics

Large Language Models (LLMs) have demonstrated exceptional capabilities in solving various tasks, progressively evolving into general-purpose assistants. The increasing integration of LLMs into society has sparked interest in whether they…

Computation and Language · Computer Science 2025-10-20 Yuan Li, Yue Huang, Hongyi Wang, Ying Cheng, Xiangliang Zhang, James Zou, Lichao Sun

Assessment and manipulation of latent constructs in pre-trained language models using psychometric scales

Human-like personality traits have recently been discovered in large language models, raising the hypothesis that their (known and as yet undiscovered) biases conform with human latent psychological constructs. While large conversational…

Computation and Language · Computer Science 2025-01-14 Maor Reuben, Ortal Slobodin, Aviad Elyshar, Idan-Chaim Cohen, Orna Braun-Lewensohn, Odeya Cohen, Rami Puzis

Psychometric Alignment: Capturing Human Knowledge Distributions via Language Models

Language models (LMs) are increasingly used to simulate human-like responses in scenarios where accurately mimicking a population's behavior can guide decision-making, such as in developing educational materials and designing public…

Computation and Language · Computer Science 2024-07-23 Joy He-Yueya, Wanjing Anya Ma, Kanishk Gandhi, Benjamin W. Domingue, Emma Brunskill, Noah D. Goodman

Natural Language Processing, Sentiment Analysis and Clinical Analytics

Recent advances in Big Data have prompted health care practitioners to utilize the data available on social media to discern sentiment and emotions expression. Health Informatics and Clinical Analytics depend heavily on information gathered…

Computation and Language · Computer Science 2019-02-05 Adil Rajput

Improving LLM Leaderboards with Psychometrical Methodology

The rapid development of large language models (LLMs) has necessitated the creation of benchmarks to evaluate their performance. These benchmarks resemble human tests and surveys, as they consist of sets of questions designed to measure…

Computation and Language · Computer Science 2025-01-30 Denis Federiakin

Do Psychometric Tests Work for Large Language Models? Evaluation of Tests on Sexism, Racism, and Morality

Psychometric tests are increasingly used to assess psychological constructs in large language models (LLMs). However, it remains unclear whether these tests -- originally developed for humans -- yield meaningful results when applied to…

Computation and Language · Computer Science 2026-01-28 Jana Jung, Marlene Lutz, Indira Sen, Markus Strohmaier

MindShift: Analyzing Language Models' Reactions to Psychological Prompts

Large language models (LLMs) hold the potential to absorb and reflect personality traits and attitudes specified by users. In our study, we investigated this potential using robust psychometric measures. We adapted the most studied test in…

Computation and Language · Computer Science 2025-12-19 Anton Vasiliuk, Irina Abdullaeva, Polina Druzhinina, Anton Razzhigaev, Andrey Kuznetsov

Can Transformer Language Models Predict Psychometric Properties?

Transformer-based language models (LMs) continue to advance state-of-the-art performance on NLP benchmark tasks, including tasks designed to mimic human-inspired "commonsense" competencies. To better understand the degree to which LMs can…

Computation and Language · Computer Science 2021-06-15 Antonio Laverghetta, Animesh Nighojkar, Jamshidbek Mirzakhalov, John Licato

BEAMetrics: A Benchmark for Language Generation Evaluation Evaluation

Natural language processing (NLP) systems are increasingly trained to generate open-ended text rather than classifying between responses. This makes research on evaluation metrics for generated language -- functions that score system output…

Computation and Language · Computer Science 2021-10-19 Thomas Scialom, Felix Hill

Natural Language Processing for Dialects of a Language: A Survey

State-of-the-art natural language processing (NLP) models are trained on massive training corpora, and report a superlative performance on evaluation datasets. This survey delves into an important attribute of these datasets: the dialect of…

Computation and Language · Computer Science 2024-12-10 Aditya Joshi, Raj Dabre, Diptesh Kanojia, Zhuang Li, Haolan Zhan, Gholamreza Haffari, Doris Dippold

On Measures of Biases and Harms in NLP

Recent studies show that Natural Language Processing (NLP) technologies propagate societal biases about demographic groups associated with attributes such as gender, race, and nationality. To create interventions and mitigate these biases…

Computation and Language · Computer Science 2022-10-17 Sunipa Dev, Emily Sheng, Jieyu Zhao, Aubrie Amstutz, Jiao Sun, Yu Hou, Mattie Sanseverino, Jiin Kim, Akihiro Nishi, Nanyun Peng, Kai-Wei Chang

Automatic Metrics in Natural Language Generation: A Survey of Current Evaluation Practices

Automatic metrics are extensively used to evaluate natural language processing systems. However, there has been increasing focus on how they are used and reported by practitioners within the field. In this paper, we have conducted a survey…

Computation and Language · Computer Science 2024-08-20 Patrícia Schmidtová, Saad Mahamood, Simone Balloccu, Ondřej Dušek, Albert Gatt, Dimitra Gkatzia, David M. Howcroft, Ondřej Plátek, Adarsa Sivaprasad

From traces to measures: Large language models as a tool for psychological measurement from text

Large language models are increasingly being used to label or rate psychological features in text data. This approach helps address one of the limiting factors of digital trace data - their lack of an inherent target of measurement.…

Human-Computer Interaction · Computer Science 2024-10-15 Joseph J. P. Simons, Wong Liang Ze, Prasanta Bhattacharya, Brandon Siyuan Loh, Wei Gao

Predicting Performance for Natural Language Processing Tasks

Given the complexity of combinations of tasks, languages, and domains in natural language processing (NLP) research, it is computationally prohibitive to exhaustively test newly proposed models on each possible experimental setting. In this…

Computation and Language · Computer Science 2020-05-05 Mengzhou Xia, Antonios Anastasopoulos, Ruochen Xu, Yiming Yang, Graham Neubig

Automatic Generation of Behavioral Test Cases For Natural Language Processing Using Clustering and Prompting

Recent work in behavioral testing for natural language processing (NLP) models, such as Checklist, is inspired by related paradigms in software engineering testing. They allow evaluation of general linguistic capabilities and domain…

Computation and Language · Computer Science 2024-08-09 Ying Li, Rahul Singh, Tarun Joshi, Agus Sudjianto

Unveiling LLM Evaluation Focused on Metrics: Challenges and Solutions

Natural Language Processing (NLP) is witnessing a remarkable breakthrough driven by the success of Large Language Models (LLMs). LLMs have gained significant attention across academia and industry for their versatile applications in text…

Computation and Language · Computer Science 2024-04-16 Taojun Hu, Xiao-Hua Zhou

Modeling Subjectivity in Cognitive Appraisal with Language Models

As the utilization of language models in interdisciplinary, human-centered studies grows, expectations of their capabilities continue to evolve. Beyond excelling at conventional tasks, models are now expected to perform well on user-centric…

Computation and Language · Computer Science 2025-09-25 Yuxiang Zhou, Hainiu Xu, Desmond C. Ong, Maria Liakata, Petr Slovak, Yulan He

Reverse-Engineering the Reader

Numerous previous studies have sought to determine to what extent language models, pretrained on natural language text, can serve as useful models of human cognition. In this paper, we are interested in the opposite question: whether we can…

Computation and Language · Computer Science 2024-10-18 Samuel Kiegeland, Ethan Gotlieb Wilcox, Afra Amini, David Robert Reich, Ryan Cotterell