Related papers: Validating Wordscores

Political Text Scaling Meets Computational Semantics

During the last fifteen years, automatic text scaling has become one of the key tools of the Text as Data community in political science. Prominent text scaling algorithms, however, rely on the assumption that latent positions can be…

Computation and Language · Computer Science 2021-10-15 Federico Nanni , Goran Glavas , Ines Rehbein , Simone Paolo Ponzetto , Heiner Stuckenschmidt

Assessing the Political Fairness of Multilingual LLMs: A Case Study based on a 21-way Multiparallel EuroParl Dataset

The political biases of Large Language Models (LLMs) are usually assessed by simulating their answers to English surveys. In this work, we propose an alternative framing of political biases, relying on principles of fairness in multilingual…

Computation and Language · Computer Science 2026-03-12 Paul Lerner , François Yvon

"Read My Lips": Using Automatic Text Analysis to Classify Politicians by Party and Ideology

The increasing digitization of political speech has opened the door to studying a new dimension of political behavior using text analysis. This work investigates the value of word-level statistical data from the US Congressional…

General Economics · Economics 2018-09-05 Eitan Sapiro-Gheiler

Elite Polarization in European Parliamentary Speeches: a Novel Measurement Approach Using Large Language Models

Theories of democratic stability, populism, and party-system crisis often point to a form of polarization that comparative research rarely measures directly: hostile relations among political elites. Existing comparative measures capture…

Computation and Language · Computer Science 2026-05-12 Gennadii Iakovlev

Validating Political Position Predictions of Arguments

Real-world knowledge representation often requires capturing subjective, continuous attributes -- such as political positions -- that conflict with pairwise validation, the widely accepted gold standard for human evaluation. We address this…

Computation and Language · Computer Science 2026-02-23 Jordan Robinson , Angus R. Williams , Katie Atkinson , Anthony G. Cohn

Are Words Commensurate with Actions? Quantifying Commitment to a Cause from Online Public Messaging

Public entities such as companies and politicians increasingly use online social networks to communicate directly with their constituencies. Often, this public messaging is aimed at aligning the entity with a particular cause or issue, such…

Computation and Language · Computer Science 2020-10-07 Zhao Wang , Jennifer Cutler , Aron Culotta

Forecasting election results by studying brand importance in online news

This study uses the semantic brand score, a novel measure of brand importance in big textual data, to forecast elections based on online news. About 35,000 online news articles were transformed into networks of co-occurring words and…

Social and Information Networks · Computer Science 2021-05-13 A. Fronzetti Colladon

Learning to Substitute Words with Model-based Score Ranking

Smart word substitution aims to enhance sentence quality by improving word choices; however current benchmarks rely on human-labeled data. Since word choices are inherently subjective, ground-truth word substitutions generated by a small…

Computation and Language · Computer Science 2025-02-18 Hongye Liu , Ricardo Henao

Unsupervised Word Polysemy Quantification with Multiresolution Grids of Contextual Embeddings

The number of senses of a given word, or polysemy, is a very subjective notion, which varies widely across annotators and resources. We propose a novel method to estimate polysemy, based on simple geometry in the contextual embedding space.…

Computation and Language · Computer Science 2023-05-03 Christos Xypolopoulos , Antoine J. -P. Tixier , Michalis Vazirgiannis

Strategies for political-statement segmentation and labelling in unstructured text

Analysis of parliamentary speeches and political-party manifestos has become an integral area of computational study of political texts. While speeches have been overwhelmingly analysed using unsupervised methods, a large corpus of…

Computation and Language · Computer Science 2025-03-11 Dmitry Nikolaev , Sean Papay

Exploiting the Bipartite Structure of Entity Grids for Document Coherence and Retrieval

Document coherence describes how much sense text makes in terms of its logical organisation and discourse flow. Even though coherence is a relatively difficult notion to quantify precisely, it can be approximated automatically. This type of…

Information Retrieval · Computer Science 2016-08-03 Christina Lioma , Fabien Tarissan , Jakob Grue Simonsen , Casper Petersen , Birger Larsen

Evaluating topic coherence measures

Topic models extract representative word sets - called topics - from word counts in documents without requiring any semantic annotations. Topics are not guaranteed to be well interpretable, therefore, coherence measures have been proposed…

Machine Learning · Computer Science 2014-03-26 Frank Rosner , Alexander Hinneburg , Michael Röder , Martin Nettling , Andreas Both

Modeling Topical Coherence in Discourse without Supervision

Coherence of text is an important attribute to be measured for both manually and automatically generated discourse; but well-defined quantitative metrics for it are still elusive. In this paper, we present a metric for scoring topical…

Computation and Language · Computer Science 2018-09-05 Disha Shrivastava , Abhijit Mishra , Karthik Sankaranarayanan

An Unsupervised Semantic Sentence Ranking Scheme for Text Documents

This paper presents Semantic SentenceRank (SSR), an unsupervised scheme for automatically ranking sentences in a single document according to their relative importance. In particular, SSR extracts essential words and phrases from a text…

Information Retrieval · Computer Science 2020-05-06 Hao Zhang , Jie Wang

Words are not Equal: Graded Weighting Model for building Composite Document Vectors

Despite the success of distributional semantics, composing phrases from word vectors remains an important challenge. Several methods have been tried for benchmark tasks such as sentiment classification, including word vector averaging,…

Computation and Language · Computer Science 2015-12-14 Pranjal Singh , Amitabha Mukerjee

Phrase-Verified Voting: Verifiable Low-Tech Remote Boardroom Voting

We present Phrase-Verified Voting, a voter-verifiable remote voting system assembled from commercial off-the-shelf software for small private elections. The system is transparent and enables each voter to verify that the tally includes…

Cryptography and Security · Computer Science 2021-03-15 Enka Blanchard , Ryan Robucci , Ted Selker , Alan Sherman

Multilingual estimation of political-party positioning: From label aggregation to long-input Transformers

Scaling analysis is a technique in computational political science that assigns a political actor (e.g. politician or party) a score on a predefined scale based on a (typically long) body of text (e.g. a parliamentary speech or an election…

Computation and Language · Computer Science 2023-10-20 Dmitry Nikolaev , Tanise Ceron , Sebastian Padó

Political Footprints: Political Discourse Analysis using Pre-Trained Word Vectors

In this paper, we discuss how machine learning could be used to produce a systematic and more objective political discourse analysis. Political footprints are vector space models (VSMs) applied to political discourse. Each of their vectors…

Computation and Language · Computer Science 2017-05-19 Christophe Bruchansky

WordRank: Learning Word Embeddings via Robust Ranking

Embedding words in a vector space has gained a lot of attention in recent years. While state-of-the-art methods provide efficient computation of word similarities via a low-dimensional matrix embedding, their motivation is often left…

Computation and Language · Computer Science 2016-09-29 Shihao Ji , Hyokun Yun , Pinar Yanardag , Shin Matsushima , S. V. N. Vishwanathan

ROSCOE: A Suite of Metrics for Scoring Step-by-Step Reasoning

Large language models show improved downstream task performance when prompted to generate step-by-step reasoning to justify their final answers. These reasoning steps greatly improve model interpretability and verification, but objectively…

Computation and Language · Computer Science 2023-09-13 Olga Golovneva , Moya Chen , Spencer Poff , Martin Corredor , Luke Zettlemoyer , Maryam Fazel-Zarandi , Asli Celikyilmaz