Related papers: Validating Wordscores

200 papers

During the last fifteen years, automatic text scaling has become one of the key tools of the Text as Data community in political science. Prominent text scaling algorithms, however, rely on the assumption that latent positions can be…

Computation and Language · Computer Science 2021-10-15 Federico Nanni, Goran Glavas, Ines Rehbein, Simone Paolo Ponzetto, Heiner Stuckenschmidt

The political biases of Large Language Models (LLMs) are usually assessed by simulating their answers to English surveys. In this work, we propose an alternative framing of political biases, relying on principles of fairness in multilingual…

Computation and Language · Computer Science 2026-03-12 Paul Lerner, François Yvon

The increasing digitization of political speech has opened the door to studying a new dimension of political behavior using text analysis. This work investigates the value of word-level statistical data from the US Congressional…

General Economics · Economics 2018-09-05 Eitan Sapiro-Gheiler

Theories of democratic stability, populism, and party-system crisis often point to a form of polarization that comparative research rarely measures directly: hostile relations among political elites. Existing comparative measures capture…

Computation and Language · Computer Science 2026-05-12 Gennadii Iakovlev

Real-world knowledge representation often requires capturing subjective, continuous attributes -- such as political positions -- that conflict with pairwise validation, the widely accepted gold standard for human evaluation. We address this…

Computation and Language · Computer Science 2026-02-23 Jordan Robinson, Angus R. Williams, Katie Atkinson, Anthony G. Cohn

Public entities such as companies and politicians increasingly use online social networks to communicate directly with their constituencies. Often, this public messaging is aimed at aligning the entity with a particular cause or issue, such…

Computation and Language · Computer Science 2020-10-07 Zhao Wang, Jennifer Cutler, Aron Culotta

This study uses the semantic brand score, a novel measure of brand importance in big textual data, to forecast elections based on online news. About 35,000 online news articles were transformed into networks of co-occurring words and…

Social and Information Networks · Computer Science 2021-05-13 A. Fronzetti Colladon

Smart word substitution aims to enhance sentence quality by improving word choices; however, current benchmarks rely on human-labeled data. Since word choices are inherently subjective, ground-truth word substitutions generated by a small…

Computation and Language · Computer Science 2025-02-18 Hongye Liu, Ricardo Henao

The number of senses of a given word, or polysemy, is a very subjective notion, which varies widely across annotators and resources. We propose a novel method to estimate polysemy, based on simple geometry in the contextual embedding space.…

Computation and Language · Computer Science 2023-05-03 Christos Xypolopoulos, Antoine J.-P. Tixier, Michalis Vazirgiannis

Analysis of parliamentary speeches and political-party manifestos has become an integral area of computational study of political texts. While speeches have been overwhelmingly analysed using unsupervised methods, a large corpus of…

Computation and Language · Computer Science 2025-03-11 Dmitry Nikolaev, Sean Papay

Document coherence describes how much sense text makes in terms of its logical organisation and discourse flow. Even though coherence is a relatively difficult notion to quantify precisely, it can be approximated automatically. This type of…

Information Retrieval · Computer Science 2016-08-03 Christina Lioma, Fabien Tarissan, Jakob Grue Simonsen, Casper Petersen, Birger Larsen

Topic models extract representative word sets - called topics - from word counts in documents without requiring any semantic annotations. Topics are not guaranteed to be well interpretable, therefore, coherence measures have been proposed…

Machine Learning · Computer Science 2014-03-26 Frank Rosner, Alexander Hinneburg, Michael Röder, Martin Nettling, Andreas Both

Coherence of text is an important attribute to be measured for both manually and automatically generated discourse; but well-defined quantitative metrics for it are still elusive. In this paper, we present a metric for scoring topical…

Computation and Language · Computer Science 2018-09-05 Disha Shrivastava, Abhijit Mishra, Karthik Sankaranarayanan

This paper presents Semantic SentenceRank (SSR), an unsupervised scheme for automatically ranking sentences in a single document according to their relative importance. In particular, SSR extracts essential words and phrases from a text…

Information Retrieval · Computer Science 2020-05-06 Hao Zhang, Jie Wang

Despite the success of distributional semantics, composing phrases from word vectors remains an important challenge. Several methods have been tried for benchmark tasks such as sentiment classification, including word vector averaging,…

Computation and Language · Computer Science 2015-12-14 Pranjal Singh, Amitabha Mukerjee

We present Phrase-Verified Voting, a voter-verifiable remote voting system assembled from commercial off-the-shelf software for small private elections. The system is transparent and enables each voter to verify that the tally includes…

Cryptography and Security · Computer Science 2021-03-15 Enka Blanchard, Ryan Robucci, Ted Selker, Alan Sherman

Scaling analysis is a technique in computational political science that assigns a political actor (e.g. politician or party) a score on a predefined scale based on a (typically long) body of text (e.g. a parliamentary speech or an election…

Computation and Language · Computer Science 2023-10-20 Dmitry Nikolaev, Tanise Ceron, Sebastian Padó

In this paper, we discuss how machine learning could be used to produce a systematic and more objective political discourse analysis. Political footprints are vector space models (VSMs) applied to political discourse. Each of their vectors…

Computation and Language · Computer Science 2017-05-19 Christophe Bruchansky

Embedding words in a vector space has gained a lot of attention in recent years. While state-of-the-art methods provide efficient computation of word similarities via a low-dimensional matrix embedding, their motivation is often left…

Computation and Language · Computer Science 2016-09-29 Shihao Ji, Hyokun Yun, Pinar Yanardag, Shin Matsushima, S. V. N. Vishwanathan

Large language models show improved downstream task performance when prompted to generate step-by-step reasoning to justify their final answers. These reasoning steps greatly improve model interpretability and verification, but objectively…

Computation and Language · Computer Science 2023-09-13 Olga Golovneva, Moya Chen, Spencer Poff, Martin Corredor, Luke Zettlemoyer, Maryam Fazel-Zarandi, Asli Celikyilmaz