Related papers: Approximately Independent Features of Languages

Features of word similarity

In this theoretical note we compare different types of computational models of word similarity and association in their ability to predict a set of about 900 rating data. Using regression and predictive modeling tools (neural net, decision…

Computation and Language · Computer Science 2018-08-27 Arthur M. Jacobs , Annette Kinder

Automatic Analysis of Linguistic Features in Journal Articles of Different Academic Impacts with Feature Engineering Techniques

English research articles (RAs) are an essential genre in academia, so the attempts to employ NLP to assist the development of academic writing ability have received considerable attention in the last two decades. However, there has been no…

Computation and Language · Computer Science 2021-11-16 Siyu Lei , Ruiying Yang , Chu-Ren Huang

Linguistic Minimal Pairs Elicit Linguistic Similarity in Large Language Models

We introduce a novel analysis that leverages linguistic minimal pairs to probe the internal linguistic representations of Large Language Models (LLMs). By measuring the similarity between LLM activation differences across minimal pairs, we…

Computation and Language · Computer Science 2024-12-16 Xinyu Zhou , Delong Chen , Samuel Cahyawijaya , Xufeng Duan , Zhenguang G. Cai

Exploring Language-Independent Emotional Acoustic Features via Feature Selection

We propose a novel feature selection strategy to discover language-independent acoustic features that tend to be responsible for emotions regardless of languages, linguistics and other factors. Experimental results suggest that the…

Machine Learning · Computer Science 2010-09-02 Arslan Shaukat , Ke Chen

Linguistic Dependencies and Statistical Dependence

Are pairs of words that tend to occur together also likely to stand in a linguistic dependency? This empirical question is motivated by a long history of literature in cognitive science, psycholinguistics, and NLP. In this work we…

Computation and Language · Computer Science 2022-05-02 Jacob Louis Hoover , Alessandro Sordoni , Wenyu Du , Timothy J. O'Donnell

A Probabilistic Generative Model of Linguistic Typology

In the principles-and-parameters framework, the structural features of languages depend on parameters that may be toggled on or off, with a single parameter often dictating the status of multiple features. The implied covariance between…

Computation and Language · Computer Science 2019-05-16 Johannes Bjerva , Yova Kementchedjhieva , Ryan Cotterell , Isabelle Augenstein

A Study of Language and Classifier-independent Feature Analysis for Vocal Emotion Recognition

Every speech signal carries implicit information about the emotions, which can be extracted by speech processing methods. In this paper, we propose an algorithm for extracting features that are independent from the spoken language and the…

Audio and Speech Processing · Electrical Eng. & Systems 2018-11-26 Fatemeh Noroozi , Marina Marjanovic , Angelina Njegus , Sergio Escalera , Gholamreza Anbarjafari

A Measure for Transparent Comparison of Linguistic Diversity in Multilingual NLP Data Sets

Typologically diverse benchmarks are increasingly created to track the progress achieved in multilingual NLP. Linguistic diversity of these data sets is typically measured as the number of languages or language families included in the…

Computation and Language · Computer Science 2024-04-17 Tanja Samardzic , Ximena Gutierrez , Christian Bentz , Steven Moran , Olga Pelloni

Language Identification with a Reciprocal Rank Classifier

Language identification is a critical component of language processing pipelines (Jauhiainen et al.,2019) and is not a solved problem in real-world settings. We present a lightweight and effective language identifier that is robust to…

Computation and Language · Computer Science 2021-09-22 Dominic Widdows , Chris Brew

Language Recognition using Random Indexing

Random Indexing is a simple implementation of Random Projections with a wide range of applications. It can solve a variety of problems with good accuracy without introducing much complexity. Here we use it for identifying the language of…

Computation and Language · Computer Science 2015-03-02 Aditya Joshi , Johan Halseth , Pentti Kanerva

Deep Lexical Hypothesis: Identifying personality structure in natural language

Recent advances in natural language processing (NLP) have produced general models that can perform complex tasks such as summarizing long passages and translating across languages. Here, we introduce a method to extract adjective…

Computation and Language · Computer Science 2022-03-07 Andrew Cutler , David M. Condon

Machine individuality: Separating genuine idiosyncrasy from response bias in large language models

As large language models (LLMs) are increasingly integrated into daily life, in roles ranging from high-stakes decision support to companionship, understanding their behavioral dispositions becomes critical. A growing literature uses…

Artificial Intelligence · Computer Science 2026-04-22 Valentin Kriegmair , Dirk U. Wulff

On the Use of Linguistic Features for the Evaluation of Generative Dialogue Systems

Automatically evaluating text-based, non-task-oriented dialogue systems (i.e., `chatbots') remains an open problem. Previous approaches have suffered challenges ranging from poor correlation with human judgment to poor generalization and…

Computation and Language · Computer Science 2021-04-14 Ian Berlot-Attwell , Frank Rudzicz

Feature Specific Sentiment Analysis for Product Reviews

In this paper, we present a novel approach to identify feature specific expressions of opinion in product reviews with different features and mixed emotions. The objective is realized by identifying a set of potential features in the review…

Information Retrieval · Computer Science 2012-09-19 Subhabrata Mukherjee , Pushpak Bhattacharyya

Quantifying Dependence Between Random Vectors: A New Index with Applications

This article proposes a new index for quantifying the degree of dependence between random vectors. The index takes values in [0,1] and equals zero if and only if the random vectors are sub-independent. Unlike mere uncorrelatedness,…

Statistics Theory · Mathematics 2026-05-19 Chuancun yin

Feature-Refined Unsupervised Model for Loanword Detection

We propose an unsupervised method for detecting loanwords i.e., words borrowed from one language into another. While prior work has primarily relied on language-external information to identify loanwords, such approaches can introduce…

Computation and Language · Computer Science 2025-08-26 Promise Dodzi Kpoglu

A tentative model for dimensionless phoneme distance from binary distinctive features

This work proposes a tentative model for the calculation of dimensionless distances between phonemes; sounds are described with binary distinctive features and distances show linear consistency in terms of such features. The model can be…

Computation and Language · Computer Science 2016-11-03 Tiago Tresoldi

A Massively Multilingual Analysis of Cross-linguality in Shared Embedding Space

In cross-lingual language models, representations for many different languages live in the same space. Here, we investigate the linguistic and non-linguistic factors affecting sentence-level alignment in cross-lingual pretrained language…

Computation and Language · Computer Science 2021-09-15 Alex Jones , William Yang Wang , Kyle Mahowald

Universal and Independent: Multilingual Probing Framework for Exhaustive Model Interpretation and Evaluation

Linguistic analysis of language models is one of the ways to explain and describe their reasoning, weaknesses, and limitations. In the probing part of the model interpretability research, studies concern individual languages as well as…

Computation and Language · Computer Science 2022-10-25 Oleg Serikov , Vitaly Protasov , Ekaterina Voloshina , Viktoria Knyazkova , Tatiana Shavrina

Language-Independent Sentiment Analysis Using Subjectivity and Positional Information

We describe a novel language-independent approach to the task of determining the polarity, positive or negative, of the author's opinion on a specific topic in natural language text. In particular, weights are assigned to attributes,…

Computation and Language · Computer Science 2019-12-02 Veselin Raychev , Preslav Nakov