Related papers: Characteristic Characteristics

Traits and tangles: An analysis of the Big Five paradigm by tangle-based clustering

Using the recently developed mathematical theory of tangles, we re-assess the mathematical foundations for applications of the five factor model in personality tests by a new, mathematically rigorous, quantitative method. Our findings…

Neurons and Cognition · Quantitative Biology 2024-12-02 Hanno von Bergen, Reinhard Diestel

New Approach to Clustering Random Attributes

This paper proposes a new method for similarity analysis and, consequently, a new algorithm for clustering different types of random attributes, both numerical and nominal. However, in order for nominal attributes to be clustered, their…

Machine Learning · Computer Science 2024-12-16 Zenon Gniazdowski

Information based clustering

In an age of increasingly large data sets, investigators in many different disciplines have turned to clustering as a tool for data analysis and exploration. Existing clustering methods, however, typically depend on several nontrivial…

Quantitative Methods · Quantitative Biology 2009-11-11 Noam Slonim, Gurinder Singh Atwal, Gasper Tkacik, William Bialek

Computing Word Classes Using Spectral Clustering

Clustering a lexicon of words is a well-studied problem in natural language processing (NLP). Word clusters are used to deal with sparse data in statistical language processing, as well as features for solving various NLP tasks (text…

Computation and Language · Computer Science 2018-08-17 Effi Levi, Saggy Herman, Ari Rappoport

Using clustering of rankings to explain brand preferences with personality and socio-demographic variables

The primary aim of market segmentation is to identify relevant groups of consumers that can be addressed efficiently by marketing or advertising campaigns. This paper addresses the issue whether consumer groups can be identified from…

Applications · Statistics 2017-04-05 Daniel Müllensiefen, Christian Hennig, Hedie Howells

Bayesian Clustering Factor Models

We present a novel framework for concomitant dimension reduction and clustering. This framework is based on a novel class of Bayesian clustering factor models. These models assume a factor model structure where the vectors of common factors…

Methodology · Statistics 2025-05-09 Hwasoo Shin, Marco A. R. Ferreira, Allison N. Tegge

How many clusters? An information theoretic perspective

Clustering provides a common means of identifying structure in complex data, and there is renewed interest in clustering as a tool for the analysis of large data sets in many fields. A natural question is how many clusters are appropriate…

Data Analysis, Statistics and Probability · Physics 2007-05-23 Susanne Still, William Bialek

Machine learning methods fail to provide cohesive atheoretical construction of personality traits from semantic embeddings

The lexical hypothesis posits that personality traits are encoded in language and is foundational to models like the Big Five. We created a bottom-up personality model from a classic adjective list using machine learning and compared its…

Machine Learning · Computer Science 2025-10-14 Ayoub Bouguettaya, Elizabeth M. Stuart

The Five Factor Model of personality and evaluation of drug consumption risk

The problem of evaluating an individual's risk of drug consumption and misuse is highly important. An online survey methodology was employed to collect data including Big Five personality traits (NEO-FFI-R), impulsivity (BIS-11), sensation…

Applications · Statistics 2017-01-17 E. Fehrman, A. K. Muhammad, E. M. Mirkes, V. Egan, A. N. Gorban

Factor Adjusted Spectral Clustering for Mixture Models

This paper studies a factor modeling-based approach for clustering high-dimensional data generated from a mixture of strongly correlated variables. Statistical modeling with correlated structures pervades modern applications in economics,…

Statistics Theory · Mathematics 2024-08-23 Shange Tang, Soham Jana, Jianqing Fan

Cluster Analysis of Educational Data: an Example of Quantitative Study on the answers to an Open-Ended Questionnaire

In the last years many studies examined the consistency of students' answers in a variety of contexts. Some of these papers tried to develop more detailed models of the consistency of students' reasoning, or to subdivide a sample of…

Physics Education · Physics 2017-08-17 Onofrio Rosario Battaglia, Benedetto Di Paola, Claudio Fazio

Towards Spectroscopy: Susceptibility Clusters in Language Models

Spectroscopy infers the internal structure of physical systems by measuring their response to perturbations. We apply this principle to neural networks: perturbing the data distribution by upweighting a token $y$ in context $x$, we measure…

Machine Learning · Computer Science 2026-01-21 Andrew Gordon, Garrett Baker, George Wang, William Snell, Stan van Wingerden, Daniel Murfet

Identifying Privacy Personas

Privacy personas capture the differences in user segments with respect to one's knowledge, behavioural patterns, level of self-efficacy, and perception of the importance of privacy protection. Modelling these differences is essential for…

Machine Learning · Computer Science 2025-02-20 Olena Hrynenko, Andrea Cavallaro

Clustering Algorithms: A Comparative Approach

Many real-world systems can be studied in terms of pattern recognition tasks, so that proper use (and understanding) of machine learning methods in practical applications becomes essential. While a myriad of classification methods have been…

Machine Learning · Computer Science 2016-12-28 Mayra Z. Rodriguez, Cesar H. Comin, Dalcimar Casanova, Odemir M. Bruno, Diego R. Amancio, Francisco A. Rodrigues, Luciano da F. Costa

Element-centric clustering comparison unifies overlaps and hierarchy

Clustering is one of the most universal approaches for understanding complex data. A pivotal aspect of clustering analysis is quantitatively comparing clusterings; clustering comparison is the basis for many tasks such as clustering…

Machine Learning · Statistics 2019-06-13 Alexander J. Gates, Ian B. Wood, William P. Hetrick, Yong-Yeol Ahn

Clustered Factor Analysis for Multivariate Spatial Data

Factor analysis has been extensively used to reveal the dependence structures among multivariate variables, offering valuable insight in various fields. However, it cannot incorporate the spatial heterogeneity that is typically present in…

Methodology · Statistics 2024-11-14 Yanxiu Jin, Tomoya Wakayama, Renhe Jiang, Shonosuke Sugasawa

Clustering with Prototype Extraction for Census Data Analysis

Not long ago primary census data became available to publicity. It opened qualitatively new perspectives not only for researchers in demography and sociology, but also for those people, who somehow face processes occurring in society. In…

Databases · Computer Science 2011-06-28 Oleg Chertov, Marharyta Aleksandrova

Factor Analysis with Correlated Topic Model for Multi-Modal Data

Integrating various data modalities brings valuable insights into underlying phenomena. Multimodal factor analysis (FA) uncovers shared axes of variation underlying different simple data modalities, where each sample is represented by a…

Machine Learning · Computer Science 2025-04-29 Małgorzata Łazęcka, Ewa Szczurek

Peer groups for organisational learning: clustering with practical constraints

Peer-grouping is used in many sectors for organisational learning, policy implementation, and benchmarking. Clustering provides a statistical, data-driven method for constructing meaningful peer groups, but peer groups must be compatible…

Applications · Statistics 2021-07-14 Daniel William Kennedy, Jessica Cameron, Paul Pao-Yen Wu, Kerrie Mengersen

Spectral Feature Transformation for Person Re-identification

With the surge of deep learning techniques, the field of person re-identification has witnessed rapid progress in recent years. Deep learning based methods focus on learning a feature space where samples are clustered compactly according to…

Computer Vision and Pattern Recognition · Computer Science 2018-11-29 Chuanchen Luo, Yuntao Chen, Naiyan Wang, Zhaoxiang Zhang