English
Related papers

Related papers: Word statistics in Blogs and RSS feeds: Towards em…

200 papers

Collective human behaviors are analyzed using the time series of word appearances in blogs. As expected, we confirm that the number of fluctuations is approximated by a Poisson distribution for very-low-frequency words. A non-trivial…

Physics and Society · Physics 2009-12-11 Y. Sano , K. Kaski , M. Takayasu

To uncover underlying mechanism of collective human dynamics, we survey more than 1.8 billion blog entries and observe the statistical properties of word appearances. We focus on words that show dynamic growth and decay with a tendency to…

Physics and Society · Physics 2015-03-19 Yukie Sano , Kenta Yamada , Hayafumi Watanabe , Hideki Takayasu , Misako Takayasu

We observe the statistical properties of blogs that are expected to reflect social human interaction. Firstly, we introduce a basic normalization preprocess that enables us to evaluate the genuine word frequency in blogs that are…

Physics and Society · Physics 2010-04-09 Yukie Sano , Misako Takayasu

To elucidate the non-trivial empirical statistical properties of fluctuations of a typical non-steady time series representing the appearance of words in blogs, we investigated approximately five billion Japanese blogs over a period of six…

Physics and Society · Physics 2016-12-05 Hayafumi Watanabe , Yukie Sano , Hideki Takayasu , Misako Takayasu

Background: Zipf's discovery that word frequency distributions obey a power law established parallels between biological and physical processes, and language, laying the groundwork for a complex systems perspective on human communication.…

Computation and Language · Computer Science 2009-11-11 Eduardo G. Altmann , Janet B. Pierrehumbert , Adilson E. Motter

On-line communities offer a great opportunity to investigate human dynamics, because much information about individuals is registered in databases. In this paper, based on data statistics of online comments on Blog posts, we first present…

Social and Information Networks · Computer Science 2010-09-28 Jin-Li Guo

What dynamics govern a time series representing the appearance of words in social media data? In this paper, we investigate an elementary dynamics, from which word-dependent special effects are segregated, such as breaking news, increasing…

Physics and Society · Physics 2017-07-31 Hayafumi Watanabe

Ultraslow diffusion (i.e. logarithmic diffusion) has been extensively studied theoretically, but has hardly been observed empirically. In this paper, firstly, we find the ultraslow-like diffusion of the time-series of word counts of already…

Physics and Society · Physics 2018-07-25 Hayafumi Watanabe

The rate of occurrence of words is not uniform but varies from document to document. Despite this observation, parameters for conventional n-gram language models are usually derived using the assumption of a constant word rate. In this…

Computation and Language · Computer Science 2007-05-23 Yoshihiko Gotoh , Steve Renals

The distribution of frequency counts of distinct words by length in a language's vocabulary will be analyzed using two methods. The first, will look at the empirical distributions of several languages and derive a distribution that…

Computation and Language · Computer Science 2012-07-17 Reginald D. Smith

This paper describes the analysis of quantitative characteristics of frequent sets and association rules in the posts of Twitter microblogs related to different event discussions. For the analysis, we used a theory of frequent sets,…

Social and Information Networks · Computer Science 2013-10-15 Bohdan Pavlyshenko

Online social media such as the micro-blogging site Twitter has become a rich source of real-time data on online human behaviors. Here we analyze the occurrence and co-occurrence frequency of keywords in user posts on Twitter. From the…

Physics and Society · Physics 2014-01-17 Joachim Mathiesen , Luiza Angheluta , Mogens H. Jensen

In many complex systems studied in statistical physics, inter-arrival times between events such as solar flares, trades and neuron voltages follow a heavy-tailed distribution. The set of event times is fractal-like, being dense in some time…

Statistics Theory · Mathematics 2020-09-16 Katharina Hees , Smarak Nayak , Peter Straka

We study the dynamics of public media attention by monitoring the content of online blogs. Social and media events can be traced by the propagation of word frequencies of related keywords. Media events are classified as exogenous - where…

Physics and Society · Physics 2015-05-27 Peter Klimek , Werner Bayer , Stefan Thurner

Current models for opinion dynamics typically utilize a Poisson process for speaker selection, making the waiting time between events exponentially distributed. Human interaction tends to be bursty, though, having higher probabilities of…

Physics and Society · Physics 2017-07-28 Casey Doyle , Boleslaw Szymanski , Gyorgy Korniss

It is part of our daily social-media experience that seemingly ordinary items (videos, news, publications, etc.) unexpectedly gain an enormous amount of attention. Here we investigate how unexpected these events are. We propose a method…

Physics and Society · Physics 2014-12-09 José M. Miotto , Eduardo G. Altmann

Inspired by previous works on human dynamics, we collect the temporal statistics of the article creation by three Western scientists and an Eastern writer. We investigate the distributions of the time intervals between the creations of…

Physics and Society · Physics 2012-04-02 Na Li , Han Yan , Wen-Yao Zhang , Yu-Jian Li , Zhen-Dong Xi , Bing-Hong Wang

The massive diffusion of online social media allows for the rapid and uncontrolled spreading of conspiracy theories, hoaxes, unsubstantiated claims, and false news. Such an impressive amount of misinformation can influence policy…

Social and Information Networks · Computer Science 2017-02-01 Alessandro Bessi

Recent observations in the theory of verse and empirical metrics have suggested that constructing a verse line involves a pattern-matching search through a source text, and that the number of found elements (complete words totaling a…

cmp-lg · Computer Science 2007-05-23 Hideaki Aoyama , John Constable

Weblog is the fourth way of network exchange after Email, BBS and MSN. Most bloggers begin to write blogs with great interest, and then their interests gradually achieve a balance with the passage of time. In order to describe the…

Social and Information Networks · Computer Science 2015-05-19 Jin-Li Guo
‹ Prev 1 2 3 10 Next ›