Triangular clustering in document networks

Physics and Society 2009-03-20 v4

Abstract

Document networks are distinctive in that each node, e.g. a webpage or an article, carries meaningful content. Their properties are affected not only by the topological connectivity between nodes but also, strongly, by the semantic relation between the nodes' content. We observe that document networks have a large number of triangles and a high clustering coefficient, and that the probability of a triangle forming correlates strongly with the content similarity among the three nodes involved. We propose the degree-similarity product (DSP) model, which reproduces these properties well. The model achieves this through a preferential attachment mechanism that favours links between nodes that are both popular and similar. This work is a step towards a better understanding of the structure and evolution of document networks.
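The quantities named in the abstract can be illustrated with a minimal Python sketch. It counts triangles, computes the average clustering coefficient of a small undirected graph, and then derives attachment probabilities for a new node. The abstract does not state the model's formula, so the weighting rule used here (degree multiplied by content similarity, then normalised) is an assumption inferred from the name "degree-similarity product". The function names, the toy graph and the similarity values are all hypothetical.

```python
from itertools import combinations


def count_triangles(adj):
    """Count triangles in an undirected graph given as {node: set_of_neighbours}."""
    seen = 0
    for nbrs in adj.values():
        for u, v in combinations(nbrs, 2):
            if v in adj[u]:
                seen += 1
    return seen // 3  # every triangle is seen once from each of its three corners


def local_clustering(adj, node):
    """Fraction of a node's neighbour pairs that are themselves linked."""
    k = len(adj[node])
    if k < 2:
        return 0.0
    links = sum(1 for u, v in combinations(adj[node], 2) if v in adj[u])
    return 2.0 * links / (k * (k - 1))


def average_clustering(adj):
    """Mean of the local clustering coefficients over all nodes."""
    return sum(local_clustering(adj, n) for n in adj) / len(adj)


def dsp_probabilities(adj, similarity_to_new):
    """Assumed DSP-style rule: weight = degree * similarity, normalised to sum to 1.

    similarity_to_new maps each existing node to its (hypothetical) content
    similarity with the node being added.
    """
    weights = {n: len(adj[n]) * similarity_to_new[n] for n in adj}
    total = sum(weights.values())
    if total == 0:
        return {n: 0.0 for n in adj}
    return {n: w / total for n, w in weights.items()}


# Toy graph: a triangle (a, b, c) with a pendant node d attached to c.
graph = {
    "a": {"b", "c"},
    "b": {"a", "c"},
    "c": {"a", "b", "d"},
    "d": {"c"},
}
similarity = {"a": 0.5, "b": 0.5, "c": 1.0, "d": 0.0}  # made-up values

triangles = count_triangles(graph)
avg_c = average_clustering(graph)
probs = dsp_probabilities(graph, similarity)
```

In this toy case node c is both the best connected and the most similar, so it receives the largest attachment probability, while d is never chosen because its similarity is zero. A pure degree-based rule would still give d a nonzero chance, which is the contrast the abstract points to.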

Cite

@article{arxiv.0807.2113,
  title  = {Triangular clustering in document networks},
  author = {Xue-qi Cheng and Fu-xin Ren and Shi Zhou and Mao-Bin Hu},
  journal= {arXiv preprint arXiv:0807.2113},
  year   = {2009}
}

Comments

10 pages, 3 figures, 2 tables