
From Data to the p-Adic or Ultrametric Model

Machine Learning 2011-01-11 v1 Applications

Abstract

We model anomaly and change in data by embedding the data in an ultrametric space. Taking our initial data as cross-tabulation counts (or other input data formats), Correspondence Analysis allows us to endow the information space with a Euclidean metric. We then model anomaly or change by an induced ultrametric. The induced ultrametric of particular interest takes a sequential (e.g. temporal) ordering of the data into account. We apply this work to the flow of narrative in the script of the film Casablanca, and to the evolution of the Colombian social conflict and violence between 1988 and 2004.
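As an illustration of how a sequence-aware ultrametric can be induced from points in a Euclidean space, here is a minimal sketch. It rests on assumptions not stated in the abstract: the input rows are taken to be factor coordinates already produced by Correspondence Analysis, and the ultrametric is induced by a complete-link agglomeration in which only neighbouring segments of the sequence may merge. The function names `sequence_constrained_ultrametric` and `is_ultrametric` are hypothetical.

```python
import math


def euclid(a, b):
    """Euclidean distance between two coordinate tuples."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def sequence_constrained_ultrametric(points):
    """Induce an ultrametric from ordered points (assumed method).

    Only adjacent segments of the sequence may merge, and the distance
    between two segments is the complete-link (maximum) distance.
    u[i][j] is the level at which items i and j first share a segment.
    """
    n = len(points)
    segments = [[i] for i in range(n)]  # contiguous runs, kept in order
    u = [[0.0] * n for _ in range(n)]
    while len(segments) > 1:
        best_k, best_d = None, math.inf
        for k in range(len(segments) - 1):
            d = max(
                euclid(points[i], points[j])
                for i in segments[k]
                for j in segments[k + 1]
            )
            if d < best_d:
                best_k, best_d = k, d
        left, right = segments[best_k], segments[best_k + 1]
        for i in left:
            for j in right:
                u[i][j] = u[j][i] = best_d
        segments[best_k:best_k + 2] = [left + right]
    return u


def is_ultrametric(u):
    """Check the strong triangle inequality u(i,k) <= max(u(i,j), u(j,k))."""
    n = len(u)
    return all(
        u[i][k] <= max(u[i][j], u[j][k])
        for i in range(n)
        for j in range(n)
        for k in range(n)
    )


# Toy sequence of four one-dimensional "factor coordinates".
toy_points = [(0.0,), (1.0,), (5.0,), (6.0,)]
toy_u = sequence_constrained_ultrametric(toy_points)
```

In this toy sequence the large ultrametric distance between the second and third items marks the point where the sequence changes most, which is the kind of signal the abstract describes as anomaly or change. Merge levels never decrease under this constrained complete-link rule, so the resulting distances satisfy the strong triangle inequality.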

Cite

@article{arxiv.0809.0492,
  title   = {From Data to the p-Adic or Ultrametric Model},
  author  = {Fionn Murtagh},
  journal = {arXiv preprint arXiv:0809.0492},
  year    = {2011}
}

Comments

15 pages, 6 figures. To appear in: Proceedings of Third International Conference on p-Adic Mathematical Physics: From Planck Scale Physics to Complex Systems to Biology, Steklov Mathematics Institute, Russian Academy of Sciences
