English

Approximate Bayesian Computation Based on Maxima Weighted Isolation Kernel Mapping

Machine Learning 2023-05-09 v3 Machine Learning Genomics Populations and Evolution

Abstract

Motivation: A branching processes model yields an unevenly stochastically distributed dataset that consists of sparse and dense regions. This work addresses the problem of precisely evaluating parameters for such a model. Applying a branching processes model to an area such as cancer cell evolution faces a number of obstacles, including high dimensionality and the rare appearance of a result of interest. We take on the ambitious task of obtaining the coefficients of a model that reflects the relationship of driver gene mutations and cancer hallmarks on the basis of personal data regarding variant allele frequencies. Results: An approximate Bayesian computation method based on Isolation Kernel is developed. The method involves the transformation of row data to a Hilbert space (mapping) and the measurement of the similarity between simulated points and maxima weighted Isolation Kernel mapping related to the observation point. We also design a heuristic algorithm for parameter estimation that requires no calculation and is dimension independent. The advantages of the proposed machine learning method are illustrated using multidimensional test data as well as a specific example focused on cancer cell evolution.

Keywords

Cite

@article{arxiv.2201.12745,
  title  = {Approximate Bayesian Computation Based on Maxima Weighted Isolation Kernel Mapping},
  author = {Iurii S. Nagornov},
  journal= {arXiv preprint arXiv:2201.12745},
  year   = {2023}
}