English

PyKale: Knowledge-Aware Machine Learning from Multiple Sources in Python

Machine Learning 2021-06-21 v1 Artificial Intelligence Computer Vision and Pattern Recognition Machine Learning

Abstract

Machine learning is a general-purpose technology holding promises for many interdisciplinary research problems. However, significant barriers exist in crossing disciplinary boundaries when most machine learning tools are developed in different areas separately. We present Pykale - a Python library for knowledge-aware machine learning on graphs, images, texts, and videos to enable and accelerate interdisciplinary research. We formulate new green machine learning guidelines based on standard software engineering practices and propose a novel pipeline-based application programming interface (API). PyKale focuses on leveraging knowledge from multiple sources for accurate and interpretable prediction, thus supporting multimodal learning and transfer learning (particularly domain adaptation) with latest deep learning and dimensionality reduction models. We build PyKale on PyTorch and leverage the rich PyTorch ecosystem. Our pipeline-based API design enforces standardization and minimalism, embracing green machine learning concepts via reducing repetitions and redundancy, reusing existing resources, and recycling learning models across areas. We demonstrate its interdisciplinary nature via examples in bioinformatics, knowledge graph, image/video recognition, and medical imaging.

Keywords

Cite

@article{arxiv.2106.09756,
  title  = {PyKale: Knowledge-Aware Machine Learning from Multiple Sources in Python},
  author = {Haiping Lu and Xianyuan Liu and Robert Turner and Peizhen Bai and Raivo E Koot and Shuo Zhou and Mustafa Chasmai and Lawrence Schobs},
  journal= {arXiv preprint arXiv:2106.09756},
  year   = {2021}
}

Comments

This library is available at https://github.com/pykale/pykale