English

Kernel Subspace and Feature Extraction

Machine Learning 2023-05-12 v2 Machine Learning

Abstract

We study kernel methods in machine learning from the perspective of feature subspace. We establish a one-to-one correspondence between feature subspaces and kernels and propose an information-theoretic measure for kernels. In particular, we construct a kernel from Hirschfeld--Gebelein--R\'{e}nyi maximal correlation functions, coined the maximal correlation kernel, and demonstrate its information-theoretic optimality. We use the support vector machine (SVM) as an example to illustrate a connection between kernel methods and feature extraction approaches. We show that the kernel SVM on maximal correlation kernel achieves minimum prediction error. Finally, we interpret the Fisher kernel as a special maximal correlation kernel and establish its optimality.

Keywords

Cite

@article{arxiv.2301.01410,
  title  = {Kernel Subspace and Feature Extraction},
  author = {Xiangxiang Xu and Lizhong Zheng},
  journal= {arXiv preprint arXiv:2301.01410},
  year   = {2023}
}

Comments

ISIT 2023