Kernel Subspace and Feature Extraction
Abstract
We study kernel methods in machine learning from the perspective of feature subspace. We establish a one-to-one correspondence between feature subspaces and kernels and propose an information-theoretic measure for kernels. In particular, we construct a kernel from Hirschfeld--Gebelein--R\'{e}nyi maximal correlation functions, coined the maximal correlation kernel, and demonstrate its information-theoretic optimality. We use the support vector machine (SVM) as an example to illustrate a connection between kernel methods and feature extraction approaches. We show that the kernel SVM on maximal correlation kernel achieves minimum prediction error. Finally, we interpret the Fisher kernel as a special maximal correlation kernel and establish its optimality.
Keywords
Cite
@article{arxiv.2301.01410,
title = {Kernel Subspace and Feature Extraction},
author = {Xiangxiang Xu and Lizhong Zheng},
journal= {arXiv preprint arXiv:2301.01410},
year = {2023}
}
Comments
ISIT 2023