English

Kernel computations from large-scale random features obtained by Optical Processing Units

Emerging Technologies 2020-06-23 v2 Machine Learning

Abstract

Approximating kernel functions with random features (RFs)has been a successful application of random projections for nonparametric estimation. However, performing random projections presents computational challenges for large-scale problems. Recently, a new optical hardware called Optical Processing Unit (OPU) has been developed for fast and energy-efficient computation of large-scale RFs in the analog domain. More specifically, the OPU performs the multiplication of input vectors by a large random matrix with complex-valued i.i.d. Gaussian entries, followed by the application of an element-wise squared absolute value operation - this last nonlinearity being intrinsic to the sensing process. In this paper, we show that this operation results in a dot-product kernel that has connections to the polynomial kernel, and we extend this computation to arbitrary powers of the feature map. Experiments demonstrate that the OPU kernel and its RF approximation achieve competitive performance in applications using kernel ridge regression and transfer learning for image classification. Crucially, thanks to the use of the OPU, these results are obtained with time and energy savings.

Keywords

Cite

@article{arxiv.1910.09880,
  title  = {Kernel computations from large-scale random features obtained by Optical Processing Units},
  author = {Ruben Ohana and Jonas Wacker and Jonathan Dong and Sébastien Marmin and Florent Krzakala and Maurizio Filippone and Laurent Daudet},
  journal= {arXiv preprint arXiv:1910.09880},
  year   = {2020}
}

Comments

5 pages, 3 figures, submitted to ICASSP 2020