Autoencoder-based General Purpose Representation Learning for Customer Embedding

Jan Henrik Bertrand; David B. Hoffmann; Jacopo Pio Gargano; Laurent Mombaerts; Jonathan Taws

Autoencoder-based General Purpose Representation Learning for Customer Embedding

Machine Learning 2025-02-05 v2 Artificial Intelligence

Authors: Jan Henrik Bertrand , David B. Hoffmann , Jacopo Pio Gargano , Laurent Mombaerts , Jonathan Taws

Abstract

Recent advances in representation learning have successfully leveraged the underlying domain-specific structure of data across various fields. However, representing diverse and complex entities stored in tabular format within a latent space remains challenging. In this paper, we introduce DEEPCAE, a novel method for calculating the regularization term for multi-layer contractive autoencoders (CAEs). Additionally, we formalize a general-purpose entity embedding framework and use it to empirically show that DEEPCAE outperforms all other tested autoencoder variants in both reconstruction performance and downstream prediction performance. Notably, when compared to a stacked CAE across 13 datasets, DEEPCAE achieves a 34% improvement in reconstruction error.

Keywords

masked autoencoder variational autoencoder encoder-decoder architecture

Cite

@article{arxiv.2402.18164,
  title  = {Autoencoder-based General Purpose Representation Learning for Customer Embedding},
  author = {Jan Henrik Bertrand and David B. Hoffmann and Jacopo Pio Gargano and Laurent Mombaerts and Jonathan Taws},
  journal= {arXiv preprint arXiv:2402.18164},
  year   = {2025}
}

Comments

20 pages, 7 figures

Autoencoder-based General Purpose Representation Learning for Customer Embedding

Abstract

Keywords

Cite

Comments

Related papers