Simplex Deep Linear Discriminant Analysis

Maxat Tezekbayev; Arman Bolatov; Zhenisbek Assylbekov

Simplex Deep Linear Discriminant Analysis

Machine Learning 2026-02-23 v2 Machine Learning

Authors: Maxat Tezekbayev , Arman Bolatov , Zhenisbek Assylbekov

Abstract

We revisit Deep Linear Discriminant Analysis (Deep LDA) from a likelihood-based perspective. While classical LDA is a simple Gaussian model with linear decision boundaries, attaching an LDA head to a neural encoder raises the question of how to train the resulting deep classifier by maximum likelihood estimation (MLE). We first show that end-to-end MLE training of an unconstrained Deep LDA model ignores discrimination: when both the LDA parameters and the encoder parameters are learned jointly, the likelihood admits a degenerate solution in which some of the class clusters may heavily overlap or even collapse, and classification performance deteriorates. Batchwise moment re-estimation of the LDA parameters does not remove this failure mode. We then propose a constrained Deep LDA formulation that fixes the class means to the vertices of a regular simplex in the latent space and restricts the shared covariance to be spherical, leaving only the priors and a single variance parameter to be learned along with the encoder. Under these geometric constraints, MLE becomes stable and yields well-separated class clusters in the latent space. On images (Fashion-MNIST, CIFAR-10, CIFAR-100) and texts (AG News, CLINC150), the resulting Deep LDA models achieve accuracy competitive with softmax baselines while offering a simple, interpretable latent geometry that is clearly visible in two-dimensional projections.

Keywords

discriminant analysis and canonical correlation analysis deep learning multi-task learning

Cite

@article{arxiv.2601.01679,
  title  = {Simplex Deep Linear Discriminant Analysis},
  author = {Maxat Tezekbayev and Arman Bolatov and Zhenisbek Assylbekov},
  journal= {arXiv preprint arXiv:2601.01679},
  year   = {2026}
}

Comments

Accepted at CPAL 2026. Camera-ready version

Simplex Deep Linear Discriminant Analysis

Abstract

Keywords

Cite

Comments

Related papers