Decoding-based Regression

Xingyou Song; Dara Bahri

Decoding-based Regression

Machine Learning 2025-08-13 v2 Artificial Intelligence Computation and Language Machine Learning

Authors: Xingyou Song , Dara Bahri

Abstract

Language models have recently been shown capable of performing regression wherein numeric predictions are represented as decoded strings. In this work, we provide theoretical grounds for this capability and furthermore investigate the utility of causal sequence decoding models as numeric regression heads given any feature representation. We find that, despite being trained in the usual way - for next-token prediction via cross-entropy loss - decoder-based heads are as performant as standard pointwise heads when benchmarked over standard regression tasks, while being flexible enough to capture smooth numeric distributions, such as in the task of density estimation.

Keywords

natural language parsing encoder-decoder architecture parsing

Cite

@article{arxiv.2501.19383,
  title  = {Decoding-based Regression},
  author = {Xingyou Song and Dara Bahri},
  journal= {arXiv preprint arXiv:2501.19383},
  year   = {2025}
}

Comments

Published in Transactions on Machine Learning Research (TMLR) 2025. Code can be found at https://github.com/google-research/optformer/tree/main/optformer/decoding_regression

Decoding-based Regression

Abstract

Keywords

Cite

Comments

Related papers