Tensor-Efficient High-Dimensional Q-learning

Junyi Wu; Dan Li

Tensor-Efficient High-Dimensional Q-learning

Machine Learning 2026-04-09 v2 Systems and Control Systems and Control

Authors: Junyi Wu , Dan Li

Abstract

High-dimensional reinforcement learning(RL) faces challenges with complex calculations and low sample efficiency in large state-action spaces. Q-learning algorithms struggle particularly with the curse of dimensionality, where the number of state-action pairs grows exponentially with problem size. While neural network-based approaches like Deep Q-Networks have shown success, they do not explicitly exploit problem structure. Many high-dimensional control tasks exhibit low-rank structure in their value functions, and tensor-based methods using low-rank decomposition offer parameter-efficient representations. However, existing tensor-based Q-learning methods focus on representation fidelity without leveraging this structure for exploration. We propose Tensor-Efficient Q-Learning (TEQL), which represents the Q-function as a low-rank CP tensor over discretized state-action spaces and exploits the tensor structure for uncertainty-aware exploration. TEQL incorporates Error-Uncertainty Guided Exploration (EUGE), which combines tensor approximation error with visit counts to guide action selection, along with frequency-aware regularization to stabilize updates. Under matched parameter budgets, experiments on classic control tasks demonstrate that TEQL outperforms both matrix-based low-rank methods and deep RL baselines in sample efficiency, making it suitable for resource-constrained applications where sampling costs are high.

Keywords

tensor decomposition reinforcement learning hierarchical reinforcement learning

Cite

@article{arxiv.2511.03595,
  title  = {Tensor-Efficient High-Dimensional Q-learning},
  author = {Junyi Wu and Dan Li},
  journal= {arXiv preprint arXiv:2511.03595},
  year   = {2026}
}

Comments

61 pages, 7 figures. v2 updated to include additional experimental results and refined proofs

Tensor-Efficient High-Dimensional Q-learning

Abstract

Keywords

Cite

Comments

Related papers