English

Optimizing Trajectories with Closed-Loop Dynamic SQP

Optimization and Control 2022-05-06 v2

Abstract

Indirect trajectory optimization methods such as Differential Dynamic Programming (DDP) have found considerable success when only planning under dynamic feasibility constraints. Meanwhile, nonlinear programming (NLP) has been the state-of-the-art approach when faced with additional constraints (e.g., control bounds, obstacle avoidance). However, a nai¨\"ive implementation of NLP algorithms, e.g., shooting-based sequential quadratic programming (SQP), may suffer from slow convergence -- caused from natural instabilities of the underlying system manifesting as poor numerical stability within the optimization. Re-interpreting the DDP closed-loop rollout policy as a sensitivity-based correction to a second-order search direction, we demonstrate how to compute analogous closed-loop policies (i.e., feedback gains) for constrained problems. Our key theoretical result introduces a novel dynamic programming-based constraint-set recursion that augments the canonical "cost-to-go" backward pass. On the algorithmic front, we develop a hybrid-SQP algorithm incorporating DDP-style closed-loop rollouts, enabled via efficient parallelized computation of the feedback gains. Finally, we validate our theoretical and algorithmic contributions on a set of increasingly challenging benchmarks, demonstrating significant improvements in convergence speed over standard open-loop SQP.

Keywords

Cite

@article{arxiv.2109.07081,
  title  = {Optimizing Trajectories with Closed-Loop Dynamic SQP},
  author = {Sumeet Singh and Jean-Jacques Slotine and Vikas Sindhwani},
  journal= {arXiv preprint arXiv:2109.07081},
  year   = {2022}
}

Comments

To be presented at ICRA 2022