Policy Optimization Provably Converges to Nash Equilibria in Zero-Sum Linear Quadratic Games

Kaiqing Zhang, Zhuoran Yang, and Tamer Başar. NeurIPS 2019. (Poster PDF · Paper)