Reward Constrained Policy Optimization

Chen Tessler, Daniel J. Mankowitz, and Shie Mannor. ICLR 2019.