Learning to Generalize from Sparse and Underspecified Rewards

Rishabh Agarwal, Chen Liang, Dale Schuurmans, and Mohammad Norouzi. ICML 2019. (Poster PDF · Paper)