Learning Reward Machines for Partially Observable Reinforcement Learning

Rodrigo Toro Icarte, Ethan Waldie, Toryn Klassen, Rick Valenzano, Margarita Castro, and Sheila McIlraith. NeurIPS 2019. (Poster PDF · Paper)