Extrapolating Beyond Suboptimal Demonstrations via Inverse Reinforcement Learning from Observations

Daniel Brown, Wonjoon Goo, Prabhat Nagarajan, and Scott Niekum. ICML 2019.