Self-Imitation Learning via Trajectory-Conditioned Policy for Hard-Exploration Tasks

Publication
Deep Reinforcement Learning Workshop in Neural Information Processing Systems Conference, 2019