arXiv:2002.02794 [cs.LG]AbstractReferencesReviewsResources Classifications Subjects Themes Keywords reward function, reinforcement learning, reward-free exploration, agent first collects trajectories, natural policy gradient Tags Journal Information Publisher Journal Year Month Volume Number Pages DOI URL Miscellaneous Typesetting Pages Language License Submit Reset