First return, then explore
Nature, Published online: 24 February 2021; doi:10.1038/s41586-020-03157-9
A reinforcement learning algorithm that explicitly remembers promising states and returns to them as a basis for further exploration solves all as-yet-unsolved Atari games and out-performs previous algorithms on Montezuma’s Revenge and Pitfall.
Comments
Post a Comment