First return, then explore

Nature, Published online: 24 February 2021; doi:10.1038/s41586-020-03157-9

A reinforcement learning algorithm that explicitly remembers promising states and returns to them as a basis for further exploration solves all as-yet-unsolved Atari games and out-performs previous algorithms on Montezuma’s Revenge and Pitfall.

Comments

Popular posts from this blog

Photo Of The Day By Valerie Millett