First return, then explore

February 24, 2021

Nature, Published online: 24 February 2021; doi:10.1038/s41586-020-03157-9

A reinforcement learning algorithm that explicitly remembers promising states and returns to them as a basis for further exploration solves all previously unsolved Atari games and outperforms previous algorithms on Montezuma’s Revenge and Pitfall.
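
The "remember, return, explore" loop described above can be illustrated with a minimal sketch. This is not the paper's implementation: the one-dimensional environment, the function names `step` and `return_then_explore`, the visit-count weighting, and all parameter values are assumptions chosen only to make the idea concrete. The sketch keeps an archive of states seen so far, picks one, returns to it by replaying stored actions, and explores randomly from there.

```python
import random


def step(pos, action, size):
    """Hypothetical toy environment: move along a line of `size` cells, clamped at both ends."""
    return max(0, min(size - 1, pos + action))


def return_then_explore(size=30, iterations=200, explore_steps=5, seed=0):
    """Toy illustration only; every name and default here is an assumption."""
    rng = random.Random(seed)
    # Archive: state -> shortest known action sequence reaching it from the start (state 0).
    archive = {0: []}
    visits = {0: 0}
    for _ in range(iterations):
        # Select: favour remembered states that have been chosen less often.
        states = list(archive)
        weights = [1.0 / (1 + visits[s]) for s in states]
        chosen = rng.choices(states, weights=weights, k=1)[0]
        visits[chosen] += 1
        # Return: replay the stored actions (valid because this toy environment is deterministic).
        pos = 0
        for action in archive[chosen]:
            pos = step(pos, action, size)
        path = list(archive[chosen])
        # Explore: take random actions starting from the state we returned to.
        for _ in range(explore_steps):
            action = rng.choice((-1, 1))
            pos = step(pos, action, size)
            path.append(action)
            # Remember new states, or shorter routes to states already in the archive.
            if pos not in archive or len(path) < len(archive[pos]):
                archive[pos] = list(path)
                visits.setdefault(pos, 0)
    return archive
```

Calling `return_then_explore()` returns the archive, in which each key is a reached state and each value is an action sequence that leads to it from the start. Returning before exploring means each round of random exploration begins at a frontier the agent has already found, instead of starting over from the initial state.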
