Introducing n-Step Temporal-Difference Methods

Dissecting “Reinforcement Learning” by Richard S. Sutton with custom Python implementations, Episode V

Author:
