Julian Schrittwieser comments on REPL’s: a type signature for agents

Julian Schrittwieser 17 Feb 2022 14:03 UTC
8 points
Could you explain how this differs from the standard Reinforcement Learning formulation? (See eg. http://incompleteideas.net/book/first/ebook/node28.html for an introduction)