Are there different classes of learning systems that optimize for the reward in different ways?
Yes: model-based approaches, model-free approaches (with or without a critic), and AIXI. Each of these should be analyzed in terms of its mechanistic details.
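
As a rough illustration of what "mechanistic details" could mean here, the sketch below contrasts where the reward signal enters a model-free update versus a model-based planner. Everything in it is an assumption made for illustration: the two-state toy environment, the function names `q_learning_update` and `plan_with_model`, and the hyperparameters are hypothetical and not taken from any particular system. AIXI is left out because it is incomputable.

```python
from collections import defaultdict

ACTIONS = ("stay", "go")
STATES = (0, 1)

# Hypothetical toy environment: deterministic transitions, reward only for staying in state 1.
TRANSITIONS = {(0, "stay"): 0, (0, "go"): 1, (1, "stay"): 1, (1, "go"): 0}
REWARDS = {(0, "stay"): 0.0, (0, "go"): 0.0, (1, "stay"): 1.0, (1, "go"): 0.0}


def q_learning_update(q, state, action, reward, next_state, alpha=0.5, gamma=0.9):
    """Model-free: the reward enters only through the TD error of one sampled transition."""
    best_next = max(q[(next_state, a)] for a in ACTIONS)
    td_error = reward + gamma * best_next - q[(state, action)]
    q[(state, action)] += alpha * td_error
    return td_error


def plan_with_model(transitions, rewards, gamma=0.9, sweeps=50):
    """Model-based: the reward is read from a model and used while planning (value iteration)."""
    values = {s: 0.0 for s in STATES}
    for _ in range(sweeps):
        values = {
            s: max(rewards[(s, a)] + gamma * values[transitions[(s, a)]] for a in ACTIONS)
            for s in STATES
        }
    return values


# Model-free: a single experienced transition nudges one table entry.
q_table = defaultdict(float)
td = q_learning_update(q_table, state=1, action="stay", reward=1.0, next_state=1)

# Model-based: the same reward information is consumed by sweeping over the whole model.
state_values = plan_with_model(TRANSITIONS, REWARDS)
```

In the first function the reward only ever shapes stored value estimates after the fact; in the second, the system explicitly evaluates predicted reward when choosing what to do. That difference in mechanism is the kind of thing the answer above suggests analyzing case by case.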