very powerful and sample efficient learning algorithm
simple?
I’m unsure whether you are drawing attention to the word “sample.” If so, sample efficiency refers to the amount of experience an RL agent needs in order to perform well in an environment. See here.
Yup, this is what I meant.
simple?
I’m unsure whether you are drawing attention to the word “sample.” If so, sample efficiency refers to the amount of experience an RL agent needs in order to perform well in an environment. See here.
Yup, this is what I meant.