Run them on examples such as frown-with-red-bar and smile-with-blue-bar.
That sounds like a black-box approach.
Which problems are you thinking of?
Human’s not knowing what goals we want AI to have and the riggability of the reward learning process. Which you stated were problems for CIRL in 2020.
That sounds like a black-box approach.
Human’s not knowing what goals we want AI to have and the riggability of the reward learning process. Which you stated were problems for CIRL in 2020.