That makes sense, and on rereading the post, the transparency section is clearer now, thanks! If I had to guess what gave me the wrong impression before, it would be this part:
its behavior can only be intelligent when it is exercised on the training distribution
I suspect that when I read this, I took it to imply “when it is not on the training distribution, its behavior cannot be intelligent”.
I also had trouble understanding that sub-clause. Maybe we read it in our head with the wrong emphasis:
Meaning: The agent gets inputs that are within the training distribution. ↔ The agent behaves intelligently.
But I guess it’s supposed to be:
Meaning: A behaviour is intelligent. ↔ The behaviour was exercised during training on the training distribution.