But why is that a reasonable assumption to make? Aren't you just assuming that the AI will play nice? I can see that there are some dangerous Oracles that your strategy would protect against, but there are also many that it wouldn't hinder at all.
>Aren’t you just assuming that the AI will play nice?
I'm assuming that the reward/utility function can be defined to be episodic. We hand the Oracle its utility, hence we can (in theory) construct it to be episodic (and train the Oracle in an episodic way, if we need to train it).
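To make "episodic" concrete, here is a minimal toy sketch in Python (all class and function names are placeholders invented for illustration, not anything from a real system): the return the Oracle is trained on is summed only within a single episode, so nothing it does can increase its reward in any later episode.

```python
import random

class ToyEpisodicEnv:
    """Toy stand-in for the question-answering environment (hypothetical)."""
    def __init__(self, horizon=5):
        self.horizon = horizon
        self.t = 0

    def reset(self):
        self.t = 0
        return 0

    def step(self, action):
        self.t += 1
        reward = random.random()        # placeholder reward signal
        done = self.t >= self.horizon   # episode ends at the horizon
        return self.t, reward, done

class ToyOracle:
    """Toy stand-in for the Oracle (hypothetical)."""
    def act(self, state):
        return 0

    def update(self, episodic_return):
        pass                            # learning update would go here

def run_episode(oracle, env):
    """Return for ONE episode only: rewards after `done` never count."""
    state = env.reset()
    episodic_return = 0.0
    done = False
    while not done:
        state, reward, done = env.step(oracle.act(state))
        episodic_return += reward
    return episodic_return

def train(oracle, env, num_episodes=100):
    for _ in range(num_episodes):
        # The objective handed to the Oracle is the single-episode return;
        # rewards beyond the episode boundary never enter it, which is the
        # "episodic" property being relied on here.
        oracle.update(run_episode(oracle, env))

train(ToyOracle(), ToyEpisodicEnv())
```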