You are right; my comment was based on a misunderstanding of what you were saying. Hence why I unendorsed it.
(I read ” In this post, I will outline a general category of agents which may exhibit malign generalization without internal search, and then will provide a concrete example of an agent in the category. Then I will argue that, rather than being a very narrow counterexample, this class of agents could be competitive with search-based agents. ” and thought you meant agents that don’t use internal search at all.)
You are right; my comment was based on a misunderstanding of what you were saying. Hence why I unendorsed it.
(I read ” In this post, I will outline a general category of agents which may exhibit malign generalization without internal search, and then will provide a concrete example of an agent in the category. Then I will argue that, rather than being a very narrow counterexample, this class of agents could be competitive with search-based agents. ” and thought you meant agents that don’t use internal search at all.)