This seems like a better model of the terrain: we don’t know which path leads to a working alignment solution, or how far down it we’d need to go. So the strategy “let’s split up to search, gang; we’ll cover more ground” actually makes sense before trying to stack efforts in the same direction.
Yeah, basically explore-then-exploit. (I do worry that the toy model is truer IRL though...)