How do you practically do that? We don’t know what they are, and that seems to be assuming our present progress, e.g. in Mechanical Interpretability doesn’t help at all. Such work requires the existence of more powerful systems than exist today surely?
How do you practically do that? We don’t know what they are, and that seems to be assuming our present progress, e.g. in Mechanical Interpretability doesn’t help at all. Such work requires the existence of more powerful systems than exist today surely?