I believe the terms inner and outer alignment are already typically used in exactly the way you describe Aimability and Goalcraft.
outer alignment as a problem is intuitive enough to understand, i.e., is the specified loss function aligned with the intended goal of its designers?
Outer alignment deals with the problem of matching a formally specified goal function in a computer with an intent in the designer’s mind, but this is not really Goalcrafting which asks what the goal should be.
E.g. Specification gaming is part of outer alignment, but not part of Goalcrafting.
I would classify inner and outer alignment as subcategories of Aimability.
I see that you’re correct. Thanks for the clarification. I’m embarrassed that I’ve been using it wrong.
Now I have no idea where the line between outer and inner alignment falls. It looks like a common point of disagreement. So I’m not sure outer and inner alignment are very useful terms.
Outer alignment deals with the problem of matching a formally specified goal function in a computer with an intent in the designer’s mind, but this is not really Goalcrafting which asks what the goal should be.
E.g. Specification gaming is part of outer alignment, but not part of Goalcrafting.
I would classify inner and outer alignment as subcategories of Aimability.
I see that you’re correct. Thanks for the clarification. I’m embarrassed that I’ve been using it wrong.
Now I have no idea where the line between outer and inner alignment falls. It looks like a common point of disagreement. So I’m not sure outer and inner alignment are very useful terms.