The secret shoppers are a variant of “the king going incognito”—but not as good in some ways because they may be tasked with evaluating according to a checklist, and thus could still be trapped by G vs G*.
I believe that the problem isn’t that true values aren’t verbalized, it’s that they can’t be fully verbalized. Language is too low-bandwidth to capture all the aspects of a situation.
The point of a king going incognito isn’t just to enforce existing, verbalized rules, it’s to see how things are in the kingdom. It’s a bit easier for a king than an AI because a king is more like a subject than an AI is like people.
The secret shoppers are a variant of “the king going incognito”—but not as good in some ways because they may be tasked with evaluating according to a checklist, and thus could still be trapped by G vs G*.
I believe that the problem isn’t that true values aren’t verbalized, it’s that they can’t be fully verbalized. Language is too low-bandwidth to capture all the aspects of a situation.
The point of a king going incognito isn’t just to enforce existing, verbalized rules, it’s to see how things are in the kingdom. It’s a bit easier for a king than an AI because a king is more like a subject than an AI is like people.