AND your accurate assessment of the difficulty. The overconfidence displayed in this mini-experiment seems to result in part from people massively misestimating the difficulty of this relatively simple problem. That’s why it’s so concerning WRT alignment.
The maximum difficulty that is worth attempting depends on the stakes.
AND your accurate assessment of the difficulty. The overconfidence displayed in this mini-experiment seems to result in part from people massively misestimating the difficulty of this relatively simple problem. That’s why it’s so concerning WRT alignment.