TurnTrout comments on Evaluating the historical value misspecification argument

TurnTrout 9 Oct 2023 21:25 UTC
LW: 7 AF: 6
0
AF
(Placeholder: I think this view of alignment/model internals seems wrongheaded in a way which invalidates the conclusion, but don’t have time to leave a meaningful reply now. Maybe we should hash this out sometime at Lighthaven.)