I have the intuition (maybe from applause lights) that if negating a point sounds obviously implausible, then the point is obviously true and it is therefore somewhat meaningless to claim it.
My idea in writing this was to identify some traps that I thought were non obvious (some of which I think I fell into as new alignment researcher).
I have the intuition (maybe from applause lights) that if negating a point sounds obviously implausible, then the point is obviously true and it is therefore somewhat meaningless to claim it.
My idea in writing this was to identify some traps that I thought were non obvious (some of which I think I fell into as new alignment researcher).