Since we recently had an update from Redwood Research, what’s your take on what they’re doing? What problems (either technical or social/political) do you foresee them running into, and what are some avenues forward?
I’m relatively a fan of their approach (although I haven’t spent an enormous amount of time thinking about it). I like starting with problems which are concrete enough to really go at but which are microcosms for things we might eventually want.
I actually kind of think of truthfulness as sitting somewhere on the spectrum between the problem Redwood are working on right now and alignment. Many of the reasons I like truthfulness as a medium-term problem to work on are similar to the reasons I like Redwood’s current work.