See also My current thoughts on MIRI’s “highly reliable agent design” work by Daniel Dewey (Open Phil lead on technical AI grant-making).
From the “What do I think of HRAD?” section:
… This reduces my credence in HRAD being very helpful to around 10%. I think this is the decision-relevant credence.
See also My current thoughts on MIRI’s “highly reliable agent design” work by Daniel Dewey (Open Phil lead on technical AI grant-making).
From the “What do I think of HRAD?” section: