Yes, that’s right. OTOH I recall OpenPhil trying to evaluate MIRI’s work before the logical induction paper, and they thought it was pretty terrible. As I mentioned, I’d be pro MIRI writing up any / a few clear results over the next few years, for reasons like this.
There’s also a question of how much OpenPhil’s support mattered in your estimation of MIRI. I might write more on it later, but overall it’s not been a major factor for me.
Yes, that’s right. OTOH I recall OpenPhil trying to evaluate MIRI’s work before the logical induction paper, and they thought it was pretty terrible. As I mentioned, I’d be pro MIRI writing up any / a few clear results over the next few years, for reasons like this.
There’s also a question of how much OpenPhil’s support mattered in your estimation of MIRI. I might write more on it later, but overall it’s not been a major factor for me.
See also My current thoughts on MIRI’s “highly reliable agent design” work by Daniel Dewey (Open Phil lead on technical AI grant-making).
From the “What do I think of HRAD?” section: