Ben Pace comments on 2019 AI Alignment Literature Review and Charity Comparison

Ben Pace 19 Dec 2019 18:51 UTC
6 points
Yes, that’s right. OTOH I recall OpenPhil trying to evaluate MIRI’s work before the logical induction paper, and they thought it was pretty terrible. As I mentioned, I’d be in favor of MIRI writing up at least a few clear results over the next few years, for reasons like this.
There’s also a question of how much OpenPhil’s support mattered in your estimation of MIRI. I might write more on it later, but overall it’s not been a major factor for me.
- ioannes 27 Dec 2019 17:10 UTC
  3 points
  See also “My current thoughts on MIRI’s ‘highly reliable agent design’ work” by Daniel Dewey (Open Phil lead on technical AI grant-making).
  From the “What do I think of HRAD?” section:
  … This reduces my credence in HRAD being very helpful to around 10%. I think this is the decision-relevant credence.