Personally, my long-term goal is a world where high-quality work on alignment is consistently funded, and where people doing high-quality work on alignment have plenty of money. I think that an effort to restrict to counterfactually-additional alignment work would “save” some money (in the sense that I’d have the money rather than some researcher who is doing alignment work) but wouldn’t be great for that long-term goal.
Also, if you actually think about the dynamics, they are pretty crappy, even if you only avoid “obvious” cases. For example, it would become really hard for anyone to actually assess counterfactual impact, since every winner would need to make it look like there was at least a plausible counterfactual impact. (I already wish there was less implicit social pressure in that direction.)
On reflection I strongly agree that social pressure around counterfactualness is a net harm for motivation.