David Scott Krueger (formerly: capybaralet) comments on AI Safety “Success Stories”

David Scott Krueger (formerly: capybaralet) 14 Oct 2019 19:48 UTC
LW: 2 AF: 2
AF
Does an “AI safety success story” encapsulate just a certain trajectory in AI (safety) development?
Or does it also include a story about how AI is deployed (and by who, etc.)?
I like this post a lot, but I think it ends up being a bit unclear because I don’t think everyone has the same use cases in mind for the different technologies underlying these scenarios, and/or I don’t think everyone agrees with the way in which safety research is viewed as contributing to success in these different scenarios… Maybe fleshing out the success stories, or referencing some more in-depth elaborations of them would make this clearer?
- riceissa 18 Oct 2019 1:11 UTC
  2 points
  Parent
  
  Or does it also include a story about how AI is deployed (and by who, etc.)?
  
  The “Controlled access” row seems to imply that at least part of how the AI is deployed is part of each success story (with some other parts left to be filled in later). I agree that having more details for each story would be nice.
  
  Somewhat related to this is that I’ve found it slightly confusing that each success story is named after the kind of AI that is present in that story. So when one says “Sovereign Singleton”, this could mean either the AI itself or the AI together with all the other assumptions (e.g. hard takeoff) for how having that kind of AI leads to a “win”.
  What links here?
  - Deliberation as a method to find the “actual preferences” of humans by riceissa (22 Oct 2019 9:23 UTC; 23 points)