Does an “AI safety success story” encapsulate just a certain trajectory in AI (safety) development?
Or does it also include a story about how AI is deployed (and by who, etc.)?
I like this post a lot, but I think it ends up being a bit unclear because I don’t think everyone has the same use cases in mind for the different technologies underlying these scenarios, and/or I don’t think everyone agrees with the way in which safety research is viewed as contributing to success in these different scenarios… Maybe fleshing out the success stories, or referencing some more in-depth elaborations of them would make this clearer?
Or does it also include a story about how AI is deployed (and by who, etc.)?
The “Controlled access” row seems to imply that at least part of how the AI is deployed is part of each success story (with some other parts left to be filled in later). I agree that having more details for each story would be nice.
Somewhat related to this is that I’ve found it slightly confusing that each success story is named after the kind of AI that is present in that story. So when one says “Sovereign Singleton”, this could mean either the AI itself or the AI together with all the other assumptions (e.g. hard takeoff) for how having that kind of AI leads to a “win”.
Does an “AI safety success story” encapsulate just a certain trajectory in AI (safety) development?
Or does it also include a story about how AI is deployed (and by who, etc.)?
I like this post a lot, but I think it ends up being a bit unclear because I don’t think everyone has the same use cases in mind for the different technologies underlying these scenarios, and/or I don’t think everyone agrees with the way in which safety research is viewed as contributing to success in these different scenarios… Maybe fleshing out the success stories, or referencing some more in-depth elaborations of them would make this clearer?
The “Controlled access” row seems to imply that at least part of how the AI is deployed is part of each success story (with some other parts left to be filled in later). I agree that having more details for each story would be nice.
Somewhat related to this is that I’ve found it slightly confusing that each success story is named after the kind of AI that is present in that story. So when one says “Sovereign Singleton”, this could mean either the AI itself or the AI together with all the other assumptions (e.g. hard takeoff) for how having that kind of AI leads to a “win”.