What the? Where on earth are you getting the idea that building an FAI isn’t hard work? Or that it doesn’t require building stuff and solving gritty engineering problems?
I’d like to reinforce this point. If it isn’t hard work, please point us all at whatever solution any random mathematician and/or programmer could come up with on how to concretely implement Löb’s Theorem within an AI to self-prove that a modification will not cause systematic breakdown or change the AI’s behavior in an unexpected (most likely fatal to the human race, if you randomize through all conceptspace for possible eventualities, which is very much the best guess we have at the current state of research) manner. I’ve yet to see any example of such an application to a level anywhere near this complex in any field of physics, computing or philosophy.
Or maybe you could, instead, prove that there exists Method X that is optimal for the future of the human race which guarantees that for all possible subsets of “future humans”, there exists no possible subsets which contain any human matching the condition “sufficiently irrational yet competent to build the most dangerous form of AI possible”.
I mean, I for one find all this stuff about provability theory way too complicated. Please show us the easy-work stay-in-bed version, if you’re so sure that that’s all there is to it. You must have a lot of evidence to be this confident. All I’ve seen so far is “I’m being skeptic, also I might have evidence that I’m not telling you, so X is wrong and Y must be true!”
@aaronsw:
I’d like to reinforce this point. If it isn’t hard work, please point us all at whatever solution any random mathematician and/or programmer could come up with on how to concretely implement Löb’s Theorem within an AI to self-prove that a modification will not cause systematic breakdown or change the AI’s behavior in an unexpected (most likely fatal to the human race, if you randomize through all conceptspace for possible eventualities, which is very much the best guess we have at the current state of research) manner. I’ve yet to see any example of such an application to a level anywhere near this complex in any field of physics, computing or philosophy.
Or maybe you could, instead, prove that there exists Method X that is optimal for the future of the human race which guarantees that for all possible subsets of “future humans”, there exists no possible subsets which contain any human matching the condition “sufficiently irrational yet competent to build the most dangerous form of AI possible”.
I mean, I for one find all this stuff about provability theory way too complicated. Please show us the easy-work stay-in-bed version, if you’re so sure that that’s all there is to it. You must have a lot of evidence to be this confident. All I’ve seen so far is “I’m being skeptic, also I might have evidence that I’m not telling you, so X is wrong and Y must be true!”