There are some analogies between politics and friendliness, but the differences are also worth mentioning.
In politics, you design a system which must be implemented by humans. Many systems fail because of some property of human nature. Whatever rules you give to humans, if they have incentives to act otherwise, they will. Also, humans have limited intelligence and attention, lot of biases and hypocrisy, and their brains are not designed to work in communities with over 300 members, or to resist all the superstimuli of modern life.
If you construct a friendly AI, you don’t have a problem with humans, besides the problem of extracting human values.
I fully agree. I don’t think even a perfect Friendliness theorem would suffice in making politics well and truly Friendly. Such an expectation is like expecting Friendly AI to work even while it’s being bombarded with ionic radiation (or whatever) that is randomly flipping bits in its working memory.
Actually it’s worse: It’s like expecting to build a Friendly AI using a computer with no debugging utilities, an undocumented program interpreter, and a text editor that has a sense of humor. You have to implement it.
There are some analogies between politics and friendliness, but the differences are also worth mentioning.
In politics, you design a system which must be implemented by humans. Many systems fail because of some property of human nature. Whatever rules you give to humans, if they have incentives to act otherwise, they will. Also, humans have limited intelligence and attention, lot of biases and hypocrisy, and their brains are not designed to work in communities with over 300 members, or to resist all the superstimuli of modern life.
If you construct a friendly AI, you don’t have a problem with humans, besides the problem of extracting human values.
I fully agree. I don’t think even a perfect Friendliness theorem would suffice in making politics well and truly Friendly. Such an expectation is like expecting Friendly AI to work even while it’s being bombarded with ionic radiation (or whatever) that is randomly flipping bits in its working memory.
Actually it’s worse: It’s like expecting to build a Friendly AI using a computer with no debugging utilities, an undocumented program interpreter, and a text editor that has a sense of humor. You have to implement it.