“Define “doomed”. Assuming Murphy’s law, they will eventually fail. Yet some “prosaic” approaches may be on average very helpful.”
I’m defining “doomed” here as not having a chance of actually working in the real world before the world ends. So yes, that they will eventually fail, in some way or another.
“Human values aren’t a static thing to be “aligned” with. They can be loved, traded with, etc.”
My understanding is that human values don’t have to be static for alignment to work. Values are constantly changing and vary across the world, but why is it so difficult to align it with some version of human values that doesn’t result in everyone dying?
[removed]
“Define “doomed”. Assuming Murphy’s law, they will eventually fail. Yet some “prosaic” approaches may be on average very helpful.”
I’m defining “doomed” here as not having a chance of actually working in the real world before the world ends. So yes, that they will eventually fail, in some way or another.
“Human values aren’t a static thing to be “aligned” with. They can be loved, traded with, etc.”
My understanding is that human values don’t have to be static for alignment to work. Values are constantly changing and vary across the world, but why is it so difficult to align it with some version of human values that doesn’t result in everyone dying?