I’d argue that a major part of the problem really is long-term consequentialism, but that this is at least partially inevitable by default as soon as two conditions are met:

1. Trade-offs exist, and the value of anything is neither infinite nor arbitrarily large.
2. The agent doesn’t have full knowledge of those values.
It really doesn’t matter whether consequentialism is actually the true morality, just whether it’s more useful than other approaches (given that capabilities researchers are focused only on how capable a model is).
And for a lot of real-world problems, both conditions are pretty likely to hold; the toy sketch below illustrates the basic dynamic.
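To make that a bit more concrete, here’s a minimal sketch (my own toy illustration, not taken from either linked post): an agent that faces bounded trade-offs between goals, and only has noisy estimates of how much each action serves each goal, ends up ranking actions by their estimated expected consequences, which is the core of consequentialist reasoning. The action names, numbers, and helper functions (`TRUE_VALUES`, `noisy_estimate`, `choose`) are all made up for illustration.

```python
import random

# Condition 1: trade-offs with bounded values -- each action helps one goal
# at the cost of another, and no value is infinite or arbitrarily large.
TRUE_VALUES = {
    "ship_fast":  {"speed": 0.9, "safety": 0.2},
    "ship_safe":  {"speed": 0.5, "safety": 0.8},
    "do_nothing": {"speed": 0.1, "safety": 0.5},
}

def noisy_estimate(true_vals, noise=0.2):
    """Condition 2: the agent only sees noisy estimates of the true values."""
    return {k: v + random.uniform(-noise, noise) for k, v in true_vals.items()}

def choose(actions, weights, samples=100):
    """Pick the action with the highest estimated expected weighted value.

    Because no single action dominates on every goal and the values are only
    estimated, the general-purpose strategy left is to average over estimated
    consequences -- i.e. consequentialist-style reasoning.
    """
    def expected_value(action):
        total = 0.0
        for _ in range(samples):
            est = noisy_estimate(TRUE_VALUES[action])
            total += sum(weights[goal] * est[goal] for goal in weights)
        return total / samples

    return max(actions, key=expected_value)

if __name__ == "__main__":
    random.seed(0)
    weights = {"speed": 0.5, "safety": 0.5}
    # Expected to print "ship_safe": it has the highest expected weighted value.
    print(choose(list(TRUE_VALUES), weights))
```

Nothing here depends on the particular numbers: swap in any bounded goal values and any noise model, and the "maximize estimated expected value" structure is still the natural move, which is the sense in which the two conditions push an agent toward consequentialism.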
Here’s a link to a deontological AI idea:
https://www.lesswrong.com/posts/FSQ4RCJobu9pussjY/ideological-inference-engines-making-deontology
And here’s one for a myopic decision theory, LCDT:
https://www.lesswrong.com/posts/Y76durQHrfqwgwM5o/lcdt-a-myopic-decision-theory