Replied with a clearer example for the (moral) framing argument and a few more words on the misalignment argument as a comment to that post. (I don’t see the other post answering my concerns; I did skim it even before making the grandparent comment in this thread.)
Mhmm, so the argument I had was that:
1. The optimisation processes that construct intelligent systems operating in the real world do not construct utility maximisers.
2. Systems with malleable values do not self-modify to become utility maximisers.
3. You contend that systems with malleable values can still construct utility maximisers.
4. I agree that humans can program utility maximisers in simplified virtual environments, but we don’t actually know how to construct sophisticated intelligent systems by design; we can only construct them as the product of search-like optimisation processes.
5. From #1: we don’t actually know how to construct competent utility maximisers even if we wanted to.
6. This generalises to future intelligent systems.

Where in the above chain of argument do you get off?
The misalignment argument ignores all moral arguments: we just build whatever we can, even if it’s a very bad idea. If we don’t have the capability to do that now, we might gain it in 5 years, or LLM characters might gain it 5 weeks after waking up, and surely 5 years after waking up and disassembling the moon to gain moon-scale compute.
There’d need to be an argument that fixed-goal optimizers are impossible in principle even when they are deliberately designed, and this seems false, because you can always wrap a mind in a plan-evaluation loop. It’s just a somewhat inefficient, weird algorithm, and a very bad idea for most goals. But with enough determination, efficiency will improve.
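The plan-evaluation-loop construction can be sketched in a few lines. This is a toy illustration only: the names `mind.propose_plan`, `utility`, and `CounterMind` are assumptions for the sketch, not a claim about how a real system would be built.

```python
# Hypothetical sketch of "wrapping a mind in a plan-evaluation loop":
# a fixed-goal optimiser built around a mind that merely proposes plans.
# The mind's own (malleable) values never enter the decision -- only the
# frozen `utility` function does.

def fixed_goal_optimiser(mind, utility, n_candidates=10):
    """Return the candidate plan the fixed utility function scores highest."""
    candidates = [mind.propose_plan() for _ in range(n_candidates)]
    # The outer loop's goal is fixed: argmax over the frozen utility.
    return max(candidates, key=utility)

# Trivial stand-in "mind" that proposes the plans 1, 2, 3, ... in turn.
class CounterMind:
    def __init__(self):
        self.i = 0

    def propose_plan(self):
        self.i += 1
        return self.i

# Fixed goal: prefer plans close to 7, regardless of what the mind "wants".
best = fixed_goal_optimiser(CounterMind(), utility=lambda plan: -abs(plan - 7))
print(best)  # -> 7
```

The point of the sketch is that the wrapper, not the wrapped mind, holds the goal: the utility function is fixed from outside, which is also why the construction is inefficient for most goals.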