Like, I may not want to become a cyborg if I stop being me, but that’s a separate concern from whether it’s bad for alignment (if the resulting cyborg is still aligned).
Oh I see. I was getting at the “it’s not aligned” bit.
Basically, it seems like if I become a cyborg without understanding what I’m doing, the result is one of:
I’m in control
The machine part is in control
Something in the middle
Only the first one seems likely to be sufficiently aligned.
I think “sufficiently” is doing a lot of work here. For example, are we talking about >99% chance that it kills <1% of humanity, or >50% chance that it kills <50% of humanity?
I also don’t think “something in the middle” is the right characterization; I think “something else” is more accurate. I think that the failure you’re pointing at will look less like a power struggle or akrasia and more like an emergent goal structure that wasn’t really present in either part.
I also think that “cyborg alignment” is in many ways a much more tractable problem than “AI alignment” (and in some ways even less tractable, because of pesky human psychology):
It’s a much more gradual problem; a misaligned cyborg (with no agentic AI components) is not directly capable of FOOM (Amdahl’s law was mentioned elsewhere in the comments as a limit on the usefulness of cyborgism, but it’s also a limit on damage; see the rough sketch after this list)
It has been studied longer and has existed longer; all technologies have influenced human thought
It also may be an important paradigm to study (even if we don’t actively create tools for it) because it’s already happening.
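To make the Amdahl’s-law point a bit more concrete (the numbers and the helper function here are my own illustration, not something from the original comments): if the human half of the cyborg still has to do a fraction (1 − p) of the cognitive work serially, then no matter how fast the AI half gets, the overall speedup is capped at 1/(1 − p). That cap limits how useful the cyborg can be, but by the same token it limits how fast a misaligned cyborg can scale up damage.

```python
def amdahl_speedup(p: float, s: float) -> float:
    """Amdahl's law: overall speedup when a fraction p of the work
    is accelerated by a factor s and the remaining (1 - p) is not."""
    return 1.0 / ((1.0 - p) + p / s)

# Illustrative numbers only: even if the AI half is 1000x faster,
# a cyborg whose human handles 20% of the work serially tops out near 5x.
print(amdahl_speedup(p=0.8, s=1000))  # ~4.98
print(1 / (1 - 0.8))                  # asymptotic cap: 5.0
```

The same ceiling applies whether the cyborg is doing useful work or harmful work, which is the sense in which it bounds damage as well as usefulness.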