Vladimir_Nesov comments on Contra Hanson on AI Risk

Vladimir_Nesov 4 Mar 2023 8:48 UTC
10 points
1

As in his book The Age of Em, he’s talking about a world where we’re in the presence of superhuman AI and we haven’t been slaughtered.

The ems don’t need to be superhuman or inhumane, or keep superhuman AIs around. The historically considered WBEs were most likely to be built by superintelligent AGIs, since the level of technological restraint needed for humans to build them without building AGIs first seemed even less plausible than what it takes to ensure alignment. But LLM human imitations could play the role of ems now, without any other AGIs by the time they build their cities.

So in my view Hanson was likely shockingly prescient, even as the nature of ems seems to be shaking out a bit differently. It’s also a good framing for the situation where success of alignment is more likely to be decided, by LLM ems and not by humans.
- Liron 4 Mar 2023 17:42 UTC
  4 points
  0
  Parent
  I don’t know if LLM Ems can really be a significant factorizable part of the AI tech tree. If they have anything like today’s LLM limitations, they’re not as powerful as humans and ems. If they’re much more powerful than today’s LLMs, they’re likely to have powerful submodules that are qualitatively different from what we think of as LLMs.
  - Vladimir_Nesov 4 Mar 2023 18:27 UTC
    4 points
    0
    Parent
    I’m thinking of LLMs that are not necessarily more powerful than GPT-4, but have auxiliary routines for studying specific skills or topics that don’t automatically fall out of SSL and instead require deliberate practice (because there are currently no datasets that train them out of the box). This would make them AGI in a singularity-relevant sense, and shore up coherent agency, if it’s practiced as skills.
    
    That doesn’t move them significantly above human level, and I suspect improving quality of their thinking (as opposed to depth of technical knowledge) might prove difficult without risking misalignment, because capabilities of LLM characters are borrowed from humans, not spun up from first principles. At this point, these are essentially people, human imitations, slightly alien but still mostly aligned ems, ready to destroy the world by making further AGI capability progress.
    - Liron 5 Mar 2023 0:52 UTC
      4 points
      0
      Parent
      I guess that’s plausible, but then my main doom scenario would involve them getting leapfrogged by a different AI that has hit a rapid positive feedback loop of how to keep amplifying its consequentialist planning abilities.
      - Vladimir_Nesov 5 Mar 2023 7:50 UTC
        4 points
        0
        Parent
        
        my main doom scenario would involve them getting leapfrogged by a different AI
        
        Mine as well, hence the reference to AGI capabilities at the end of my comment, though given the premise I expect them to build it, not us. But in the meantime, there’ll be great em cities.