Gurkenglas comments on Gurkenglas’s Shortform

Gurkenglas 11 Aug 2020 20:21 UTC
2 points
I expect that all that’s required for a Singularity is to wait a few years for the sort of language model that can replicate a human’s thoughts faithfully, then make it generate a thousand year’s worth of that researcher’s internal monologue, perhaps with access to the internet.
Neural networks should be good at this task—we have direct evidence that neural networks can run human brains.
Whether our world’s plot has a happy ending then merely depends on the details of that prompt/protocol—such as whether it decides to solve alignment before running a successor. Though it’s probably simple to check alignment of the character—we have access to his thoughts. A harder question is whether the first LM able to run humans is still inner aligned.