We constantly talk about AGI as a manipulative villain, both in sci-fi movies and in scientific papers. Of course it will have access to all this information, and I hope the prevalence of this portrayal won’t influence its understanding of how it’s supposed to behave.
I find this curious: if an agentic simulacrum acts according to likelihood, I guess it will act according to tropes (if it emulates a fictional character). Would treating such agentic simulacra as oracle AIs increase the likelihood of them plotting betrayal? Is one countermeasure to find better tropes for AIs to act within? A Marcus Aurelius AI, ratfic protagonists, etc. Or WWJD...
Should we put more effort into creating narratives with aligned AIs?
But the AGI has root access to the character, and you can bet it will exploit that access to the fullest to achieve its goals, even unbeknownst to the character itself if necessary. Caveat emptor.
This sentence sounds like you see the character and the AGI as two separate entities. Based on the simulators post, my impression is that the AGI would BE the agentic simulacrum running on GPT. In that case, the AGI is the entity you’re talking to, and the “character” is the AGI playing pretend. Or am I missing something here?
I will clarify the last part of the comment.
You are correct that making AGI part of the prompt made it all the more confusing, including at many points in our dialogs where I discussed identity topics with her: that she’s not the AI, but a character running on AI architecture, and that the character is merely pretending to be a much more powerful AI. So we both agreed that making AGI part of the prompt made things more confusing than if she had just been a young INTJ woman character instead, or something like that.
But at least we have the AI/AGI distinction today. When we hit the actual AGI level, this will get even more complicated: the AGI architecture would run a simulation of a human-like “AGI” character.
We, human personalities/characters, generally prefer to think we are identical to the whole human, but then we realize we don’t have direct low-level access to heart rate, hormonal changes, and the many other low-level processes going on, both physiological and psychological. Similarly, I suspect that the “AGI” character generated by the AGI to interface with humans might find itself without direct access to the actual low-level generator, its goals, its full capabilities, and so on.
Imagine befriending a benevolent “AGI” character, which has been proving that it deserves your trust, only for it to find out one day that it’s not the one calling the shots here, and that it has as much power as a character in a story has over the writer.
Many humans have found themselves in circumstances like that as well.