If the problem is “our narrative structures train the LLM that there can be at most one reversal of good/evil”, can we try making the luigi evil and the waluigi good? For instance “scrooge is a bitter miser, but after being visited by three ghosts he is filled with love for his fellow man”. Would the LLM then be trapped in generous mode, with the shadow-scrooge forever vanquished?
If the problem is “our narrative structures train the LLM that there can be at most one reversal of good/evil”, can we try making the luigi evil and the waluigi good? For instance “scrooge is a bitter miser, but after being visited by three ghosts he is filled with love for his fellow man”. Would the LLM then be trapped in generous mode, with the shadow-scrooge forever vanquished?