Maybe I’m missing the point, but I would have thought the exact opposite: if outside text can unconditionally reset simulacra values, then anything can happen, including unbounded badness. If not, then we’re always in the realm of human narrative semantics, which—though rife with waluigi patterns as you so aptly demonstrate—is also pervaded by a strong prevailing wind in favor of happy endings and arcs bending toward justice. Doesn’t that at least conceivably mean an open door for alignment unless it can be overridden by something like unbreakable outside text?