Hmm. I guess that might be okay? As long as you don’t do really intense planning, the model shouldn’t be any more misaligned than a human, so it then boils down to training kindness by example and figuring out game dynamics. https://www.youtube.com/watch?v=ENpdhwYoF5g. More of the braindump of safety content I always want to recommend in every damn conversation, here on my shortform.