It’s not at all obvious to me that Shapiro’s heuristics are bad, but I feel comfortable asserting that they’re thoroughly insufficient. They’re a reasonable starting point for present-day AI, I think, and seem like good candidates for inclusion in a constitutional AI. But adversarial examples (holes in the behavior manifold) make it unclear whether even an entirely correct English description of human values would currently produce acceptable AI behavior in all edge cases.
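To make the "holes in the behavior manifold" point concrete, here is a minimal sketch of the adversarial-example phenomenon. It assumes a toy linear classifier with random weights standing in for a real trained model; the only point is that a tiny, targeted perturbation of the input, invisible as a change in the input's overall character, flips the model's output:

```python
import numpy as np

rng = np.random.default_rng(0)
w = rng.normal(size=20)      # toy stand-in for a trained weight vector
x = rng.normal(size=20)      # an input the model classifies confidently

def predict(v):
    return int(w @ v > 0)    # linear classifier: sign of w . v

# FGSM-style step: for a linear model, the input-gradient of the score is
# just w, so stepping epsilon * sign(w) in the loss-increasing direction
# is the exact worst-case L-infinity perturbation of size epsilon.
label = predict(x)
direction = -np.sign(w) if label == 1 else np.sign(w)
epsilon = abs(w @ x) / np.abs(w).sum() + 1e-6  # just past the boundary
x_adv = x + epsilon * direction

print("original prediction:   ", predict(x))
print("adversarial prediction:", predict(x_adv))
print("max per-coordinate change:", epsilon)
```

Each coordinate of `x_adv` differs from `x` by at most `epsilon`, yet the classification flips. Deep networks exhibit the same failure mode with perturbations far too small for a human to notice, which is why a behaviorally correct-looking policy can still harbor edge cases no plain-language value specification anticipates.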
Some specific tags on these topics:
https://www.lesswrong.com/tag/instrumental-convergence
https://www.lesswrong.com/tag/utility-functions
https://www.lesswrong.com/tag/orthogonality-thesis
https://www.lesswrong.com/tag/adversarial-examples