Buck comments on Untrusted smart models and trusted dumb models

Buck 15 Nov 2024 15:16 UTC
2 points
2
I think your short definition should include the part about our epistemic status: “We are happy to assume the AI isn’t adversarially trying to cause a bad outcome”.