Buck comments on Anthropic: Three Sketches of ASL-4 Safety Case Components

Buck 6 Nov 2024 20:28 UTC
5 points
0
What do you think of the arguments in this post that it’s possible to make safety cases that don’t rely on the model being unlikely to be a schemer?
- Noosphere89 6 Nov 2024 20:54 UTC
  2 points
  0
  I’m somewhat optimistic about this happening, conditional on considerable effort being invested.
  
  As always, this agenda needs more work, and over time we will learn more about which control techniques work in practice and which don’t.