A related/​overlapping thing I should collect: basic low-hanging fruit for safety. E.g.:
Have a high-level capability threshold at which securing model weights is very important
Evaluate whether your models are capable at agentic tasks; look at the score before releasing or internally deploying them
A related/​overlapping thing I should collect: basic low-hanging fruit for safety. E.g.:
Have a high-level capability threshold at which securing model weights is very important
Evaluate whether your models are capable at agentic tasks; look at the score before releasing or internally deploying them