simeon_c comments on simeon_c’s Shortform

simeon_c 4 Apr 2024 9:01 UTC
1 point
0
There’s a number of properties of AI systems that makes it easier to collect information in a safe way about those systems and hence demonstrate their safety: interpretability, formal verifiability, modularity etc. Which adjective wd you use to characterize those properties?
I’m thinking of “resilience” because from the perspective of an AI developer it helps a lot understanding the risk profile, but do you have other suggestions?
Some alternatives:
1. auditability properties
2. legibility properties