I was trying to communicate that I already share the concern around excess strictness, so I don’t understand why this (apparently condescendingly phrased) point is being repeated back to me. The point of this post is to explore the pros and cons of this eval, and to see whether there are relaxations which capture most of the pros without most of the cons.
From the original comment:
How is this a relevant metric for safety at all?
If you don’t know what I think the pros are, maybe try asking more specific questions about more specific claims I make in the post?