Somewhat related thread (which I think was super valuable for me at least, independently) Experimentally evaluating whether honesty generalizes—LessWrong
Somewhat related thread (which I think was super valuable for me at least, independently) Experimentally evaluating whether honesty generalizes—LessWrong