Olli Järviniemi comments on [Paper] AI Sandbagging: Language Models can Strategically Underperform on Evaluations

Olli Järviniemi 14 Jun 2024 22:11 UTC
2 points
0
Ollie Jarviniemi
This is an amusing typo; to clear any potential confusion, I am a distinct person from “Ollie J”, who is an author of the current article.
(I don’t have much to say on the article, but it looks good! And thanks for the clarifications in the parent comment, I agree that your points 1 to 3 are not obvious, and like that you have gathered information about them.)
- Teun van der Weij 14 Jun 2024 22:18 UTC
  1 point
  0
  Parent
  Oh, I am sorry, should have double-checked your name. My bad!
  
  Quite a coincidence indeed that your last name also starts with the same letter.