RSS

Olli Järviniemi

Karma: 1,301

Schel­ling game eval­u­a­tions for AI control

Olli Järviniemi8 Oct 2024 12:01 UTC
65 points
4 comments11 min readLW link

Dist­in­guish worst-case anal­y­sis from in­stru­men­tal train­ing-gaming

5 Sep 2024 19:13 UTC
37 points
0 comments5 min readLW link

Un­trust­wor­thy mod­els: a frame for schem­ing evaluations

Olli Järviniemi19 Aug 2024 16:27 UTC
46 points
3 comments8 min readLW link

Near-mode think­ing on AI

Olli Järviniemi4 Aug 2024 20:47 UTC
127 points
8 comments5 min readLW link

An ex­per­i­ment on hid­den cognition

Olli Järviniemi22 Jul 2024 3:26 UTC
25 points
2 comments7 min readLW link