RSS

jan betley

Karma: 169

Me, My­self, and AI: the Si­tu­a­tional Aware­ness Dataset (SAD) for LLMs

8 Jul 2024 22:24 UTC
106 points
28 comments5 min readLW link

Self-shut­down AI

jan betley21 Aug 2023 16:48 UTC
13 points
2 comments2 min readLW link

Lo­cal­iz­ing goal mis­gen­er­al­iza­tion in a maze-solv­ing policy network

jan betley6 Jul 2023 16:21 UTC
37 points
2 comments7 min readLW link

[Question] Re­v­erse en­g­ineer­ing of the simulation

jan betley7 Feb 2022 21:36 UTC
1 point
2 comments1 min readLW link