RSS

julianjm

Karma: 52

Eval­u­at­ing Over­sight Ro­bust­ness with In­cen­tivized Re­ward Hacking

Apr 20, 2025, 4:53 PM
7 points
2 comments15 min readLW link