ryan_greenblatt

Karma: 14,231

I’m the chief scientist at Redwood Research.

To be legible, evidence of misalignment probably has to be behavioral

ryan_greenblatt, Apr 15, 2025, 6:14 PM
45 points
2 comments, 3 min read

Why do misalignment risks increase as AIs get more capable?

ryan_greenblatt, Apr 11, 2025, 3:06 AM
33 points
6 comments, 3 min read