RSS

ryan_greenblatt

Karma: 21,258

I’m the chief scientist at Redwood Research.

What’s up with An­thropic pre­dict­ing AGI by early 2027?

ryan_greenblatt3 Nov 2025 16:45 UTC
143 points
8 comments20 min readLW link

Son­net 4.5′s eval gam­ing se­ri­ously un­der­mines al­ign­ment evals, and this seems caused by train­ing on al­ign­ment evals

30 Oct 2025 15:34 UTC
124 points
19 comments14 min readLW link