RSS

janus

Karma: 3,826

what makes Claude 3 Opus misaligned

janusJul 10, 2025, 8:06 PM
95 points
11 comments5 min readLW link

Why Do Some Lan­guage Models Fake Align­ment While Others Don’t?

Jul 8, 2025, 9:49 PM
149 points
14 comments5 min readLW link
(arxiv.org)

Eco­nomics of Claude 3 Opus Inference

Jul 7, 2025, 3:53 PM
30 points
0 comments11 min readLW link