RSS

JamesH

Karma: 327

ARENA 5.0 - Call for Applicants

Jan 30, 2025, 1:18 PM
35 points
2 comments6 min readLW link

ARENA 4.0 Im­pact Report

Nov 27, 2024, 8:51 PM
43 points
3 comments13 min readLW link

AI Align­ment Re­search Eng­ineer Ac­cel­er­a­tor (ARENA): Call for ap­pli­cants v4.0

Jul 6, 2024, 11:34 AM
57 points
7 comments6 min readLW link

In­ner Align­ment via Superpowers

Aug 30, 2022, 8:01 PM
37 points
13 comments4 min readLW link

Find­ing Goals in the World Model

Aug 22, 2022, 6:06 PM
59 points
8 comments13 min readLW link

The Core of the Align­ment Prob­lem is...

Aug 17, 2022, 8:07 PM
76 points
10 comments9 min readLW link

Pro­ject pro­posal: Test­ing the IBP defi­ni­tion of agent

Aug 9, 2022, 1:09 AM
21 points
4 comments2 min readLW link

Trans­lat­ing be­tween La­tent Spaces

Jul 30, 2022, 3:25 AM
27 points
2 comments8 min readLW link

For­mal­iz­ing Deception

JamesHJun 26, 2022, 5:39 PM
14 points
2 comments5 min readLW link