RSS

Tony Wang

Karma: 282

Covert Mal­i­cious Finetuning

Jul 2, 2024, 2:41 AM
89 points
4 comments3 min readLW link

Take­aways from a Mechanis­tic In­ter­pretabil­ity pro­ject on “For­bid­den Facts”

Dec 15, 2023, 11:05 AM
33 points
8 comments10 min readLW link

Even Su­per­hu­man Go AIs Have Sur­pris­ing Failure Modes

Jul 20, 2023, 5:31 PM
129 points
22 comments10 min readLW link
(far.ai)

Cam­bridge LW Meetup: When Science Isn’t Enough

Apr 13, 2023, 5:36 PM
2 points
0 comments1 min readLW link

Cam­bridge LW Ra­tion­al­ity Prac­tice: Be­ing Specific

Feb 16, 2023, 6:37 AM
2 points
0 comments1 min readLW link

Cam­bridge LW Meetup: Lifehacks

Nov 29, 2022, 5:45 AM
2 points
0 comments1 min readLW link

Cam­bridge LW Meetup: See the Invisible

Tony WangOct 13, 2022, 5:44 AM
1 point
0 comments1 min readLW link

Cam­bridge LW Meetup: Authen­tic Re­lat­ing Games

Tony WangSep 19, 2022, 2:51 PM
1 point
0 comments1 min readLW link

Cam­bridge LW Meetup: Con­struc­tive Complaining

Tony WangAug 13, 2022, 4:52 AM
2 points
0 comments1 min readLW link

Cam­bridge LW Meetup: Per­sonal Finance

Tony WangJun 14, 2022, 12:12 AM
3 points
0 comments1 min readLW link

Cam­bridge LW Meetup: Books That Change

May 8, 2022, 5:23 AM
5 points
0 comments1 min readLW link

Cam­bridge LW Meetup: Bean on Why You Should Stop Wor­ry­ing and Love the Bomb

Apr 5, 2022, 6:34 PM
9 points
0 comments1 min readLW link