RSS

dannyhalawi

Karma: 135

Covert Mal­i­cious Finetuning

Jul 2, 2024, 2:41 AM
89 points
4 comments3 min readLW link

Ap­proach­ing Hu­man-Level Fore­cast­ing with Lan­guage Models

Feb 29, 2024, 10:36 PM
60 points
6 comments3 min readLW link