TheManxLoiner

Karma: 154

Two flaws in the Machiavelli Benchmark

TheManxLoiner · Feb 12, 2025, 7:34 PM
23 points
0 comments · 3 min read · LW link

Liron Shapira vs Ken Stanley on Doom Debates. A review

TheManxLoiner · Jan 24, 2025, 6:01 PM
9 points
0 comments · 14 min read · LW link

TheManxLoiner’s Shortform

TheManxLoiner · Dec 20, 2024, 10:30 AM
3 points
6 comments · LW link

How to make evals for the AISI evals bounty

TheManxLoiner · Dec 3, 2024, 10:44 AM
9 points
0 comments · 5 min read · LW link

Scattered thoughts on what it means for an LLM to believe

TheManxLoiner · Nov 6, 2024, 10:10 PM
5 points
4 comments · 5 min read · LW link

AI as a powerful meme, via CGP Grey

TheManxLoiner · Oct 30, 2024, 6:31 PM
46 points
8 comments · 4 min read · LW link

Distillation of ‘Do language models plan for future tokens’

TheManxLoiner · Jun 27, 2024, 8:57 PM
26 points
2 comments · 6 min read · LW link

How to build a data center, by Construction Physics

TheManxLoiner · Jun 10, 2024, 5:38 PM
2 points
0 comments · 1 min read · LW link
(www.construction-physics.com)

AI Safety Institute’s Inspect hello world example for AI evals

TheManxLoiner · May 16, 2024, 8:47 PM
3 points
0 comments · 1 min read · LW link
(lovkush.medium.com)

My experience at ML4Good AI Safety Bootcamp

TheManxLoiner · Apr 13, 2024, 10:55 AM
21 points
1 comment · 5 min read · LW link