RSS

TheManxLoiner

Karma: 138

Two flaws in the Machi­avelli Benchmark

TheManxLoiner12 Feb 2025 19:34 UTC
23 points
0 comments3 min readLW link

Liron Shapira vs Ken Stan­ley on Doom De­bates. A review

TheManxLoiner24 Jan 2025 18:01 UTC
9 points
0 comments14 min readLW link

TheManxLoiner’s Shortform

TheManxLoiner20 Dec 2024 10:30 UTC
3 points
5 comments1 min readLW link

How to make evals for the AISI evals bounty

TheManxLoiner3 Dec 2024 10:44 UTC
9 points
0 comments5 min readLW link

Scat­tered thoughts on what it means for an LLM to believe

TheManxLoiner6 Nov 2024 22:10 UTC
5 points
4 comments5 min readLW link

AI as a pow­er­ful meme, via CGP Grey

TheManxLoiner30 Oct 2024 18:31 UTC
46 points
8 comments4 min readLW link