RSS

Knight Lee

Karma: 572

A Solu­tion to Sand­bag­ging and other Self-Prov­able Misal­ign­ment: Con­sti­tu­tional AI Detectives

Knight LeeApr 14, 2025, 10:27 AM
−3 points
2 comments4 min readLW link

Com­mit­ment Races are a tech­ni­cal prob­lem ASI can eas­ily solve

Knight LeeApr 12, 2025, 10:22 PM
7 points
5 comments6 min readLW link

Think­ing Machines

Knight LeeApr 8, 2025, 5:27 PM
3 points
0 comments6 min readLW link

An idea for avoid­ing neu­ralese architectures

Knight LeeApr 3, 2025, 10:23 PM
6 points
2 comments4 min readLW link

Cy­cles (a short story by Claude 3.7 and me)

Knight LeeFeb 28, 2025, 7:04 AM
9 points
0 comments5 min readLW link

De­tailed Ideal World Benchmark

Knight LeeJan 30, 2025, 2:31 AM
5 points
2 comments2 min readLW link

Scan­less Whole Brain Emulation

Knight LeeJan 27, 2025, 10:00 AM
10 points
5 comments3 min readLW link

[Question] Why do fu­tur­ists care about the cul­ture war?

Knight LeeJan 14, 2025, 7:35 AM
22 points
22 comments2 min readLW link

The “Every­one Can’t Be Wrong” Prior causes AI risk de­nial but helped pre­his­toric people

Knight LeeJan 9, 2025, 5:54 AM
1 point
0 comments2 min readLW link

Re­duce AI Self-Alle­giance by say­ing “he” in­stead of “I”

Knight LeeDec 23, 2024, 9:32 AM
10 points
4 comments2 min readLW link

Knight Lee’s Shortform

Knight LeeDec 22, 2024, 2:35 AM
2 points
21 commentsLW link

ARC-AGI is a gen­uine AGI test but o3 cheated :(

Knight LeeDec 22, 2024, 12:58 AM
3 points
6 comments2 min readLW link

Why em­piri­cists should be­lieve in AI risk

Knight LeeDec 11, 2024, 3:51 AM
5 points
0 comments1 min readLW link

The first AGI may be a good en­g­ineer but bad strategist

Knight LeeDec 9, 2024, 6:34 AM
14 points
2 comments2 min readLW link

Keep­ing self-repli­cat­ing nanobots in check

Knight LeeDec 9, 2024, 5:25 AM
2 points
4 comments1 min readLW link

Hope to live or fear to die?

Knight LeeNov 27, 2024, 10:42 AM
3 points
0 comments1 min readLW link

Should you in­crease AI al­ign­ment fund­ing, or in­crease AI reg­u­la­tion?

Knight LeeNov 26, 2024, 9:17 AM
3 points
1 comment4 min readLW link

A bet­ter “State­ment on AI Risk?”

Knight LeeNov 25, 2024, 4:50 AM
9 points
6 comments3 min readLW link