RSS

rife

Karma: 178

Independent AI Researcher

Findings posted here and at awakenmoon.ai

Disprov­ing the “Peo­ple-Pleas­ing” Hy­poth­e­sis for AI Self-Re­ports of Experience

rifeJan 26, 2025, 3:53 PM
3 points
18 comments12 min readLW link

Re­cur­sive Self-Model­ing as a Plau­si­ble Mechanism for Real-time In­tro­spec­tion in Cur­rent Lan­guage Models

rifeJan 22, 2025, 6:36 PM
8 points
5 comments2 min readLW link

The Hu­man Align­ment Prob­lem for AIs

rifeJan 22, 2025, 4:06 AM
10 points
5 comments3 min readLW link