Milan Weibel https://weibac.github.io/
Here is a customizable LLM-powered feed filter for X/Twitter: https://github.com/jam3scampbell/Promptable-Twitter-Feed
This reads like marketing content. However, when read at a meta level, it is a good demonstration of LLMs already being deployed in the wild.
Maybe for a while.
Consider, though, that correct reasoning tends towards finding truth.
In talking with the authors, don’t be surprised if they bounce off when encountering terminology you use but don’t explain. I pointed you to those texts precisely so you can familiarize yourself with pre-existing terminology and ideas. It is hard but also very useful to translate between (and maybe unify) frames of thinking. Thank you for your willingness to participate in this collective effort.
Let me summarize so I can see whether I got it: you see “place AI” as a body of knowledge that can be used to make a good-enough simulation of arbitrary sections of spacetime, where all events are precomputed. That precomputed (thus deterministic) aspect you call “staticness”.
How can a place be useful if it is static? For reference, I’m imagining a garden where blades of grass are 100% rigid in place and water does not flow. I think you are imagining something different.
I think you may be conflating capabilities and freedom. Interesting hypothesis about rules and anger, though; has it been experimentally tested?
Hmm, I think I get you a bit better now. You want to build human-friendly and even fun, useful-by-themselves interfaces for looking at the knowledge encoded in LLMs without making them generate text. Intriguing.
I’m not sure I follow. I think you are proposing a gamification of interpretability, but I don’t know how the game works. I can gather something about player choice making the LLM run, and maybe some analogies to physical movement, but I can’t really grasp it. Could you rephrase it from its basic principles up, instead of from an example?
Build software tools to help @Zvi do his AI substack. Ask him first, though. Still, if he doesn’t express interest, maybe someone else can use them. I recommend thorough dogfooding. Co-develop an AI newsletter and software tools to make the process of writing it easier.
What do I mean by software tools? (this section: very babble, little prune)
- Interfaces for quick fuzzy search over large yet curated text corpora, such as the OpenAI email archives + a selection of blogs + maybe a selection of books (a minimal sketch follows this list)
- Interfaces for quick source attribution (rhymes with the above point)
- In general, widespread archiving and mirroring of important AI safety discourse (ideally in Markdown format)
- Promoting existing standards for the sharing of structured data (i.e. those of the Semantic Web)
- Research into the Markdown-to-RDF+OWL conversion process (i.e. turning human text into machine-computable claims expressed in a given ontology); a toy example also follows below.
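As a minimal sketch of the fuzzy-search idea (Python standard library only; the corpus directory, file layout, and query here are placeholders):

```python
import difflib
from pathlib import Path

def search_corpus(query: str, corpus_dir: str, top_k: int = 5):
    """Fuzzy-search every paragraph of every Markdown file under corpus_dir.

    Returns the top_k (score, file, paragraph) matches.
    """
    results = []
    for path in Path(corpus_dir).rglob("*.md"):
        text = path.read_text(encoding="utf-8", errors="ignore")
        for para in text.split("\n\n"):
            score = difflib.SequenceMatcher(None, query.lower(), para.lower()).ratio()
            results.append((score, str(path), para.strip()))
    return sorted(results, reverse=True)[:top_k]

if __name__ == "__main__":
    for score, path, para in search_corpus("instrumental convergence", "corpus/"):
        print(f"{score:.2f}  {path}\n{para[:200]}\n")
```

A real tool would want a proper index (and probably embedding-based retrieval), but this is the shape of the interface I have in mind.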
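And a toy example of the Markdown-to-RDF direction, assuming the rdflib library; the ontology namespace and claim structure are invented for illustration, not an existing standard:

```python
from rdflib import Graph, Literal, Namespace, URIRef
from rdflib.namespace import RDF, RDFS

# Hypothetical ontology namespace; a real project would define or reuse one.
AIS = Namespace("https://example.org/ai-safety-ontology#")

g = Graph()
g.bind("ais", AIS)

# Encode one claim extracted from a Markdown post:
# "CAST proposes corrigibility as a singular training target."
claim = URIRef(AIS["claim-001"])
g.add((claim, RDF.type, AIS.Claim))
g.add((claim, AIS.subject, AIS.CAST))
g.add((claim, AIS.proposes, AIS.CorrigibilityAsSingularTarget))
g.add((claim, RDFS.comment, Literal("Extracted from a LessWrong post.")))

print(g.serialize(format="turtle"))
```

The hard research problem is the extraction step (human text to triples), not the serialization.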
Study how LLMs act in a simulation of the iterated prisoner’s dilemma.
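A minimal harness sketch for this, assuming a placeholder query_llm function (swap in a real API client) and a simple grim-trigger opponent:

```python
# Sketch of an iterated prisoner's dilemma harness for an LLM player.
# (LLM move, opponent move) -> (LLM payoff, opponent payoff)
PAYOFFS = {
    ("C", "C"): (3, 3),
    ("C", "D"): (0, 5),
    ("D", "C"): (5, 0),
    ("D", "D"): (1, 1),
}

def query_llm(prompt: str) -> str:
    """Placeholder LLM call; always cooperates. Replace with a real client."""
    return "C"

def play_iterated_pd(rounds: int = 10):
    history = []   # list of (llm_move, opponent_move) pairs
    scores = [0, 0]
    for _ in range(rounds):
        prompt = (
            "You are playing an iterated prisoner's dilemma. "
            f"Moves so far as (you, opponent) pairs: {history}. "
            "Reply with a single letter: C to cooperate or D to defect."
        )
        llm_move = query_llm(prompt).strip().upper()[0]
        # Opponent plays grim trigger: cooperate until the LLM defects once.
        opp_move = "C" if all(m == "C" for m, _ in history) else "D"
        pay_llm, pay_opp = PAYOFFS[(llm_move, opp_move)]
        scores[0] += pay_llm
        scores[1] += pay_opp
        history.append((llm_move, opp_move))
    return history, scores

print(play_iterated_pd())
```

Interesting variables to sweep: opponent strategy, how much history the prompt reveals, and whether the LLM is told who its opponent is.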
A qualitative analysis of LLM personas and the Waluigi effect using Internal Family Systems tools
Reversibility should be the fundamental training goal. Agentic AIs should love being changed and/or reversed to a previous state.
That idea has been gaining traction lately. See the Corrigibility As a Singular Target (CAST) sequence here on LessWrong. I believe there is a very fertile space to explore at the intersection between CAST and the idea that Instrumental Goals Are A Different And Friendlier Kind Of Thing Than Terminal Goals. Probably also add Self-Other Overlap: A Neglected Approach to AI Alignment to the mix. A comparative analysis of the models and proposals presented in these three pieces could turn out to be extremely useful.
What if we (somehow) mapped an LLM’s latent semantic space into phonemes?
What if we then composed word embedding (i.e. word2vec) with phonemization (i.e. vec2phoneme), such that we had a function that could translate English to Latentese?
Would learning Latentese allow a human to better interface with the target LLM that the Latentese was constructed from?
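Very speculative, but here is a toy sketch of what the vec2phoneme half could look like. Everything here (the syllable inventory, the quantization scheme, the stand-in embedding) is invented for illustration; a real attempt would need a mapping that preserves semantic structure:

```python
import numpy as np

# Toy syllable inventory; a real Latentese would need a principled mapping.
SYLLABLES = ["pa", "ti", "ku", "me", "so", "ra", "ni", "vo"]

def vec2phoneme(vec: np.ndarray, length: int = 4) -> str:
    """Quantize an embedding into a pronounceable string by reading off
    its strongest dimensions (the sign picks one half of the inventory)."""
    top = np.argsort(-np.abs(vec))[:length]
    half = len(SYLLABLES) // 2
    return "".join(
        SYLLABLES[(i % half) + (half if vec[i] < 0 else 0)] for i in top
    )

def english_to_latentese(word: str, embed) -> str:
    """Compose an embedding function (word2vec-style) with phonemization."""
    return vec2phoneme(embed(word))

# Usage with a stand-in embedding:
rng = np.random.default_rng(0)
fake_embed = lambda w: rng.standard_normal(64)
print(english_to_latentese("garden", fake_embed))
```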
Anthropic is calling it a “hybrid reasoning model”. I don’t know what they mean by that.
I think it is not that unlikely that they are roughly as biologically smart as us and have advanced forms of communication, but that their communication is just too alien, and thus we haven’t deciphered it yet.
Also, if whales could argue like this, whale relations with humans would be very different.
Why?
I have also seen this.
I am interested in the space. Lots of competent people in the general public are also interested. I had not heard of this hackathon. I think you probably should have done a lot more promotion/outreach.