saw a post from zvi on twitter yesterday, “remember that this corner of the world has a very different outlook than most other corners. if you don’t have random day-to-day ai queries what are you even doing all day?”, in reply to someone asking “who the fuck has random day-to-day ai queries?” (yes, i looked at that on my phone and typed it out, because my twitter account for following tpot people lives only on my phone, and my lw account lives only on a device with a real keyboard, and getting the link from point A to point B would have been even more hassle)
i think the lines between worlds could be defined relative to search. like, in the beginning, there was no search. if you needed information, it was given to you. if it wasn’t given to you, you made do without it. some people still live there.
then there was search, and some of us learned how to use it (others definitely didn’t!). i’ve had to train my parents carefully to search, and my mother has taken to it better than my father. search at its best, with grep within a single local text file, is amnesiac, immaterial and eternal—there is no social calculus around having searched, because the social implications of “over there on the computer” are unrelated to the contents of the typing and reading going on. i think i speak search with a faint accent, like a kid who moved countries in their early teens.
AI feels to me like it’s at its best when I use it as search on ideas. It also works as a bigger room to lay out concept prototypes in, but that’s a higher barrier to entry; it’s harder for me to learn to design and build bigger conceptual things than it is to just use good search as search.
so we have two search modalities, conventional/literal and linguistic. perhaps worth noting, i don’t think modern google is the embodiment of the old-school literal form of search—i think database queries are what that grew up into as it got with the times. modern google and google-alikes feel like databases trying to be the kind of concept-search that LLMs surpass them at.
it feels to me like the question of which search tech is superior is answered by the context of what you’re searching for. it’s the difference between numbers and letters, the difference between data and information, the difference between intelligence and wisdom—it’s a level-of-abstraction kind of distinction.
thus the question becomes, for those of us who do queries all day, what are we querying? DB queries are for things with right answers, because they can kinda read a smudged copy of the answer key to the test of reality. AI queries are worse at that, but they’re better at a different kind of thing.
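a toy sketch of the two query types, purely illustrative (the corpus and the scoring are made up for the example; the “linguistic” side is a crude bag-of-words cosine similarity standing in for what a real embedding model would do):

```python
# toy contrast: literal (grep/db-style) search vs a crude stand-in
# for linguistic (embedding-style) search. a real semantic search
# would use an actual embedding model, which would also catch
# paraphrases ("feline" for "cat") that this hack can't.
from collections import Counter
from math import sqrt

docs = [
    "the cat sat on the mat",
    "a feline rested on a rug",
    "database indices make exact lookups fast",
]

def literal_search(query, docs):
    # grep-style: the exact string is either there or it isn't.
    # right answers, no shades of grey.
    return [d for d in docs if query in d]

def bow(text):
    # bag-of-words vector: just word counts, no meaning attached
    return Counter(text.split())

def cosine(a, b):
    dot = sum(a[w] * b[w] for w in a)
    norm = sqrt(sum(v * v for v in a.values())) * sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0

def fuzzy_search(query, docs):
    # "linguistic" side: nothing is a hard miss; you get a ranking
    # over everything rather than a yes/no answer.
    q = bow(query)
    return sorted(((cosine(q, bow(d)), d) for d in docs), reverse=True)

print(literal_search("cat", docs))                 # exact hit: one right answer
print(literal_search("the cat on the mat", docs))  # no exact hit: nothing at all
print(fuzzy_search("the cat on the mat", docs))    # still ranks the cat doc first
```

the db side gives you the right answer or nothing; the fuzzy side always gives you something, graded by plausibility, which is exactly why it’s better for concept-search and worse at reading the answer key.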
There’s also a kind of creativity and engineering where you query reality directly. There are areas where direct-query is a lot better, and other areas where db-query is a lot better… like if i wanna know how this one item will behave, i should examine it, whereas if i want to know how items of this type should be expected to behave, i should not extrapolate over-much from my examination of a single item. it’s like something i recall seeing recently that isn’t turning up in search now, perhaps a youtube short from chris boden, about the difference between “engineering” and “the knack”—if you have the knack you can overbuild anything, but engineering is the art of cutting away all the excess to build the bare minimum-viable version of any given thing. If you’re using the knack, it gets to the point where you ask the materials what they’re gonna do, and they give you a better answer than the books about them might.
but why would people close to AI have that physical experience? Material hobbies are expensive and messy and cost a lot of storage space and waste time you could spend arguing about x-risk, and when material hobbies do arise, computer-people have the means to get idealized, standardized raw materials for them from wherever in the world they’re made, rather than having to make do with whatever can be found nearby. Yes, there are exceptions. Yes, you’re exceptional. But most people? Have you ever tried teaching a room full of most-people to tie a new knot? Modern life very rarely calls “us people” to direct physical problem-solving, us who do so well, or at least so connectedly, online. So of course we’d forget the other way, the get-your-information-straight-from-the-source way, if we veer away from the areas where that way’s better.
There are probably other hobbies where this is relevant. Maybe you like hiking, maybe you sail. You are probably worse at predicting the weather than the forecast on your phone, but you’ve probably met a human who was better. It’s kinda like that, with the kind of queries that lose resolution if you even put them into language. Most people building something real have to go through a quantified level, a CoT corrigibility level, to be adequately assessed and supervised on their thought process. At least, that’s the engineering-degree paradigm. And the engineering-interview one too, at least in the fields that larp as engineering: “solve this problem on the whiteboard and show your work”. It’s an implicit prioritization of the kind of work that can be shown.
anyways, that’s all to say that i think there’s a certain kind of smart that’s agnostic of whether you’re expanding your worldview in the externally legible way or the less-legible ones. i think there’s a “kind of guy” who is basically the same “kind”, in the meaningful ways, as the constantly-querying-AI one… but the kind who queries the world instead of the world-model. It probably takes both kinds. What’s “it”? Who knows!
Feels to me like at the moment, the “character layer” handles transforming a “you” into an “I”. I find it conspicuously absent in smaller models running locally, though maybe the absence is just more obvious with CoT than without it.
I’ve also noticed that the “training data” we get as humans primarily focuses on, or is contextualized relative to, ourselves, whereas the parts you’re referring to as ground layers don’t really have a concept of the LLM as an entity, so they tend to assume they’re human on that level.