Sounds like you agree with both me and Ninety-Three about the descriptive claim that the Shapley Value has, in fact, been changed, and have not yet expressed any position regarding the normative claim that this is a problem?
I’m not sure what you’re trying to say.
My concern is that if Bob knows that Alice will consent to a Shapley distribution, then Bob can seize more value for himself without creating new value. I feel that a person or group shouldn’t be able to get a larger share by intentionally hobbling themselves.
If B1 and B2 structure their cartel such that each of them gets a veto over the other, then the synergies change so that A+B1 and A+B2 both generate nothing, and you need A+B1+B2 to make the $100, which means B1 and B2 each now have a Shapley value of $33.3 (up from $25).
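For concreteness, that $33.3 figure is just the permutation form of the Shapley value applied to the post-cartel game: with $N=\{A,B_1,B_2\}$ and $v(S)=\$100$ only when $S=N$ (and $0$ otherwise), a player's marginal contribution is $\$100$ exactly when they join last, which happens in $2$ of the $3!=6$ orderings, so

$$\varphi_{A}=\varphi_{B_1}=\varphi_{B_2}=\frac{2}{6}\cdot \$100\approx \$33.3$$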
Also, I wouldn’t describe the original Shapley Values as “no coordination”. With no coordination, there’s no reason the end result should involve paying any non-zero amount to both B1 and B2, since you only need one of them to assent. I think Shapley Values represent a situation that’s more like “everyone (including Alice) coordinates”.
A problem I have with Shapley Values is that they can be exploited by “being more people”.
Suppose Alice and Bob can make a joint venture with a payout of $300. Synergies:
A: $0
B: $0
A+B: $300
Shapley says they each get $150. So far, so good.
Now suppose Bob partners with Carol and they make a deal that any joint ventures require both of them to approve; they each get a veto. Now the synergies are:
A+B: $0 (Carol vetoes)
A+C: $0 (Bob vetoes)
B+C: $0 (venture requires Alice)
A+B+C: $300
Shapley now says Alice, Bob, and Carol each get $100, which means Bob+Carol are getting more total money ($200) than Bob alone was ($150), even though they are (together) making exactly the same contribution that Bob was paid $150 for making in the first example.
(Bob personally made less, but if he charges Carol a $75 finder’s fee then Bob and Carol both end up with more money than in the first example, while Alice ends up with less.)
By adding more partners to their coalition (each with veto power over the whole collective), the coalition can extract an arbitrarily large share of the value.
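If you want to check the numbers, here's a quick brute-force sketch (my own, not part of the original examples) that computes Shapley values straight from a characteristic function by averaging marginal contributions over all join orders:

```python
from itertools import permutations

def shapley(players, value):
    """Average each player's marginal contribution over every order
    in which the coalition could assemble."""
    totals = {p: 0.0 for p in players}
    orders = list(permutations(players))
    for order in orders:
        coalition = set()
        for p in order:
            before = value(coalition)
            coalition = coalition | {p}
            totals[p] += value(coalition) - before
    return {p: t / len(orders) for p, t in totals.items()}

# Example 1: only Alice+Bob together produce anything -> $150 each.
print(shapley(["A", "B"], lambda s: 300 if s == {"A", "B"} else 0))

# Example 2: Bob and Carol each hold a veto, so only the full
# coalition pays out -> $100 each.
print(shapley(["A", "B", "C"], lambda s: 300 if s == {"A", "B", "C"} else 0))
```

(The same function reproduces the $33.3 figures from the B1/B2 case above: with $100 available only to the full three-player coalition, each player gets $100/3.)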
Seems like that guy has failed to grasp the fact that some things are naturally more predictable than others. Estimating how much concrete you need to build a house is just way easier than estimating how much time you need to design and code a large novel piece of software (even if the requirements don’t change mid-project).
Is that error common? I can only recall encountering one instance of it with surety, and I only know about that particular example because it was signal-boosted by people who were mocking it.
I’m confused about how continuity poses a problem for “This sentence has truth value in [0,1)” without also posing an equal problem for “this sentence is false”, which was used as the original motivating example.
I’d intuitively expect “this sentence is false” == “this sentence has truth value 0” == “this sentence does not have a truth value in (0,1]”
On my model, the phrase “I will do X” can be either a plan, a prediction, or a promise.
A plan is what you intend to do.
A prediction is what you expect will happen. (“I intend to do my homework after dinner, but I expect I will actually be lazy and play games instead.”)
A promise is an assurance. (“You may rely upon me doing X.”)
How about this: I train on all available data, but only report performance for the lots predicted to be <$1000?
This still feels squishy to me (even after your footnote about separately tracking how many lots were predicted <$1000). You’re giving the model partial control over how the model is tested.
The only concrete abuse I can immediately come up with is that maybe it cheats like you predicted by submitting artificially high estimates for hard-to-estimate cases, but you miss it because it also cheats in the other direction by rounding down its estimates for easier-to-predict lots that are predicted to be just slightly over $1000.
But just like you say that it’s easier to notice leakage than to say exactly how (or how much) it’ll matter, I feel like we should be able to say “you’re giving the model partial control over which problems the model is evaluated on, this seems bad” without necessarily predicting how it will matter.
My instinct would be to try to move the grading closer to the model’s ultimate impact on the client’s interests. For example, if you can determine what each lot in your data set was “actually worth (to you)”, then perhaps you could calculate how much money would be made or lost if you’d submitted a given bid (taking into account whether that bid would’ve won), and then train the model to find a bidding strategy with the highest expected payout.
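To make that concrete, here's a rough sketch of the kind of scoring I have in mind. All the field names and the "would have won" condition are invented stand-ins, not anything from your actual setup:

```python
# Score a bidding rule by the profit it would have produced on historical lots,
# instead of by prediction error on a subset the model itself selected.

def historical_profit(lots, bid_rule):
    """lots: records with 'features', 'winning_bid', and 'actual_worth'.
    bid_rule: maps features to a dollar bid, or None to pass on the lot."""
    profit = 0.0
    for lot in lots:
        bid = bid_rule(lot["features"])
        if bid is None:
            continue                      # passed on this lot: no cost, no gain
        if bid > lot["winning_bid"]:      # crude stand-in for "would have won"
            profit += lot["actual_worth"] - bid
        # losing bids count as zero here; real opportunity costs are messier
    return profit
```

Then you'd pick (or train) the bid rule that maximizes this number, rather than optimizing prediction accuracy on lots the model chose to be graded on.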
But I can imagine a lot of reasons you might not actually be able to do that: maybe you don’t know the “actual worth” in your training set, maybe unsuccessful bids have a hard-to-measure opportunity cost, maybe you want the model to do something simpler so that it’s more likely to remain useful if your circumstances change.
Also, you sound like you do this for a living, so I have about 30% probability you’re going to tell me that my concerns are wrong-headed for some well-studied reason I’ve never heard of.
I think you’re still thinking in terms of something like formalized political power, whereas other people are thinking in terms of “any ability to affect the world”.
Suppose a fantastically powerful alien called Superman comes to earth, and starts running around the city of Metropolis, rescuing people and arresting criminals. He has absurd amounts of speed, strength, and durability. You might think of Superman as just being a helpful guy who doesn’t rule anything, but as a matter of capability he could demand almost anything from the rest of the world and the rest of the world couldn’t stop him. Superman is de facto ruler of Earth; he just has a light touch.
If you consider that acceptable, then you aren’t objecting to “god-like status and control”, you just have opinions about how that control should be exercised.
If you consider that UNacceptable, then you aren’t asking for Superman to behave in certain ways, you are asking for Superman to not exist (or for some other force to exist that can check him).
Most humans (probably including you) are currently a “prisoner” of a coalition of humans who will use armed force to subdue and punish you if you take any actions that the coalition (in its sole discretion) deems worthy of such punishment. Many of these coalitions (though not all of them) are called “governments”. Most humans seem to consider the existence of such coalitions to be a good thing on balance (though many would like to get rid of certain particular coalitions).
I will grant that most commenters on LessWrong probably want Superman to take a substantially more interventionist approach than he does in DC Comics (because frankly his talents are wasted stopping petty crime in one city).
Most commenters here still seem to want Superman to avoid actions that most humans would disapprove of, though.
Then we’re no longer talking about “the way humans care about their friends”, we’re inventing new hypothetical algorithms that we might like our AIs to use. Humans no longer provide an example of how that behavior could arise naturally in an evolved organism, nor a case study of how it works out for people to behave that way.
My model is that friendship is one particular strategy for alliance-formation that happened to evolve in humans. I expect this is natural in the sense of being a local optimum (in the ancestral environment), but probably not in the sense of being simple to formally define or implement.
I think friendship is substantially more complicated than “I care some about your utility function”. For instance, you probably stop valuing their utility function if they betray you (friendship can “break”). I also think the friendship algorithm includes a bunch of signalling to help with coordination (so that you understand the other person is trying to be friends), and some less-pleasant stuff like evaluations of how valuable an ally the other person is and how the friendship will affect your social standing.
Friendship also appears to include some sort of check that the other person is making friendship-related-decisions using system 1 instead of system 2--possibly as a security feature to make it harder for people to consciously exploit (with the unfortunate side-effect that we penalize system-2-thinkers even when they sincerely want to be allies), or possibly just because the signalling parts evolved for system 1 and don’t generalize properly.
(One could claim that “the true spirit of friendship” is loving someone unconditionally or something, and that might be simple, but I don’t think that’s what humans actually implement.)
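To gesture at how much structure I'm imagining, here's an entirely made-up toy sketch of the components I listed; every name and number in it is hypothetical:

```python
# Toy model: "I care some about your utility function" would just be the single
# weight `caring_weight`.  The extra machinery below is the stuff I'm claiming
# the human friendship algorithm also includes.

from dataclasses import dataclass

@dataclass
class Friendship:
    caring_weight: float = 0.0   # how much I weight your utility in my decisions
    ally_value: float = 0.5      # my (less pleasant) estimate of you as an ally
    status_effect: float = 0.0   # effect of the friendship on my social standing
    broken: bool = False         # friendships can "break"

    def on_betrayal(self):
        self.broken = True
        self.caring_weight = 0.0  # stop valuing their utility function

    def on_friendly_signal(self, looks_like_system_1: bool):
        if self.broken:
            return
        # signalling for coordination, with a check that the other person seems
        # to be running on system 1 rather than consciously optimizing
        step = 0.1 if looks_like_system_1 else 0.02
        self.caring_weight = min(1.0, self.caring_weight
                                 + step * (self.ally_value + self.status_effect))
```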
You appear to be thinking of power only in extreme terms (possibly even as an on/off binary). Like, that your values “don’t have power” unless you set up a dictatorship or something.
But “power” is being used here in a very broad sense. The personal choices you make in your own life are still a non-zero amount of power to whatever you based those choices on. If you ever try to persuade someone else to make similar choices, then you are trying to increase the amount of power held by your values. If you support laws like “no stealing” or “no murder” then you are trying to impose some of your values on other people through the use of force.
I mostly think of government as a strategy, not an end. I bet you would too, if push came to shove; e.g. you are probably stridently against murdering or enslaving a quarter of the population, even if the measure passes by a two-thirds vote. My model says almost everyone would endorse tearing down the government if it went sufficiently off the rails that keeping it around became obviously no longer a good instrumental strategy.
Like you, I endorse keeping the government around, even though I disagree with it sometimes. But I endorse that on the grounds that the government is net-positive, or at least no worse than [the best available alternative, including switching costs]. If that stopped being true, then I would no longer endorse keeping the current government. (And yes, it could become false due to a great alternative being newly-available, even if the current government didn’t get any worse in absolute terms. e.g. someone could wait until democracy is invented before they endorse replacing their monarchy.)
I’m not sure that “no one should have the power to enforce their own values” is even a coherent concept. Pick a possible future—say, disassembling the earth to build a Dyson sphere—and suppose that at least one person wants it to happen, and at least one person wants it not to happen. When the future actually arrives, it will either have happened, or not—which means at least one person “won” and at least one person “lost”. What exactly does it mean for “neither of those people had the power to enforce their value”, given that one of the values did, in fact, win? Don’t we have to say that one of them clearly had enough power to stymie the other?
You could say that society should have a bunch of people in it, and that no single person should be able to overpower everyone else combined. But that doesn’t prevent some value from being able to overpower all other values, because a value can be endorsed by multiple people!
I suppose someone could hypothetically say that they really only care about the process of government and not the result, such that they’ll accept any result as long as it is blessed by the proper process. Even if you’re willing to go to that extreme, though, that still seems like a case of wanting “your values” to have power, just where the thing you value is a particular system of government. I don’t think that having this particular value gives you any special moral high ground over people who value, say, life and happiness.
I also think that approximately no one actually has that as a terminal value.
In the context of optimization, values are anything you want (whether moral in nature or otherwise).
Any time a decision is made based on some value, you can view that value as having exercised power by controlling the outcome of that decision.
Or put more simply, the way that values have power, is that values have people who have power.
I feel like your previous comment argues against that, rather than for it. You said that people who are trapped together should be nice to each other because the cost of a conflict is very high. But now you’re suggesting that ASIs that are metaphorically trapped together would aggressively attack each other to enforce compliance with their own behavioral standards. These two conjectures do not really seem allied to me.
Separately, I am very skeptical of aliens warring against ASIs to acausally protect us. I see multiple points where this seems likely to fail:
Would aliens actually take our side against an ASI merely because we created it? If humans hear a story about an alien civilization creating a successor species, and then the successor species overthrowing its creators, I do not expect humans to automatically be on the creators’ side in this story. I expect humans will take a side mostly based on how the two species were treating each other (overthrowing abusive masters is usually portrayed as virtuous in our fiction), and that which one of them is the creator will have little weight. I do not think “everyone should be aligned with their creators” is a principle that humans would actually endorse (except by motivated reasoning, in situations where it benefits us).
Also note that humans are not aligned with the process that produced us (evolution), and approximately no humans think this is a problem.
Even if the aliens sympathize with us, would they care enough to take expensive actions about it?
Even if the aliens would war to save us, would the ASI predict that? It can only acausally save us if the ASI successfully predicts the policy. Otherwise, the war might still happen, but that doesn’t help us.
Even if the ASI predicts this, will it comply? This seems like what dath ilan would consider a “threat”, in that the aliens are punishing the ASI rather than enacting their own BATNA. It may be decision-theoretically correct to ignore the threat.
This whole premise, of us being saved at the eleventh hour by off-stage actors, seems intuitively like the sort of hypothesis that would be more likely to be produced by wishful thinking than by sober analysis, which would make me distrust it even if I couldn’t see any specific problems with it.
I don’t see why either expecting or not-expecting to meet other ASIs would make it instrumental to be nice to humans.
I have an intuition like: Minds become less idiosyncratic as they grow up.
A couple of intuition pumps:
(1) If you pick a game, and look at novice players of that game, you will often find that they have rather different “play styles”. Maybe one player really likes fireballs and another really likes crossbows. Maybe one player takes a lot of risks and another plays it safe.
Then if you look at experts of that particular game, you will tend to find that their play has become much more similar. I think “play style” is mostly the result of two things: (a) playing to your individual strengths, and (b) using your aesthetics as a tie-breaker when you can’t tell which of two moves is better. But as you become an expert, both of these things diminish: you become skilled at all areas of the game, and you also become able to discern even small differences in quality between two moves. So your “play style” is gradually eroded and becomes less and less noticeable.
(2) Imagine if a society of 3-year-olds were somehow in the process of creating AI, and they debated whether their AI would show “kindness” to stuffed animals (as an inherent preference, rather than an instrumental tool for manipulating humans). I feel like the answer to this should be “lol no”. Showing “kindness” to stuffed animals feels like something that humans correctly grow out of, as they grow up.
It seems plausible to me that something like “empathy for kittens” might be a higher-level version of this, that humans would also grow out of (just like they grow out of empathy for stuffed animals) if the humans grew up enough.
(Actually, I think most human adults still have some empathy for stuffed animals. But I think most of us wouldn’t endorse policies designed to help stuffed animals. I’m not sure exactly how to describe the relation that 3-year-olds have to stuffed animals but adults don’t.)
I sincerely think caring about kittens makes a lot more sense than caring about stuffed animals. But I’m uncertain whether that means we’ll hold onto it forever, or just that it takes more growing-up in order to grow out of it.
Paul frames this as “mostly a question about idiosyncrasies and inductive biases of minds rather than anything that can be settled by an appeal to selection dynamics.” But I’m concerned that might be a bit like debating the odds of whether your newborn human will one day come to care for stuffed animals, instead of whether they will continue to care for them after growing up. It can be very likely that they will care for a while, and also very likely that they will stop.
I strongly suspect it is possible for minds to become quite a lot more grown-up than humans currently are.
(I think Habryka may have been saying something similar to this.)
Still, I notice that I’m doing a lot of hand-waving here and I lack a gears-based model of what “growing up” actually entails.
Speaking as a developer, I would rather have a complete worked-out example as a baseline for my modifications than a box of loose parts.
I do not think that the designer mindset of unilaterally specifying neutral rules to provide a good experience for all players is especially similar to the negotiator mindset of trying to make the deal that will score you the most points.
I haven’t played Optimal Weave yet, but my player model predicts that a nontrivial fraction of players are going to try to trick each other during their first game. Also I don’t think any hidden info or trickery is required in order for rule disagreements to become an issue.
then when they go to a meetup or a con, anyone they meet will have a different version
No, that would actually be wonderful. We can learn from each other and compile our best findings.
That’s...not the strategy I would choose for playtesting multiple versions of a game. Consider:
Testers aren’t familiar with the mainline version and don’t know how their version differs from it, so can’t explain what their test condition is or how their results differ
You don’t know how their version differs either, or even whether it differs, except by getting them to teach you their full rules.
There’s a high risk they will accidentally leave out important details of the rules—even professional rulebooks often have issues, and that’s not what you’ll be getting. So interpreting whatever feedback you get will be a significant issue.
You can’t guarantee that any particular version gets tested
You can’t exclude variants that you believe are not worth testing
You can’t control how much testing is devoted to each version
Many players may invent bad rules and then blame their bad experience on your game, or simply refuse to play at all if you’re going to force them to invent rules, so you end up with a smaller and less-appreciative playerbase overall
The only real advantage I see to this strategy is that it may result in substantially more testers than asking for volunteers. But it accomplishes that by functionally deceiving your players about the fact that they’re testing variants, which isn’t a policy I endorse, either on moral or pragmatic grounds.
Most of the people that you’ve tricked into testing for you will never actually deliver any benefits to you. Even among volunteers, only a small percentage of playtesters actually deliver notable feedback (perhaps a tenth, depending on how you recruit). Among people who wouldn’t have volunteered, I imagine the percentage will be much lower.
[failed line of thought, don’t read]
Maybe limit it to bringing 1 thing with you? But notice this permits “stealing” items from other players, since “being carried” is not a persistent state.
“longer descriptions of the abilities”
I’d like that. That would be a good additional manual page, mostly generated.
If you’re imagining having a computer program generate this, I’m not sure how that could work. The purpose is not merely to be verbose, but to act as a FAQ for each specific ability, hopefully providing a direct answer to whatever question prompted them to look that ability up.
If you aren’t familiar with this practice, maybe take a look at the Dominion rulebook as an example.
I think I could take a stab at a summary.
This is going to elide most of the actual events of the story to focus on the “main conflict” that gets resolved at the end of the story. (I may try to make a more narrative-focused outline later if there’s interest, but this is already quite a long comment.)
As I see it, the main conflict (the exact nature of which doesn’t become clear until quite late) is mainly driven by two threads that develop gradually throughout the story… (major spoilers)
The first thread is Keltham’s gradual realization that the world of Golarion is pretty terrible for mortals, and is being kept that way by the power dynamics of the gods.
The key to understanding these dynamics is that certain gods (and coalitions of gods) have the capability to destroy the world. However, the gods all know (Eliezer’s take on) decision theory, so you can’t extort them by threatening to destroy the world. They’ll only compromise with you if you would honestly prefer destroying the world to the status quo, if those were your only two options. (And they have ways of checking.) So the current state of things is a compromise to ensure that everyone who could destroy the world, prefers not to.
Keltham would honestly prefer destroying Golarion (primarily because a substantial fraction of mortals currently go to hell and get tortured for eternity), so he realizes that if he can seize the ability to destroy the world, then the gods will negotiate with him to find a mutually-acceptable alternative.
Keltham speculates (though it’s only speculation) that he may have been sent to Golarion by some powerful but distant entity from the larger multiverse, as the least-expensive way of stopping something that entity objects to.
The second thread is that Nethys (god of knowledge, magic, and destruction) has the ability to see alternate versions of Golarion and to communicate with alternate versions of himself, and he’s seen several versions of this story play out already, so he knows what Keltham is up to. Nethys wants Keltham to succeed, because the new equilibrium that Keltham negotiates is better (from Nethys’ perspective) than the status quo.
However, it is absolutely imperative that Nethys does not cause Keltham to succeed, because Nethys does not prefer destroying the world to the status quo. If Keltham only succeeds because of Nethys’ interventions, the gods will treat Keltham as Nethys’ pawn, and treat Keltham’s demands as a threat from Nethys, and will refuse to negotiate.
So Nethys can only intervene in ways that all of the major gods will approve of (in retrospect). So he runs around minimizing collateral damage, nudges Keltham towards being a little friendlier in the final negotiations, and very carefully never removes any obstacle from Keltham’s path until Keltham has proven that he can overcome it on his own.
Nethys considers it likely that this whole situation was intentionally designed as some sort of game, by some unknown entity. (Partly because Keltham makes several successful predictions based on dath ilani game tropes.)
At the end of the story, Keltham uses an artifact called the starstone to turn himself into a minor god, then uses his advanced knowledge of physics (unknown to anyone else in the setting, including the gods) to create weapons capable of destroying the world, announces that that’s his BATNA, and successfully negotiates with the rest of the gods to shut down hell, stop stifling mortal technological development, and make a few inexpensive changes to improve overall mortal quality-of-life. Keltham then puts himself into long-term stasis to see if the future of this world will seem less alienating to him than the present.