Primarily interested in agent foundations, AI macrostrategy, and enhancement of human intelligence, sanity, and wisdom.
I endorse and operate by Crocker’s rules.
I have not signed any agreements whose existence I cannot mention.
Then we train to match the original model’s output by minimising an MSE loss
I think you wanted
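(As an aside, here is a minimal sketch of what “train to match the original model’s output by minimising an MSE loss” could look like in PyTorch; the models, shapes, and data below are made-up placeholders, not anything from the post:)

```python
import torch
import torch.nn as nn

# Illustrative placeholders: a frozen "original" model and a trainable one.
teacher = nn.Linear(16, 4)   # stands in for the original model
student = nn.Linear(16, 4)   # stands in for the model being trained
optimizer = torch.optim.Adam(student.parameters(), lr=1e-3)

x = torch.randn(32, 16)      # a batch of dummy inputs
with torch.no_grad():
    target = teacher(x)      # the original model's outputs

optimizer.zero_grad()
loss = nn.functional.mse_loss(student(x), target)  # match outputs via MSE
loss.backward()
optimizer.step()
```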
I interpret you as insinuating that not disclosing that it was a project commissioned by industry was strategic.
I’m not necessarily implying that they explicitly/deliberately coordinated on this.
Perhaps there was no explicit “don’t mention OpenAI” policy, but there was no “person X is responsible for ensuring that mathematicians know about OpenAI’s involvement” policy either.
But given that some of the mathematicians haven’t heard a word about OpenAI’s involvement from the Epoch team, it seems like Epoch at least had a reason not to mention OpenAI’s involvement (though this depends on how extensive the communication between the two sides was). Possibly because they were aware of how the mathematicians might react, both before the project started and in the middle of it.
[ETA: In short, I would have expected this information to reach the mathematicians with high probability, unless the Epoch team had been disinclined to inform the mathematicians.]
Obviously, I’m just speculating here, and the non-Epoch mathematicians involved in the creation of FrontierMath know better than whatever I might speculate.
The analogy is that I consider living for eternity to be scary, and you say, “well, you can stop any time”. True, but it’s always going to be rational for me to live for one more year, and that way lies eternity.
The distinction you want is probably not rational/irrational but CDT/UDT or whatever.
Also,
insurance against the worst outcomes lasting forever
well, it’s also insurance against the best outcomes lasting forever (though you’re probably going to reply that bad outcomes are more likely than good outcomes and/or that you care more about preventing bad outcomes than ensuring good outcomes)
Our agreement did not prevent us from disclosing to our contributors that this work was sponsored by an AI company. Many contributors were unaware of these details, and our communication with them should have been more systematic and transparent.
So… why did they not disclose to their contributors that this work was sponsored by an AI company?
Specifically, not just any AI company, but the AI company that has (deservedly) perhaps the worst rep among all the frontier AI companies.[1]
I can’t help but think that some of the contributors would have declined the offer to contribute had they been told that it was sponsored by an AI capabilities company.
I can’t help but think that many more would have declined had they been told that it was sponsored by OpenAI specifically.
I can’t help but think that this is the reason why they were not informed.
Though Meta also has a legitimate claim to having the worst rep, albeit with different axes of worseness contributing to their overall score.
When’s the application deadline?
This is not quite deathism but perhaps a transition in the direction of “my own death is kinda not as bad”:
a big motivator for me used to be some kind of fear of death. But then I thought about philosophy of personal identity until I shifted to the view that there’s probably no persisting identity over time anyway and in some sense I probably die and get reborn all the time in any case.
and in a comment:
I’m clearly doing things that will make me better off in the future. I just feel less continuity to the version of me who might be alive fifty years from now, so the thought of him dying of old age doesn’t create a similar sense of visceral fear. (Even if I would still prefer him to live hundreds of years, if that was doable in non-dystopian conditions.)
to the extent this is feasible for us
Was [keeping FrontierMath entirely private and under Epoch’s control] feasible for Epoch in the same sense of “feasible” you are using here?
Strong agree.
For a more generalized version, see: https://www.lesswrong.com/posts/4gDbqL3Tods8kHDqs/limits-to-legibility
(caveat they initially distil from a much larger model, which I see as a little bit of a cheat)
Another little bit of a cheat is that they only train Qwen2.5-Math-7B according to the procedure described. In contrast, for the other three models (smaller than Qwen2.5-Math-7B), they instead use the fine-tuned Qwen2.5-Math-7B to generate the training data to bootstrap round 4. (Basically, they distill from DeepSeek in round 1 and then they distill from fine-tuned Qwen in round 4.)
They justify:
Due to limited GPU resources, we performed 4 rounds of self-evolution exclusively on Qwen2.5-Math-7B, yielding 4 evolved policy SLMs (Table 3) and 4 PPMs (Table 4). For the other 3 policy LLMs, we fine-tune them using step-by-step verified trajectories generated from Qwen2.5-Math-7B’s 4th round. The final PPM from this round is then used as the reward model for the 3 policy SLMs.
TBH I’m not sure how this helps them save on GPU resources. Why would it be cheaper to generate a lot of big/long rollouts with Qwen2.5-Math-7B-r4 than to do it three times with [smaller model]-r3?
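For my own clarity, here is a rough sketch of the data flow as I read it from the quoted passage. Every name below is an illustrative stub of my paraphrase, not the paper’s actual pipeline code, and the last step reflects my reading that the smaller models are trained on rollouts from the round-4 Qwen model:

```python
# Rough paraphrase of the quoted procedure; all names are illustrative stubs.

SMALLER_POLICY_MODELS = ["SLM-1", "SLM-2", "SLM-3"]  # stand-ins for the 3 smaller models

def generate_verified_trajectories(policy, reward_model):
    """Stand-in for rollout generation plus step-by-step verification."""
    return f"trajectories(policy={policy}, reward_model={reward_model})"

def finetune(base_model, data, round_idx):
    """Stand-in for supervised fine-tuning of `base_model` on `data`."""
    return f"{base_model}-r{round_idx}"

def train_ppm(data, round_idx):
    """Stand-in for training the process preference model (PPM) on `data`."""
    return f"PPM-r{round_idx}"

policy, ppm = "DeepSeek", None  # round 1 distils from the much larger model
for round_idx in range(1, 5):   # 4 rounds of self-evolution, only for Qwen2.5-Math-7B
    data = generate_verified_trajectories(policy, ppm)
    policy = finetune("Qwen2.5-Math-7B", data, round_idx)
    ppm = train_ppm(data, round_idx)

# My reading: the 3 smaller policy models never get rounds of their own; they are
# fine-tuned on verified trajectories rolled out with the round-4 Qwen model and
# reuse the round-4 PPM as their reward model.
round4_data = generate_verified_trajectories(policy, ppm)
small_policies = [finetune(m, round4_data, 4) for m in SMALLER_POLICY_MODELS]
```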
I donated $1k.
Lighthaven is the best venue I’ve been to. LessWrong is the best place on the internet that I know of and it hosts an intellectual community that was crucial for my development as a thinker and greatly influenced my life decisions over the last 3 years.
I’m grateful for it.
I wish you all the best and hope to see you flourish and prosper.
The vNM axioms constrain the shape of an agent’s preferences, they say nothing about how to make decisions
Suppose your decision in a particular situation comes down to choosing between some number of lotteries (with specific estimated probabilities over their outcomes) and there’s no complexity/nuance/tricks on top of that. In that case, vNM says that you should choose the one with the highest expected utility as this is the one you prefer the most.
At least assuming that choice is the right operationalization of preferences; if it isn’t, then the Dutch book / money-pump arguments don’t follow either.
ETA: I guess I could just say:
What are your preferences if not your idealized evaluations of decision-worthiness of options (modulo “being a corrupted piece of software running on corrupted hardware”)?
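To make the earlier point concrete, here is a tiny worked example of picking the lottery with the highest expected utility; the outcomes, probabilities, and utilities are invented purely for illustration:

```python
# Pick the lottery with the highest expected utility (vNM-style choice).
# All outcomes, probabilities, and utilities below are made up.

utility = {"win_big": 10.0, "win_small": 3.0, "nothing": 0.0}

lotteries = {
    "A": [("win_big", 0.1), ("nothing", 0.9)],
    "B": [("win_small", 0.5), ("nothing", 0.5)],
}

def expected_utility(lottery):
    return sum(p * utility[outcome] for outcome, p in lottery)

print({name: expected_utility(lot) for name, lot in lotteries.items()})  # {'A': 1.0, 'B': 1.5}
print(max(lotteries, key=lambda name: expected_utility(lotteries[name])))  # 'B'
```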
1. Introduce third-party mission alignment red teaming.
Anthropic should invite external parties to scrutinize and criticize Anthropic’s instrumental policy and specific actions based on whether they are actually advancing Anthropic’s stated mission, i.e. safe, powerful, and beneficial AI.
Tentatively, red-teaming parties might include other AI labs (adjusted for conflict of interest in some way?), as well as AI safety/alignment/risk-mitigation orgs: MIRI, Conjecture, ControlAI, PauseAI, CEST, CHT, METR, Apollo, CeSIA, ARIA, AI Safety Institutes, Convergence Analysis, CARMA, ACS, CAIS, CHAI, &c.
For the sake of clarity, each red team should provide a brief on their background views (something similar to MIRI’s Four Background Claims).
Along with their criticisms, red teams would be encouraged to propose somewhat specific changes, possibly ordered by magnitude, with something like “allocate marginally more funding to this” being a small change and “pause AGI development completely” being a very big change. Ideally, they should avoid making suggestions that include the possibility of making a small improvement now that would block a big improvement later (or make it more difficult).
Since Dario seems to be very interested in “race to the top” dynamics: if this mission alignment red-teaming program successfully signals well about Anthropic, other labs should catch up and start competing more intensely to be evaluated as positively as possible by third parties (“race towards safety”?).
It would also be good to have a platform where red teams can converse with Anthropic, as well as with each other, and the logs of their back-and-forth are published to be viewed by the public.
Anthropic should commit to taking these criticisms seriously. In particular, given how large the stakes are, they should commit to taking something like “many parties believe that Anthropic in its current form might be net-negative, even increasing the risk of extinction from AI” as a reason to pause or slow down, even if that’s contrary to their inside view.
2. Anthropic should make an explicit statement about its infohazard policy.
This statement should explain how Anthropic thinks about, and how it handles, doing and publishing research that advances AGI development without benefiting safety/alignment/x-risk reduction to an extent sufficient to offset its contribution to (likely unsafe-by-default) AGI development.
I wish this was posted as a question, ideally by you together with other Anthropic people, including Dario.
Figure out a way to show users the CoT of reasoning/agent models that you release in the future. (i.e. don’t do what OpenAI did with o1). Doesn’t have to be all of it, just has to be enough—e.g. each user gets 1 CoT view per day.
What would be the purpose of 1 CoT view per user per day?
Where does China fit into this picture
Unlike the West, China enjoys unconditional love from the Heavens. /j
[After I wrote down the thing, I became more uncertain about how much weight to give to it. Still, I think it’s a valid consideration to have on your list of considerations.]
“AI alignment”, “AI safety”, “AI (X-)risk”, “AInotkilleveryoneism”, “AI ethics” came to be associated with somewhat specific categories of issues. When somebody says “we should work (or invest more or spend more) on AI {alignment,safety,X-risk,notkilleveryoneism,ethics}”, they communicate that they are concerned about those issues and think that deliberate work on addressing those issues is required or otherwise those issues are probably not going to be addressed (to a sufficient extent, within relevant time, &c.).
“AI outcomes” is even broader/[more inclusive] than any of the above (the only step left to broaden it even further would perhaps be to say “work on AI being good” or, in the other direction, “work on technology/innovation outcomes”) and/but it also waters down the issue even more. Now you’re saying “AI is not going to be (sufficiently) good by default”, with various “AI outcomes” people having very different ideas about what makes AI likely not (sufficiently) good by default.
It feels like we’re moving in the direction of broadening our scope of consideration to (1) ensure we’re not missing anything, and (2) facilitate coalition building (moral trade?). While this is valid, it risks (1) failing to operate on the/an appropriate level of abstraction, and (2) diluting our stated concerns so much that coalition building becomes too difficult because different people/groups endorsing the stated concerns have their own interpretations/beliefs/value systems. (Something something find an optimum (but also be ready and willing to update where you think the optimum lies when the situation changes)?)
I’m not claiming it’s feasible (within decades). That’s just what a solution might look like.
This is great.
Now, given that you’re already talking about instrumental goals “trying not to step on each other’s toes”, what else would they need to deserve the name of “subagents”?