Roman Leventov

Karma: 1,431

An independent researcher/blogger/philosopher about intelligence and agency (esp. Active Inference), alignment, ethics, interaction of the AI transition with the sociotechnical risks (epistemics, economics, human psychology), collective mind architecture, research strategy and methodology.

Twitter: https://twitter.com/leventov. E-mail: leventov.ru@gmail.com (the preferred mode of communication). I’m open to collaborations and work.

Presentations at meetups, workshops and conferences, some recorded videos.

I’m a founding member of the Gaia Consoritum, on a mission to create a global, decentralised system for collective sense-making and decision-making, i.e., civilisational intelligence. Drop me a line if you want to learn more about it and/or join the consoritum.

You can help to boost my sense of accountability and give me a feeling that my work is valued by becoming a paid subscriber of my Substack (though I don’t post anything paywalled; in fact, on this blog, I just syndicate my LessWrong writing).

For Russian speakers: русскоязычная сеть по безопасности ИИ, Telegram group.

Roman Leventov Apr 7, 2025, 9:02 AM
2 points
0
in reply to: Vladimir_Nesov’s comment on: An Optimistic 2027 Timeline
But then the possibilities for 2027 branch on whether there are reliable agents, which doesn’t seem knowable either way right now.
Very reliable, long-horizon agency is already in the capability overhang of Gemini 2.5 pro, perhaps even the previous-tier models (gemini 2.0 exp, sonnet 3.5/3.7, gpt-4o, grok 3, deepseek r1, llama 4). It’s just the matter of harness/agent-wrapping logic and inference-time compute budget.
Agency engineering is currently in the brute-force stage. Agent engineers over rely on a “single LLM rollout” to be robust, but also often use LLM APIs that sometimes lack certain nitty-gritty affordances for implementing reliable agency, such as “N completions” with timely self-consistency pruning and perhaps scaling N up again when model’s own uncertainty is up.
This somewhat reminds me of the early LLM scale-up era where LLM engineers over relied on “stack more layers” without digging more into the architectural details. The best example is perhaps Megatron, a trillion-parameter model from 2021 whose performance is probably abysmal relative to the 2025 models of ~10B parameters (perhaps even 1B).
So, the current agents (such as Cursor, Claude Code, Replit, Manus) are in the “Megatron era” of efficiency. In four years, even with the same raw LLM capability, agents will be very reliable.
To give a more specific example when robustness is a matter of spending more on inference, let’s consider Gemini 2.5 pro: contrary to the hype, it often misses crucial considerations or acts strangely stupidly on modestly sized contexts (less than 50k tokens). However, seeing these omissions, it’s obvious to me that if someone applied ~1k token-sized chunks of that context to 2.5-pro’s output and asked a smaller LLM (flash or flash lite) “did this part of the context properly informed that output”, flash would answer No when 2.5-pro indeed missed something important from that part of the context. This should trigger a fallback on N-completions, 2.5 self-review with smaller pieces of the context, breaking down the context hierarchically, etc.

Roman Leventov Mar 27, 2025, 1:01 PM
5 points
0
on: Roman Leventov’s Shortform
It seems that a lot of white collar jobs will become (already becoming) positional goods, such as aristocratic titles, at least for a few years, possibly longer.
AI will do 100% of the “meat” of the job better than almost all humans, and ~equally for every user (prompting won’t matter much).
But business will still demand accountability for results, and that the workers can claim that they understand and attest AI outputs (these claims themselves won’t be tested, though, nor would it really matter in the grand scheme of things). At the same time, the productivity of these jobs will increase more than businesses can absorb, at least for a few years (and then perhaps fully automated companies will ensue). Thus, fewer total white collar workers are needed.
When the skill doesn’t really matter, and the demand decreases, the jobs will become highly contested and the credentials, prestige (pedigree), connections, and “soft skills” (primarily: of passing the interviews) will decide these contents rather than “hard skills” (of which only the skill of understanding sophisticated AI outputs and potentially fix remaining issues with AI outputs will really matter, but the marginal difference between workers who are good and bad at this skill will be relatively small for the company’s bottom line, and testing candidates for this skill will be too hard).
The above straightforwardly applies to all “digital”/online/IT/analyst/manager jobs.
I don’t buy the takes like Steve Yegge’s https://sourcegraph.com/blog/revenge-of-the-junior-developer and similar, with projections of white collar workers becoming 10x, 100x more productive than today. Backlogs are not that deep, and the marginal value of churning through 99% of these backlog issues for companies is ~0.
I also don’t believe in Jevon’s paradox wonders of increased demand for “digital” work, again at least for a few years (or realistically, 10+ years) until the economy goes through a deeper transformation (including geographically). In the meantime, the economy looks to be already ~saturated (or even oversaturated) with IT/digitalization, marketing, compliance, legal proceedings, analysis, educational materials, and other similar outputs of white collar work.

Roman Leventov Feb 4, 2025, 8:59 AM
4 points
−2
on: Gradual Disempowerment, Shell Games and Flinches
Even for those not directly employed by AI labs, there are similar dynamics in the broader AI safety community. Careers, research funding, and professional networks are increasingly built around certain ways of thinking about AI risk. Gradual disempowerment doesn’t fit neatly into these frameworks. It suggests we need different kinds of expertise and different approaches than what many have invested years developing. Academic incentives also currently do not point here—there are likely less than ten economists taking this seriously, trans-disciplinary nature of the problem makes it hard sell as a grant proposal.
I agree this is unfortunate, but this also seems irrelevant? Academic economics (as well as sociology, political science, anthropology, etc.) are approximately completely irrelevant to shaping major governments’ AI policies. “Societal preparedness” and “governance” teams at major AI labs and BigTech giants seem to have approximately no influence on the concrete decisions and strategies of their employers.
The last economist who influenced the economic and policy trajectory significantly was Milton Friedman perhaps?
If not research, what can affect the economic and policy trajectory at all in a deliberate way (disqualifying the unsteerable memetic and cultural drift forces), apart from powerful leaders themselves (Xi, Trump, Putin, Musk, etc.)? Perhaps the way we explore the “technology tree” (see https://michaelnotebook.com/optimism/index.html)? Such as the internet, social media, blockchain, form factors of AI models, etc. I don’t hold too much hope here, but this looks to me like the only plausible lever.

Roman Leventov Feb 4, 2025, 8:42 AM
11 points
0
in reply to: Knight Lee’s comment on: Gradual Disempowerment, Shell Games and Flinches
My quick impression is that this is a brutal and highly significant limitation of this kind of research. It’s just incredibly expensive for others to read and evaluate, so it’s very common for it to get ignored.
I’d predict that if you improved the arguments by 50%, it would lead to little extra uptake.
I think this is wrong. The introduction of the GD paper takes no more than 10 minutes to read and no significant cognitive effort to grasp, really. I don’t think there is more than 10% potential of making it any clearer or approachable.

Roman Leventov Feb 3, 2025, 7:41 AM
2 points
0
in reply to: jimrandomh’s comment on: The Failed Strategy of Artificial Intelligence Doomers
https://gradual-disempowerment.ai/ is mostly about institutional progress, not narrow technical progress.

Roman Leventov Oct 14, 2024, 11:58 PM
5 points
0
on: AI research assistants competition 2024Q3: Tie between Elicit and You.com
Undermind.ai I think is much more useful for searching concepts and ideas in papers rather than extracting tabular info a la Elicit. Nominally Elicit can do the former, too, but is quite bad in my experience.

Roman Leventov Oct 12, 2024, 2:50 PM
4 points
0
on: The Great Data Integration Schlep
https://openmined.org/ develops Syft, a framework for “private computation” in secure enclaves. It potentially reduces the barriers for data integration both within particularly bureaucratic orgs and across orgs.

Roman Leventov Oct 12, 2024, 1:04 PM
13 points
0
on: My theory of change for working in AI healthtech
Thanks for the post, I agree with it!
I just wrote a post with differential knowledge interconnection thesis, where I argue that it is on net beneficial to develop AI capabilities such as
- Federated learning, privacy-preserving multi-party computation, and privacy-preserving machine learning.
- Federated inference and belief sharing.
- Protocols and file formats for data, belief, or claim exchange and validation.
- Semantic knowledge mining and hybrid reasoning on (federated) knowledge graphs and multimodal data.
- Structured or semantic search.
- Datastore federation for retrieval-based LMs.
- Cross-language (such as, English/French) retrieval, search, and semantic knowledge integration. This is especially important for low-online-presence languages.
I discuss whether knowledge interconnection exacerbates or abates the risk if industrial dehumanization on net in a section. It’s a challenging question, but I reach the tentative conclusion that AI capabilities that favor obtaining and leveraging “interconnected” rather than “isolated” knowledge are on net risk-reducing. This is because the “human economy” is more complex than the hypothetical “pure machine-industrial economy”, and “knowledge interconnection” capabilities support that greater complexity.
Would you agree or disagree with this?

Differential knowledge interconnection

Roman LeventovOct 12, 2024, 12:52 PM

6 points

0 comments7 min readLW link

Roman Leventov Jun 1, 2024, 3:47 PM
6 points
0
on: There Should Be More Alignment-Driven Startups!
I think the model of commercial R&D lab would often suit alignment work better than a “classical” startup company. Conjecture and AE Studio come to mind. Answer.AI, founded by Jeremy Howard (of Fast.ai and Kaggle) and Eric Ries (Lean Startup) elaborates on this business and organisational model here: https://www.answer.ai/posts/2023-12-12-launch.html.

The AI Revolution in Biology

Roman LeventovMay 26, 2024, 9:30 AM

13 points

0 comments1 min readLW link

(www.cognitiverevolution.ai)

Roman Leventov May 14, 2024, 5:50 AM
2 points
0
in reply to: Roman Leventov’s comment on: The two-tiered society
But I should add, I agree that 1-3 poses challenging political and coordination problems. Nobody assumes it will be easy, including Acemoglu. It’s just another one in the row of hard political challenges posed by AI, along with the questions of “aligned with whom?”, considering/accounting for people’s voice past dysfunctional governments and political elites in general, etc.

Roman Leventov May 14, 2024, 5:41 AM
2 points
0
in reply to: FlorianH’s comment on: The two-tiered society
Separately, I at least spontaneously wonder: How would one even want to go about differentiating what is the ‘bad automation’ to be discouraged, from legit automation without which no modern economy could competitively run anyway? For a random example, say if Excel wouldn’t yet exist (or, for its next update..), we’d have to say: Sorry, cannot do such software, as any given spreadsheet has the risk of removing thousands of hours of work...?! Or at least: Please, Excel, ask the human to manually confirm each cell’s calculation...?? So I don’t know how we’d in practice enforce non-automation. Just ‘it uses a large LLM’ feels weirdly arbitrary condition—though, ok, I could see how, due to a lack of alternatives, one might use something like that as an ad-hoc criterion, with all the problems it brings. But again, I think points 1. & 2. mean this is unrealistic or unsuccessful anyway.
Clearly, specific rule-based regulation is a dumb strategy. Acemoglu’s suggestions: tax incentives to keep employment and “labour voice” to let people decide in the context of specific company and job how they want to work with AI. I like this self-governing strategy. Basically, the idea is that people will want to keep influencing things and will resist “job bullshittification” done to them, if they have the political power (“labour voice”). But they should also have alternative choice of technology and work arrangement/method that doesn’t turn their work into rubber-stamping bullshit, but also alleviates the burden (“machine usefulness”). Because if they only have the choice between rubber-stamping bullshit job and burdensome job without AI, they may choose rubber-stamping.

Roman Leventov May 14, 2024, 5:33 AM
2 points
0
in reply to: FlorianH’s comment on: The two-tiered society
If you’d really be able to coordinate globally to enable 1. or 2. globally—extremely unlikely in the current environment and given the huge incentives for individual countries to remain weak in enforcement—then it seems you might as well try to impose directly the economic first best solution w.r.t. robots vs. labor: high global tax rates and redistribution.
If anything, this problem seems more pernicious wrt. climate change mitigation and environmental damage: it’s much more distributed, not only in US and China, but Russia and India are also big emitters, big leverage in Brazil, Congo, and Indonesia with their forests, overfishing and ocean pollution everywhere, etc.
With AI, it’s basically the question of regulating US and UK companies: EU is always eager to over-regulate relative to the US, and China is already successfully and closely regulating their AI for a variety of reasons (which Acemoglu points out). The big problem of the Chinese economy is weak internal demand, and automating jobs and therefore increasing inequality and decreasing the local purchasing power is the last thing that China wants.

Roman Leventov May 13, 2024, 5:20 PM
2 points
0
in reply to: Gordon Seidoh Worley’s comment on: The two-tiered society
What levels of automation does the AI provide and at what rate is what he suggests to influence directly (specifically, slow down), through economic and political measures. So it’s not fair to list that as an assumption.

Roman Leventov May 13, 2024, 5:09 PM
2 points
0
in reply to: Viliam’s comment on: The two-tiered society
It would depend on exact details, but if a machine can do something as well or better than a human, then the machine should do it.
It’s a question of how to design work. Machine can cultivate better than a human a monoculture mega-farm, but not a small permaculture garden (at least, yet). Is a monoculture mega-farm more “effective”? Maybe, if we take the pre-AI opportunity cost of human labour, but also maybe not with the post-AI opportunity cost of human labour. And this is before factoring in the “economic value” of better psychological and physical health of people who work on small farms vs. those who eat processed food on their couches that is done from the crops grown on monoculture mega-farms, and do nothing.
As I understand, Acemoglu rougly suggests to look for ways to apply this logic in other domain of economy, including the knowledge economy. Yes, it’s not guaranteed that such arrangements will stay economical for a long time (but it’s also not beyond my imagination, especially if we factor in the economic value of physical and psychological health), but it may set the economy and the society on a different trajectory with higher chances of eventualities that we would consider “not doom”.
What does “foster labour voice” even mean?
Unions 2.0, or something like holacracy?
Especially in companies where everything is automated.
Not yet. Clearly, what he suggests could only remain effective for a limited time.
You can give more power to current employees of current companies, but soon there will be new startups with zero employees (or where, for tax reasons, owners will formally employ their friends or family members).
Not that soon at all, if we speak about the real economy. In IT sector, I suspect that Big Techs will win big in the AI race because only they have deep enough pockets (you already see Inflection AI quasi-acquired by MS, Stability essentially bust, etc.). And Big Techs still have huge workforces and it won’t be just Nadella or just Pichai anytime soon. Many other knowledge sectors (banks, law) are regulated and also won’t shed employees that fast.
Human-complementary AI technologies again sounds like a bullshit job, only mostly did by a machine, where a human is involved somewhere in the loop, but the machine could still do his part better, too.
In my gardening example, a human may wear AI goggles that tell them which plants or animal species do their see or what disease a plant has.
Tax on media platforms—solves a completely different problem. Yes, it is important to care about public mental health. But that is separate from the problem of technological unemployment. (You could have technological unemployment even in the universe where all social media are banned.)
Tax on media platforms is just a concrete example of how “reforming business models” could be done in practice, maybe not the best one (but it’s not my example). I will carry on with my gardening example and suggest “tax on fertiliser”: make it so huge that megafarms (which require a lot of fertiliser) become less economical than permaculture gardens. Because without such a push, permaculture gardens won’t magically materialise. Acemoglu underscores this point multiple times: it’s not a matter of pure technological invention and application of it in a laissez-faire market to switch to a different socioeconomic trajectory. Inventing AI goggles for gardening (or any other technology which makes permaculture gardening arbitrarily convenient) won’t make the economy to switch from monoculture mega-farms without an extra push.
Perhaps, Acemoglu also has something in his mind about attention/creator economy and the automation that may happen to them (AI influencers can replace human influencers) when he talks about “digital ad tax”, but I don’t see it.

The two-tiered society

Roman Leventov13 May 2024 7:53 UTC

5 points

9 comments3 min readLW link

Roman Leventov 30 Mar 2024 13:08 UTC
2 points
0
on: On attunement
John Vervaeke calls attunement “relevance realization”.

Roman Leventov 27 Mar 2024 3:16 UTC
14 points
9
on: Modern Transformers are AGI, and Human-Level
Cf. DeepMind’s “Levels of AGI” paper (https://arxiv.org/abs/2311.02462), calling modern transformers “emerging AGI” there, but also defining “expert”, “virtuoso”, and “superhuman” AGI.

Roman Leventov 24 Mar 2024 13:24 UTC
2 points
0
in reply to: Mateusz Bagiński’s comment on: AI Alignment Metastrategy
Humane/acc, https://twitter.com/AndrewCritchPhD

Roman Leventov

Differ­en­tial knowl­edge interconnection

The AI Revolu­tion in Biology

The two-tiered society

Differential knowledge interconnection

The AI Revolution in Biology