
Roman Leventov

Karma: 1,431

An independent researcher, blogger, and philosopher writing about intelligence and agency (esp. Active Inference), alignment, ethics, the interaction of the AI transition with sociotechnical risks (epistemics, economics, human psychology), collective mind architecture, and research strategy and methodology.

Twitter: https://twitter.com/leventov. E-mail: leventov.ru@gmail.com (the preferred mode of communication). I’m open to collaborations and work.

Presentations at meetups, workshops and conferences, some recorded videos.

I’m a founding member of the Gaia Consortium, on a mission to create a global, decentralised system for collective sense-making and decision-making, i.e., civilisational intelligence. Drop me a line if you want to learn more about it and/or join the consortium.

You can help to boost my sense of accountability and give me a feeling that my work is valued by becoming a paid subscriber of my Substack (though I don’t post anything paywalled; in fact, on this blog, I just syndicate my LessWrong writing).

For Russian speakers: Russian-language AI safety network, Telegram group.

An LLM-based “exemplary actor”

Roman Leventov · May 29, 2023, 11:12 AM
16 points
0 comments · 12 min read · LW link

Aligning an H-JEPA agent via training on the outputs of an LLM-based “exemplary actor”

Roman Leventov · May 29, 2023, 11:08 AM
12 points
10 comments · 30 min read · LW link

[Question] AI interpretability could be harmful?

Roman Leventov · May 10, 2023, 8:43 PM
13 points
2 comments · 1 min read · LW link

H-JEPA might be technically alignable in a modified form

Roman Leventov · May 8, 2023, 11:04 PM
12 points
2 comments · 7 min read · LW link

Annotated reply to Bengio’s “AI Scientists: Safe and Useful AI?”

Roman Leventov · May 8, 2023, 9:26 PM
18 points
2 comments · 7 min read · LW link
(yoshuabengio.org)

For alignment, we should simultaneously use multiple theories of cognition and value

Roman Leventov · Apr 24, 2023, 10:37 AM
23 points
5 comments · 5 min read · LW link

An open letter to SERI MATS program organisers

Roman Leventov · Apr 20, 2023, 4:34 PM
26 points
26 comments · 4 min read · LW link

Scientism vs. people

Roman Leventov · Apr 18, 2023, 5:28 PM
4 points
4 comments · 11 min read · LW link

Goal alignment without alignment on epistemology, ethics, and science is futile

Roman Leventov · Apr 7, 2023, 8:22 AM
20 points
2 comments · 2 min read · LW link

Yoshua Bengio: “Slowing down development of AI systems passing the Turing test”

Roman Leventov · Apr 6, 2023, 3:31 AM
49 points
2 comments · 5 min read · LW link
(yoshuabengio.org)

Emergent Analogical Reasoning in Large Language Models

Roman Leventov · Mar 22, 2023, 5:18 AM
13 points
2 comments · 1 min read · LW link
(arxiv.org)

Will people be motivated to learn difficult disciplines and skills without economic incentive?

Roman Leventov · Mar 20, 2023, 9:26 AM
18 points
33 comments · 5 min read · LW link

A reply to Byrnes on the Free Energy Principle

Roman Leventov · Mar 3, 2023, 1:03 PM
28 points
16 comments · 14 min read · LW link

Joscha Bach on Synthetic Intelligence [annotated]

Roman Leventov · Mar 2, 2023, 11:02 AM
10 points
1 comment · 9 min read · LW link
(www.jimruttshow.com)

Powerful mesa-optimisation is already here

Roman Leventov · Feb 17, 2023, 4:59 AM
35 points
1 comment · 2 min read · LW link
(arxiv.org)

The Linguistic Blind Spot of Value-Aligned Agency, Natural and Artificial

Roman Leventov · Feb 14, 2023, 6:57 AM
6 points
0 comments · 2 min read · LW link
(arxiv.org)

Morphological intelligence, superhuman empathy, and ethical arbitration

Roman Leventov · Feb 13, 2023, 10:25 AM
1 point
0 comments · 2 min read · LW link

A multi-disciplinary view on AI safety research

Roman Leventov · Feb 8, 2023, 4:50 PM
46 points
4 comments · 26 min read · LW link

Temporally Layered Architecture for Adaptive, Distributed and Continuous Control

Roman Leventov · Feb 2, 2023, 6:29 AM
6 points
4 comments · 1 min read · LW link
(arxiv.org)

[Question] Has private AGI research made independent safety research ineffective already? What should we do about this?

Roman Leventov · Jan 23, 2023, 7:36 AM
43 points
5 comments · 5 min read · LW link