Roman Leventov

Karma: 1,431

An independent researcher/blogger/philosopher about intelligence and agency (esp. Active Inference), alignment, ethics, interaction of the AI transition with the sociotechnical risks (epistemics, economics, human psychology), collective mind architecture, research strategy and methodology.

Twitter: https://twitter.com/leventov. E-mail: leventov.ru@gmail.com (the preferred mode of communication). I’m open to collaborations and work.

Presentations at meetups, workshops and conferences, some recorded videos.

I’m a founding member of the Gaia Consoritum, on a mission to create a global, decentralised system for collective sense-making and decision-making, i.e., civilisational intelligence. Drop me a line if you want to learn more about it and/or join the consoritum.

You can help to boost my sense of accountability and give me a feeling that my work is valued by becoming a paid subscriber of my Substack (though I don’t post anything paywalled; in fact, on this blog, I just syndicate my LessWrong writing).

For Russian speakers: русскоязычная сеть по безопасности ИИ, Telegram group.

An LLM-based “exemplary actor”

Roman LeventovMay 29, 2023, 11:12 AM

16 points

0 comments12 min readLW link

Aligning an H-JEPA agent via training on the outputs of an LLM-based “exemplary actor”

Roman LeventovMay 29, 2023, 11:08 AM

12 points

10 comments30 min readLW link

[Question] AI interpretability could be harmful?

Roman LeventovMay 10, 2023, 8:43 PM

13 points

2 comments1 min readLW link

H-JEPA might be technically alignable in a modified form

Roman LeventovMay 8, 2023, 11:04 PM

12 points

2 comments7 min readLW link

Annotated reply to Bengio’s “AI Scientists: Safe and Useful AI?”

Roman LeventovMay 8, 2023, 9:26 PM

18 points

2 comments7 min readLW link

(yoshuabengio.org)

For alignment, we should simultaneously use multiple theories of cognition and value

Roman LeventovApr 24, 2023, 10:37 AM

23 points

5 comments5 min readLW link

An open letter to SERI MATS program organisers

Roman LeventovApr 20, 2023, 4:34 PM

26 points

26 comments4 min readLW link

Scientism vs. people

Roman LeventovApr 18, 2023, 5:28 PM

4 points

4 comments11 min readLW link

Goal alignment without alignment on epistemology, ethics, and science is futile

Roman LeventovApr 7, 2023, 8:22 AM

20 points

2 comments2 min readLW link

Yoshua Bengio: “Slowing down development of AI systems passing the Turing test”

Roman LeventovApr 6, 2023, 3:31 AM

49 points

2 comments5 min readLW link

(yoshuabengio.org)

Emergent Analogical Reasoning in Large Language Models

Roman LeventovMar 22, 2023, 5:18 AM

13 points

2 comments1 min readLW link

(arxiv.org)

Will people be motivated to learn difficult disciplines and skills without economic incentive?

Roman LeventovMar 20, 2023, 9:26 AM

18 points

33 comments5 min readLW link

A reply to Byrnes on the Free Energy Principle

Roman LeventovMar 3, 2023, 1:03 PM

28 points

16 comments14 min readLW link

Joscha Bach on Synthetic Intelligence [annotated]

Roman LeventovMar 2, 2023, 11:02 AM

10 points

1 comment9 min readLW link

(www.jimruttshow.com)

Powerful mesa-optimisation is already here

Roman LeventovFeb 17, 2023, 4:59 AM

35 points

1 comment2 min readLW link

(arxiv.org)

The Linguistic Blind Spot of Value-Aligned Agency, Natural and Artificial

Roman LeventovFeb 14, 2023, 6:57 AM

6 points

0 comments2 min readLW link

(arxiv.org)

Morphological intelligence, superhuman empathy, and ethical arbitration

Roman LeventovFeb 13, 2023, 10:25 AM

1 point

0 comments2 min readLW link

A multi-disciplinary view on AI safety research

Roman LeventovFeb 8, 2023, 4:50 PM

46 points

4 comments26 min readLW link

Temporally Layered Architecture for Adaptive, Distributed and Continuous Control

Roman LeventovFeb 2, 2023, 6:29 AM

6 points

4 comments1 min readLW link

(arxiv.org)

[Question] Has private AGI research made independent safety research ineffective already? What should we do about this?

Roman LeventovJan 23, 2023, 7:36 AM

43 points

5 comments5 min readLW link