RSS

Jonas Hallgren

Karma: 752

AI Safety person currently working on multi-agent coordination problems.

The Align­ment Map­ping Pro­gram: Forg­ing In­de­pen­dent Thinkers in AI Safety—A Pilot Retrospective

Jan 10, 2025, 4:22 PM
21 points
0 comments4 min readLW link

Med­i­ta­tion in­sights as phase shifts in your self-model

Jonas HallgrenJan 7, 2025, 10:09 AM
13 points
3 comments3 min readLW link

Model In­tegrity: MAI on Value Alignment

Jonas HallgrenDec 5, 2024, 5:11 PM
6 points
11 comments1 min readLW link
(meaningalignment.substack.com)

Re­pro­gram­ing the Mind: Med­i­ta­tion as a Tool for Cog­ni­tive Optimization

Jonas HallgrenJan 11, 2024, 12:03 PM
32 points
3 comments11 min readLW link

How well does your re­search adress the the­ory-prac­tice gap?

Jonas HallgrenNov 8, 2023, 11:27 AM
18 points
0 comments10 min readLW link

Jonas Hal­l­gren’s Shortform

Jonas HallgrenOct 11, 2023, 9:52 AM
3 points
13 commentsLW link

Ad­vice for new al­ign­ment peo­ple: Info Max

Jonas HallgrenMay 30, 2023, 3:42 PM
23 points
4 comments5 min readLW link

Re­spect for Boundaries as non-ar­bir­trary co­or­di­na­tion norms

Jonas HallgrenMay 9, 2023, 7:42 PM
9 points
3 comments7 min readLW link

Max Teg­mark’s new Time ar­ti­cle on how we’re in a Don’t Look Up sce­nario [Linkpost]

Jonas HallgrenApr 25, 2023, 3:41 PM
39 points
9 comments1 min readLW link
(time.com)

The Benefits of Distil­la­tion in Research

Jonas HallgrenMar 4, 2023, 5:45 PM
15 points
2 comments5 min readLW link

Power-Seek­ing = Min­imis­ing free energy

Jonas HallgrenFeb 22, 2023, 4:28 AM
21 points
10 comments7 min readLW link

Black Box In­ves­ti­ga­tion Re­search Hackathon

Sep 12, 2022, 7:20 AM
9 points
4 comments2 min readLW link

An­nounc­ing the Distil­la­tion for Align­ment Practicum (DAP)

Aug 18, 2022, 7:50 PM
23 points
3 comments3 min readLW link

[Question] Does agent foun­da­tions cover all fu­ture ML sys­tems?

Jonas HallgrenJul 25, 2022, 1:17 AM
2 points
0 comments1 min readLW link

[Question] Is it worth mak­ing a database for moral pre­dic­tions?

Jonas HallgrenAug 16, 2021, 2:51 PM
1 point
0 comments2 min readLW link

[Question] Is there any se­ri­ous at­tempt to cre­ate a sys­tem to figure out the CEV of hu­man­ity and if not, why haven’t we started yet?

Jonas HallgrenFeb 25, 2021, 10:06 PM
5 points
2 comments1 min readLW link