RSS

Open Agency Architecture

TagLast edit: Apr 10, 2024, 10:53 PM by Chipmonk

The Open Agency Architecture (“OAA”) is an AI alignment proposal by (among others) @davidad and @Eric Drexler.

See Davidad’s Provably Safe AI Architecture—ARIA’s Programme Thesis for the most up-to-date (2024 Feb 1) and quite readable explanation.

A shorter but older explanation is also available here.

New better link: https://​​www.aria.org.uk/​​programme-safeguarded-ai/​​

Related:

Davi­dad’s Bold Plan for Align­ment: An In-Depth Explanation

Apr 19, 2023, 4:09 PM
168 points
40 comments21 min readLW link2 reviews

An Open Agency Ar­chi­tec­ture for Safe Trans­for­ma­tive AI

davidadDec 20, 2022, 1:04 PM
80 points
22 comments4 min readLW link

A list of core AI safety prob­lems and how I hope to solve them

davidadAug 26, 2023, 3:12 PM
165 points
29 comments5 min readLW link

Gaia Net­work: a prac­ti­cal, in­cre­men­tal path­way to Open Agency Architecture

Dec 20, 2023, 5:11 PM
22 points
8 comments16 min readLW link

The Open Agency Model

Eric DrexlerFeb 22, 2023, 10:35 AM
114 points
18 comments4 min readLW link

So­ci­aLLM: pro­posal for a lan­guage model de­sign for per­son­al­ised apps, so­cial sci­ence, and AI safety research

Roman LeventovDec 19, 2023, 4:49 PM
17 points
5 comments3 min readLW link

AGI will be made of het­ero­ge­neous com­po­nents, Trans­former and Selec­tive SSM blocks will be among them

Roman LeventovDec 27, 2023, 2:51 PM
33 points
9 comments4 min readLW link

Prov­ably Safe AI: Wor­ld­view and Projects

Aug 9, 2024, 11:21 PM
54 points
44 comments7 min readLW link

Roadmap for a col­lab­o­ra­tive pro­to­type of an Open Agency Architecture

Deger TuranMay 10, 2023, 5:41 PM
31 points
0 comments12 min readLW link

Safety First: safety be­fore full al­ign­ment. The de­on­tic suffi­ciency hy­poth­e­sis.

ChipmonkJan 3, 2024, 5:55 PM
48 points
3 comments3 min readLW link

Ap­ply to the Con­cep­tual Boundaries Work­shop for AI Safety

ChipmonkNov 27, 2023, 9:04 PM
50 points
0 comments3 min readLW link

Towards Guaran­teed Safe AI: A Frame­work for En­sur­ing Ro­bust and Reli­able AI Systems

Joar SkalseMay 17, 2024, 7:13 PM
67 points
10 comments2 min readLW link

«Boundaries/​Mem­branes» and AI safety compilation

ChipmonkMay 3, 2023, 9:41 PM
56 points
17 comments8 min readLW link

Davi­dad’s Prov­ably Safe AI Ar­chi­tec­ture—ARIA’s Pro­gramme Thesis

simeon_cFeb 1, 2024, 9:30 PM
69 points
17 comments1 min readLW link
(www.aria.org.uk)

What does davi­dad want from «bound­aries»?

Feb 6, 2024, 5:45 PM
47 points
1 comment5 min readLW link

An­nounc­ing At­las Computing

miyazonoApr 11, 2024, 3:56 PM
44 points
4 comments4 min readLW link

Why I find Davi­dad’s plan interesting

Paul WMay 20, 2024, 8:13 AM
18 points
0 comments6 min readLW link
No comments.