RSS

Iknownothing

Karma: 83

Making a research platform for AI Alignment at https://​​ai-plans.com/​​
Come critique AI Alignment plans and get feedback on your alignment plan!

AI Law-a-Thon

IknownothingJan 28, 2024, 2:30 AM
5 points
3 comments1 min readLW link

Re­view of Align­ment Plan Cri­tiques- De­cem­ber AI-Plans Cri­tique-a-Thon Re­sults

IknownothingJan 15, 2024, 7:37 PM
24 points
0 comments25 min readLW link
(aiplans.substack.com)

Cri­tique-a-Thon of AI Align­ment Plans

IknownothingDec 5, 2023, 8:50 PM
12 points
3 comments1 min readLW link

Pro­posal for im­prov­ing state of al­ign­ment research

IknownothingNov 6, 2023, 1:55 PM
2 points
0 comments1 min readLW link

Look­ing for judges for cri­tiques of Align­ment Plans

IknownothingAug 17, 2023, 10:35 PM
6 points
0 comments1 min readLW link

[Question] Spe­cific Ar­gu­ments against open source LLMs?

IknownothingJul 30, 2023, 2:27 PM
4 points
2 comments1 min readLW link

AI-Plans.com 10-day Cri­tique-a-Thon

IknownothingJul 27, 2023, 11:44 AM
8 points
2 comments2 min readLW link
(manifund.org)

Sim­ple al­ign­ment plan that maybe works

IknownothingJul 18, 2023, 10:48 PM
4 points
8 comments1 min readLW link

Even briefer sum­mary of ai-plans.com

IknownothingJul 16, 2023, 11:25 PM
10 points
6 comments2 min readLW link
(www.ai-plans.com)

LeCun says mak­ing a util­ity func­tion is intractable

IknownothingJun 28, 2023, 6:02 PM
2 points
3 comments1 min readLW link

Brief sum­mary of ai-plans.com

IknownothingJun 28, 2023, 12:33 AM
9 points
4 comments2 min readLW link
(ai-plans.com)

An overview of the points system

IknownothingJun 27, 2023, 9:09 AM
3 points
4 comments1 min readLW link
(ai-plans.com)

AI-Plans.com—a con­tributable compendium

IknownothingJun 25, 2023, 2:40 PM
39 points
7 comments4 min readLW link
(ai-plans.com)

A more effec­tive Ele­va­tor Pitch for AI risk

IknownothingJun 15, 2023, 12:39 PM
2 points
0 comments1 min readLW link

A more grounded idea of AI risk

IknownothingMay 11, 2023, 9:48 AM
3 points
4 comments1 min readLW link

An Ig­no­rant View on Ineffec­tive­ness of AI Safety

IknownothingJan 7, 2023, 1:29 AM
14 points
7 comments3 min readLW link