RSS

Cameron Berg

Karma: 1,670

Currently doing alignment and digital minds research @AE Studio

Meta AI Resident ’23, Cognitive science @ Yale ‘22, SERI MATS ’21, LTFF grantee.

Very interested in work at the intersection of AI x cognitive science x alignment x philosophy.

Mis­tral Large 2 (123B) ex­hibits al­ign­ment faking

Mar 27, 2025, 3:39 PM
80 points
4 comments13 min readLW link

Re­duc­ing LLM de­cep­tion at scale with self-other over­lap fine-tuning

Mar 13, 2025, 7:09 PM
155 points
40 comments6 min readLW link

Align­ment can be the ‘clean en­ergy’ of AI

Feb 22, 2025, 12:08 AM
67 points
8 comments8 min readLW link

Mak­ing a con­ser­va­tive case for alignment

Nov 15, 2024, 6:55 PM
208 points
67 comments7 min readLW link

Science ad­vances one funeral at a time

Nov 1, 2024, 11:06 PM
100 points
9 comments2 min readLW link

Self-pre­dic­tion acts as an emer­gent regularizer

Oct 23, 2024, 10:27 PM
91 points
9 comments4 min readLW link

The case for a nega­tive al­ign­ment tax

Sep 18, 2024, 6:33 PM
75 points
20 comments7 min readLW link

Self-Other Over­lap: A Ne­glected Ap­proach to AI Alignment

Jul 30, 2024, 4:22 PM
215 points
51 comments12 min readLW link

There Should Be More Align­ment-Driven Startups

May 31, 2024, 2:05 AM
62 points
14 comments11 min readLW link

Key take­aways from our EA and al­ign­ment re­search sur­veys

May 3, 2024, 6:10 PM
111 points
10 comments21 min readLW link

AE Stu­dio @ SXSW: We need more AI con­scious­ness re­search (and fur­ther re­sources)

Mar 26, 2024, 8:59 PM
67 points
8 comments3 min readLW link

Sur­vey for al­ign­ment re­searchers!

Feb 2, 2024, 8:41 PM
71 points
11 comments1 min readLW link

The ‘Ne­glected Ap­proaches’ Ap­proach: AE Stu­dio’s Align­ment Agenda

Dec 18, 2023, 8:35 PM
175 points
22 comments12 min readLW link1 review

Com­pu­ta­tional sig­na­tures of psychopathy

Cameron BergDec 19, 2022, 5:01 PM
30 points
3 comments20 min readLW link

AI re­searchers an­nounce Neu­roAI agenda

Cameron BergOct 24, 2022, 12:14 AM
37 points
12 comments6 min readLW link
(arxiv.org)

Align­ment via proso­cial brain algorithms

Cameron BergSep 12, 2022, 1:48 PM
45 points
30 comments6 min readLW link

Paradigm-build­ing: Con­clu­sion and prac­ti­cal takeaways

Cameron BergFeb 15, 2022, 4:11 PM
5 points
1 comment2 min readLW link

Ques­tion 5: The timeline hyperparameter

Cameron BergFeb 14, 2022, 4:38 PM
8 points
3 comments7 min readLW link

Ques­tion 4: Im­ple­ment­ing the con­trol proposals

Cameron BergFeb 13, 2022, 5:12 PM
6 points
2 comments5 min readLW link

Ques­tion 3: Con­trol pro­pos­als for min­i­miz­ing bad outcomes

Cameron BergFeb 12, 2022, 7:13 PM
5 points
1 comment7 min readLW link