RSS

Michaël Trazzi

Karma: 1,861

theinsideview.ai

Finish­ing The SB-1047 Doc­u­men­tary In 6 Weeks

Michaël Trazzi28 Oct 2024 20:17 UTC
93 points
5 comments4 min readLW link
(manifund.org)

Owain Evans on Si­tu­a­tional Aware­ness and Out-of-Con­text Rea­son­ing in LLMs

Michaël Trazzi24 Aug 2024 4:30 UTC
55 points
0 comments5 min readLW link

Paul Chris­ti­ano’s views on “doom” (video ex­plainer)

Michaël Trazzi29 Sep 2023 21:56 UTC
15 points
0 comments1 min readLW link
(youtu.be)

Neel Nanda on the Mechanis­tic In­ter­pretabil­ity Re­searcher Mindset

Michaël Trazzi21 Sep 2023 19:47 UTC
37 points
1 comment3 min readLW link
(theinsideview.ai)

Panel with Is­raeli Prime Minister on ex­is­ten­tial risk from AI

Michaël Trazzi18 Sep 2023 23:16 UTC
22 points
2 comments1 min readLW link
(x.com)

Eric Michaud on the Quan­ti­za­tion Model of Neu­ral Scal­ing, In­ter­pretabil­ity and Grokking

Michaël Trazzi12 Jul 2023 22:45 UTC
10 points
0 comments2 min readLW link
(theinsideview.ai)

Jesse Hoogland on Devel­op­men­tal In­ter­pretabil­ity and Sin­gu­lar Learn­ing Theory

Michaël Trazzi6 Jul 2023 15:46 UTC
42 points
2 comments4 min readLW link
(theinsideview.ai)

[Question] Should Au­toGPT up­date us to­wards re­search­ing IDA?

Michaël Trazzi12 Apr 2023 16:41 UTC
15 points
5 comments1 min readLW link

Col­lin Burns on Align­ment Re­search And Dis­cov­er­ing La­tent Knowl­edge Without Supervision

Michaël Trazzi17 Jan 2023 17:21 UTC
25 points
5 comments4 min readLW link
(theinsideview.ai)

Vic­to­ria Krakovna on AGI Ruin, The Sharp Left Turn and Paradigms of AI Alignment

Michaël Trazzi12 Jan 2023 17:09 UTC
40 points
3 comments4 min readLW link
(www.theinsideview.ai)

David Krueger on AI Align­ment in Academia, Co­or­di­na­tion and Test­ing Intuitions

Michaël Trazzi7 Jan 2023 19:59 UTC
13 points
0 comments4 min readLW link
(theinsideview.ai)

Ethan Ca­ballero on Bro­ken Neu­ral Scal­ing Laws, De­cep­tion, and Re­cur­sive Self Improvement

4 Nov 2022 18:09 UTC
16 points
11 comments10 min readLW link
(theinsideview.ai)

Sha­har Avin On How To Reg­u­late Ad­vanced AI Systems

Michaël Trazzi23 Sep 2022 15:46 UTC
31 points
0 comments4 min readLW link
(theinsideview.ai)

Katja Grace on Slow­ing Down AI, AI Ex­pert Sur­veys And Es­ti­mat­ing AI Risk

Michaël Trazzi16 Sep 2022 17:45 UTC
40 points
2 comments3 min readLW link
(theinsideview.ai)

Alex Lawsen On Fore­cast­ing AI Progress

Michaël Trazzi6 Sep 2022 9:32 UTC
18 points
0 comments2 min readLW link
(theinsideview.ai)

Robert Long On Why Ar­tifi­cial Sen­tience Might Matter

Michaël Trazzi28 Aug 2022 17:30 UTC
26 points
5 comments5 min readLW link
(theinsideview.ai)

Ethan Perez on the In­verse Scal­ing Prize, Lan­guage Feed­back and Red Teaming

Michaël Trazzi24 Aug 2022 16:35 UTC
26 points
0 comments3 min readLW link
(theinsideview.ai)

Con­nor Leahy on Dy­ing with Dig­nity, EleutherAI and Conjecture

Michaël Trazzi22 Jul 2022 18:44 UTC
195 points
29 comments14 min readLW link
(theinsideview.ai)

Raphaël Millière on Gen­er­al­iza­tion and Scal­ing Maximalism

Michaël Trazzi24 Jun 2022 18:18 UTC
21 points
2 comments4 min readLW link
(theinsideview.ai)

Blake Richards on Why he is Skep­ti­cal of Ex­is­ten­tial Risk from AI

Michaël Trazzi14 Jun 2022 19:09 UTC
41 points
12 comments4 min readLW link
(theinsideview.ai)