RSS

Esben Kran

Karma: 533

Catas­trophic Cy­ber Ca­pa­bil­ities Bench­mark (3CB): Ro­bustly Eval­u­at­ing LLM Agent Cy­ber Offense Capabilities

Nov 5, 2024, 1:01 AM
8 points
0 comments6 min readLW link
(www.apartresearch.com)

Can star­tups be im­pact­ful in AI safety?

Sep 13, 2024, 7:00 PM
15 points
0 comments6 min readLW link

Find­ing De­cep­tion in Lan­guage Models

Aug 20, 2024, 9:42 AM
20 points
4 comments4 min readLW link

Re­sults from the AI x Democ­racy Re­search Sprint

Jun 14, 2024, 4:40 PM
13 points
0 comments6 min readLW link

De­mon­strate and eval­u­ate risks from AI to so­ciety at the AI x Democ­racy re­search hackathon

Esben KranApr 19, 2024, 2:46 PM
5 points
0 commentsLW link
(www.apartresearch.com)

Join the AI Eval­u­a­tion Tasks Bounty Hackathon

Esben KranMar 18, 2024, 8:15 AM
12 points
1 commentLW link

Multi-Agent Se­cu­rity Hackathon

Feb 5, 2024, 10:51 PM
6 points
0 comments1 min readLW link

Iden­ti­fy­ing se­man­tic neu­rons, mechanis­tic cir­cuits & in­ter­pretabil­ity web apps

Apr 13, 2023, 11:59 AM
18 points
0 comments8 min readLW link

An­nounc­ing the Euro­pean Net­work for AI Safety (ENAIS)

Esben KranMar 22, 2023, 5:57 PM
19 points
0 commentsLW link

Au­to­mated Sand­wich­ing & Quan­tify­ing Hu­man-LLM Co­op­er­a­tion: ScaleOver­sight hackathon results

Feb 23, 2023, 10:48 AM
8 points
0 comments6 min readLW link

Gen­er­al­iz­abil­ity & Hope for AI [MLAISU W03]

Esben KranJan 20, 2023, 10:06 AM
5 points
2 comments2 min readLW link
(newsletter.apartresearch.com)

Ro­bust­ness & Evolu­tion [MLAISU W02]

Esben KranJan 13, 2023, 3:47 PM
10 points
0 comments3 min readLW link
(newsletter.apartresearch.com)

AI im­prov­ing AI [MLAISU W01!]

Esben KranJan 6, 2023, 11:13 AM
5 points
0 comments4 min readLW link
(newsletter.apartresearch.com)

Re­sults from the AI test­ing hackathon

Esben KranJan 2, 2023, 3:46 PM
13 points
0 commentsLW link

Will Machines Ever Rule the World? MLAISU W50

Esben KranDec 16, 2022, 11:03 AM
12 points
7 comments4 min readLW link
(newsletter.apartresearch.com)

Join the AI Test­ing Hackathon this Friday

Esben KranDec 12, 2022, 2:24 PM
10 points
0 commentsLW link

ML Safety at NeurIPS & Paradig­matic AI Safety? MLAISU W49

Dec 9, 2022, 10:38 AM
19 points
0 comments4 min readLW link
(newsletter.apartresearch.com)

NeurIPS Safety & ChatGPT. MLAISU W48

Dec 2, 2022, 3:50 PM
3 points
0 comments4 min readLW link
(newsletter.apartresearch.com)

Re­sults from the in­ter­pretabil­ity hackathon

Nov 17, 2022, 2:51 PM
81 points
0 comments6 min readLW link
(alignmentjam.com)

[Book] In­ter­pretable Ma­chine Learn­ing: A Guide for Mak­ing Black Box Models Explainable

Esben KranOct 31, 2022, 11:38 AM
20 points
1 comment1 min readLW link
(christophm.github.io)