
markov

Karma: 383

Understanding Benchmarks and motivating Evaluations

Feb 6, 2025, 1:32 AM
8 points
0 comments
11 min read
LW link
(ai-safety-atlas.com)

AI Safety 101 : Capabilities—Human Level AI, What? How? and When?

Mar 7, 2024, 5:29 PM
46 points
8 comments
54 min read
LW link

AI Safety Chatbot

Dec 21, 2023, 2:06 PM
61 points
11 comments
4 min read
LW link

AI Safety 101 : Reward Misspecification

markov
Oct 18, 2023, 8:39 PM
30 points
4 comments
31 min read
LW link

Is AI Safety dropping the ball on privacy?

markov
Sep 13, 2023, 1:07 PM
50 points
17 comments
7 min read
LW link

Stampy’s AI Safety Info—New Distillations #4 [July 2023]

markov
Aug 16, 2023, 7:03 PM
22 points
10 comments
1 min read
LW link
(aisafety.info)

Stampy’s AI Safety Info—New Distillations #3 [May 2023]

markov
Jun 6, 2023, 2:18 PM
16 points
0 comments
2 min read
LW link
(aisafety.info)

Stampy’s AI Safety Info—New Distillations #2 [April 2023]

markov
May 9, 2023, 1:31 PM
25 points
1 comment
1 min read
LW link
(aisafety.info)

Stampy’s AI Safety Info—New Distillations #1 [March 2023]

markov
Apr 7, 2023, 11:06 AM
42 points
0 comments
2 min read
LW link
(aisafety.info)