RSS

Megan Kinniment

Karma: 422

I work at ARC Evals. I like language models.

Am very happy for people to ask to chat—but I might be too busy to accept (message me).

In­tro­duc­ing METR’s Au­ton­omy Eval­u­a­tion Resources

Mar 15, 2024, 11:16 PM
90 points
0 comments1 min readLW link
(metr.github.io)

Bounty: Di­verse hard tasks for LLM agents

Dec 17, 2023, 1:04 AM
49 points
31 comments16 min readLW link

Send us ex­am­ple gnarly bugs

Dec 10, 2023, 5:23 AM
77 points
10 comments2 min readLW link

Steer­ing Be­havi­our: Test­ing for (Non-)My­opia in Lan­guage Models

Dec 5, 2022, 8:28 PM
40 points
19 comments10 min readLW link

Re­call and Re­gur­gi­ta­tion in GPT2

Megan KinnimentOct 3, 2022, 7:35 PM
43 points
1 comment26 min readLW link

Try­ing out Prompt Eng­ineer­ing on TruthfulQA

Megan KinnimentJul 23, 2022, 2:04 AM
10 points
0 comments8 min readLW link

Me­gan Kin­ni­ment’s Shortform

Megan KinnimentJul 14, 2022, 11:49 PM
3 points
1 comment1 min readLW link

GPT-3 Catch­ing Fish in Morse Code

Megan KinnimentJun 30, 2022, 9:22 PM
117 points
27 comments8 min readLW link

Ex­plor­ing Mild Be­havi­our in Embed­ded Agents

Megan KinnimentJun 27, 2022, 6:56 PM
21 points
4 comments18 min readLW link