alamerton

Karma: 42

I’m currently working as founder and research lead at Formation Research on technical interventions for lock-in risk, and part-time as a research assistant at King’s College London on clinical machine learning benchmarking.

My website is here.

Digital Error Correction and Lock-In

alamerton Apr 8, 2025, 3:46 PM
1 point
0 comments, 5 min read, LW link
(alfielamerton.substack.com)

Organisation-Level Lock-In Risk Interventions

alamerton Apr 1, 2025, 12:42 PM
5 points
0 comments, 8 min read, LW link

Recommender Alignment for Lock-In Risk

alamerton Mar 24, 2025, 12:56 PM
2 points
0 comments, 7 min read, LW link

Stacity: a Lock-In Risk Benchmark for Large Language Models

alamerton Mar 13, 2025, 12:08 PM
3 points
0 comments, 1 min read, LW link
(huggingface.co)

Lock-In Threat Models

alamerton Mar 10, 2025, 10:22 AM
5 points
0 comments, 8 min read, LW link

What is Lock-In?

alamerton Mar 6, 2025, 11:09 AM
5 points
0 comments, 9 min read, LW link

Formation Research: Organisation Overview

alamerton Mar 4, 2025, 3:03 PM
5 points
0 comments, 11 min read, LW link

In-Context Learning: An Alignment Survey

alamerton Sep 30, 2024, 6:44 PM
8 points
0 comments, 20 min read, LW link
(docs.google.com)

A Review of In-Context Learning Hypotheses for Automated AI Alignment Research

alamerton Apr 18, 2024, 6:29 PM
25 points
4 comments, 16 min read, LW link