Simon Lermen

Karma: 794

Twitter: @SimonLermenAI

Applying refusal-vector ablation to a Llama 3 70B agent

Simon Lermen
May 11, 2024, 12:08 AM
51 points
14 comments, 7 min read, LW link

Creating unrestricted AI Agents with Command R+

Simon Lermen
Apr 16, 2024, 2:52 PM
77 points
13 comments, 5 min read, LW link

unRLHF - Efficiently undoing LLM safeguards

Oct 12, 2023, 7:58 PM
117 points
15 comments, 20 min read, LW link

LoRA Fine-tuning Efficiently Undoes Safety Training from Llama 2-Chat 70B

Oct 12, 2023, 7:58 PM
151 points
29 comments, 14 min read, LW link