RSS

Singularian2501

Karma: 9

I like reading Machine Learning Paper.

Paper: Iden­ti­fy­ing the Risks of LM Agents with an LM-Emu­lated Sand­box—Univer­sity of Toronto 2023 - Bench­mark con­sist­ing of 36 high-stakes tools and 144 test cases!

Singularian2501Oct 9, 2023, 12:00 AM
6 points
0 comments1 min readLW link

RAIN: Your Lan­guage Models Can Align Them­selves with­out Fine­tun­ing—Microsoft Re­search 2023 - Re­duces the ad­ver­sar­ial prompt at­tack suc­cess rate from 94% to 19%!

Singularian2501Sep 24, 2023, 4:48 PM
5 points
0 comments1 min readLW link