Luke Bailey

Karma: 96

Stanford PhD Student

Image Hijacks: Adversarial Images can Control Generative Models at Runtime

Sep 20, 2023, 3:23 PM
58 points
9 comments
1 min read
(arxiv.org)

Tensor Trust: An online game to uncover prompt injection vulnerabilities

Sep 1, 2023, 7:31 PM
30 points
0 comments
5 min read
(tensortrust.ai)

Examples of Prompts that Make GPT-4 Output Falsehoods

Jul 22, 2023, 8:21 PM
21 points
5 comments
6 min read