RSS

Alex Gibson

Karma: 7

Us­ing the prob­a­bil­is­tic method to bound the perfor­mance of toy transformers

Alex GibsonJan 21, 2025, 11:01 PM
1 point
0 comments3 min readLW link

Con­tex­tual at­ten­tion heads in the first layer of GPT-2

Alex GibsonJan 20, 2025, 1:24 PM
6 points
0 comments13 min readLW link

Du­pli­cate to­ken neu­rons in the first layer of GPT-2

Alex GibsonDec 27, 2024, 4:21 AM
2 points
0 comments5 min readLW link

Alex Gib­son’s Shortform

Alex GibsonDec 27, 2024, 4:21 AM
1 point
0 comments1 min readLW link