RSS

Rareș Baron

Karma: 7

Minor in­ter­pretabil­ity ex­plo­ra­tion #4: Lay­erNorm and the learn­ing coefficient

Rareș BaronMar 20, 2025, 4:18 PM
2 points
0 comments1 min readLW link

Minor in­ter­pretabil­ity ex­plo­ra­tion #3: Ex­tend­ing su­per­po­si­tion to differ­ent ac­ti­va­tion func­tions (loss land­scape)

Rareș BaronMar 14, 2025, 3:45 PM
3 points
0 comments3 min readLW link

Minor in­ter­pretabil­ity ex­plo­ra­tion #2: Ex­tend­ing su­per­po­si­tion to differ­ent ac­ti­va­tion functions

Rareș BaronMar 6, 2025, 11:22 AM
1 point
0 comments4 min readLW link

Minor in­ter­pretabil­ity ex­plo­ra­tion #1: Grokking of mod­u­lar ad­di­tion, sub­trac­tion, mul­ti­pli­ca­tion, for differ­ent ac­ti­va­tion functions

Rareș BaronFeb 26, 2025, 11:35 AM
3 points
13 comments4 min readLW link