RSS

Joseph Miller

Karma: 1,473

Gra­di­ent Rout­ing: Mask­ing Gra­di­ents to Lo­cal­ize Com­pu­ta­tion in Neu­ral Networks

6 Dec 2024 22:19 UTC
153 points
12 comments11 min readLW link
(arxiv.org)