RSS

cloud

Karma: 130

Gra­di­ent Rout­ing: Mask­ing Gra­di­ents to Lo­cal­ize Com­pu­ta­tion in Neu­ral Networks

6 Dec 2024 22:19 UTC
148 points
7 comments11 min readLW link
(arxiv.org)