If you’re interested in approximating Hessian-vector products efficiently for frontier-size models, this recent Anthropic paper describes a mechanism for doing so.
Ah nice, thanks! This looks really interesting and useful