I think some people use the loss when all features are set to zero, instead of strictly doing
I think this is an unfinished