I’m still confused about the part where you use the Hoeffding inequality—how is the lambda in that step and the lambda in the loss function “the same lambda”?
Because f=λ⋅ΔL. They are the same. Does that help?
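To spell that out, here is a sketch under an assumption the thread does not state: that this is the usual PAC-Bayes-style argument, with losses in [0, 1], n i.i.d. samples, a prior π chosen independently of the data, a posterior ρ, and ΔL(h) = L(h) − L̂_n(h) the gap between true and empirical risk. The symbols π, ρ, n and δ are my notation, not necessarily yours. The point is that λ is chosen once, when f is defined, and every later λ is that same number.

```latex
\begin{align*}
% Step 1: change of measure (Donsker--Varadhan) applied to f = \lambda\,\Delta L
\lambda\,\mathbb{E}_{h\sim\rho}[\Delta L(h)]
  &\le \mathrm{KL}(\rho\,\|\,\pi)
     + \log \mathbb{E}_{h\sim\pi}\!\left[e^{\lambda\,\Delta L(h)}\right] \\
% Step 2: Hoeffding's lemma, with the same \lambda as the exponent.
% \Delta L(h) is an average of n centered terms with range 1, so each
% term contributes a factor of at most e^{(\lambda/n)^2/8}.
\mathbb{E}_{S}\!\left[e^{\lambda\,\Delta L(h)}\right]
  &\le \left(e^{\lambda^2/(8n^2)}\right)^{n} = e^{\lambda^2/(8n)} \\
% Step 3: Markov's inequality over the sample (probability at least 1-\delta),
% then divide through by \lambda
\mathbb{E}_{h\sim\rho}[L(h)]
  &\le \mathbb{E}_{h\sim\rho}[\hat{L}_n(h)]
     + \frac{\mathrm{KL}(\rho\,\|\,\pi) + \log(1/\delta)}{\lambda}
     + \frac{\lambda}{8n}
\end{align*}
```

Read this way, the λ in the Hoeffding step is just the scale factor inside f, and the λ that weights the KL term in the final objective is what is left after dividing by it. If your setup differs (unbounded losses, a data-dependent prior), the constants change but the role of λ should not.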