J(w) = f(w) - baseline
∂J(w)/∂w = ∂f(w)/∂w - ∂baseline/∂w
∂J(w)/∂w = ∂f(w)/∂w (as long as baseline is not a function of w, so that ∂baseline/∂w = 0)
Therefore, J(w) and f(w) have the same optimal point: their gradients are equal for every w, so they vanish at the same w.
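
The argument above can be checked numerically. The following is a minimal sketch, not a definitive implementation: the objective f, the constant BASELINE, and the helper names numerical_grad and gradient_ascent are all hypothetical choices made for illustration. It compares the finite-difference gradients of f and J and runs plain gradient ascent on both.

```python
def f(w):
    # Hypothetical objective, chosen only for illustration (maximum at w = 3).
    return -(w - 3.0) ** 2

BASELINE = 5.0  # assumed constant: it does not depend on w

def J(w):
    # Objective with the baseline subtracted.
    return f(w) - BASELINE

def numerical_grad(fn, w, eps=1e-6):
    # Central finite-difference approximation of d fn / d w.
    return (fn(w + eps) - fn(w - eps)) / (2 * eps)

def gradient_ascent(fn, w0=0.0, lr=0.1, steps=200):
    # Plain gradient ascent using the numerical gradient.
    w = w0
    for _ in range(steps):
        w += lr * numerical_grad(fn, w)
    return w

grad_f = numerical_grad(f, 1.0)
grad_J = numerical_grad(J, 1.0)
w_star_f = gradient_ascent(f)
w_star_J = gradient_ascent(J)

print(grad_f, grad_J)      # the two gradients agree up to rounding
print(w_star_f, w_star_J)  # both runs converge to the same w
```

The baseline shifts the value of the objective but not its slope, so both runs follow the same update sequence and end at the same w. If BASELINE were replaced by something that depends on w, the two gradients would no longer match.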