Stephen McAleese answers "When is reward ever the optimization target?"