That is not what I was trying to say at all.… Lets try math’s notation. You have
Have a model M which is a function of the state of the world s and the target and hit, you take at time t, a1. The model gives a predicted state of the world at time 2, ps2.
So ps2 = M(s1, a1)
Lets say you have a measure U() over states of the world. Which gives a utility for the state.
U(s1) = u1
You build up a model of what U is Mu so that you don’t have to try to hit every state to find the U of that state. So you have
Mu(s1) = pu1
A normal optimisation process just adjusts a1 until pu2 is maximised. So
f(a1) = argmax Mu(M(s1,a1))
What I’m arguing for is is that you are not getting high U on an poorly defined problem you want to be spending a lot of time adjusting M not adjusting a if you get a poor u.
If your model M is inaccurate that is ps2 != s2 you can improve your pu2 by updating your model to minimise the sqrt of error in the prediction at time t with the action that happened at t and the actual state at t+1st+1
g(st) = argmin on M (sqrt(M(st,at)-st+1))
So in machine learnings case, your actions a are algorithms or systems you are testing and M is your model of what an intelligence is. If the turing test is giving your system a bad u, and you can’t easily predict the real world intelligences (e.g. humans) with your model of intelligence that is M(st,at)-st+1 is large, update your model of what an intelligence should be doing so that you can better predict what an intelligence should be. This will give you enable you to pick a better.
To give an example less close to home. If you are trying to teach kids, don’t just try lots of different teaching styles and see which ones produce the best test results for your tests. Instead have principled reasons for trying a particular teaching style. Try and explain why a teaching style was bad. If you can, create an explicitly bad teaching style and see whether it was as bad as you thought it would be. Once you have got a good model of learning styles to test results, then pick the optimal learning style for principled reasons. Else you could just give the kids the answers before hand, that would optimise the test results.
Does that clarify the difference?
I’ll try and figure out formatting this properly in a bit.
That is not what I was trying to say at all.… Lets try math’s notation. You have
Have a model M which is a function of the state of the world s and the target and hit, you take at time t, a1. The model gives a predicted state of the world at time 2, ps2.
So ps2 = M(s1, a1)
Lets say you have a measure U() over states of the world. Which gives a utility for the state.
U(s1) = u1
You build up a model of what U is Mu so that you don’t have to try to hit every state to find the U of that state. So you have
Mu(s1) = pu1
A normal optimisation process just adjusts a1 until pu2 is maximised. So
f(a1) = argmax Mu(M(s1,a1))
What I’m arguing for is is that you are not getting high U on an poorly defined problem you want to be spending a lot of time adjusting M not adjusting a if you get a poor u.
If your model M is inaccurate that is ps2 != s2 you can improve your pu2 by updating your model to minimise the sqrt of error in the prediction at time t with the action that happened at t and the actual state at t+1 st+1
g(st) = argmin on M (sqrt(M(st,at)-st+1))
So in machine learnings case, your actions a are algorithms or systems you are testing and M is your model of what an intelligence is. If the turing test is giving your system a bad u, and you can’t easily predict the real world intelligences (e.g. humans) with your model of intelligence that is M(st,at)-st+1 is large, update your model of what an intelligence should be doing so that you can better predict what an intelligence should be. This will give you enable you to pick a better.
To give an example less close to home. If you are trying to teach kids, don’t just try lots of different teaching styles and see which ones produce the best test results for your tests. Instead have principled reasons for trying a particular teaching style. Try and explain why a teaching style was bad. If you can, create an explicitly bad teaching style and see whether it was as bad as you thought it would be. Once you have got a good model of learning styles to test results, then pick the optimal learning style for principled reasons. Else you could just give the kids the answers before hand, that would optimise the test results.
Does that clarify the difference?
I’ll try and figure out formatting this properly in a bit.
yeah! I think we’re actually saying the same things back at each other.
I was objecting to the continuous vs boolean distinction :) . I’d boil the article down to.
It is more important to optimise the model of the world , than it is acting in the world, if your model of the world is bad.
It is lucky that this is continuous function though.