Richard_Ngo comments on Goal-directedness is behavioral, not structural

Richard_Ngo 9 Jun 2020 17:45 UTC
LW: 8 AF: 5
AF
“What I argue for is that, insofar as the predictions you want to make are about behaviors, what the internal cognition and structure give you is a way to extrapolate the behavior, perhaps more efficiently. Not a useful definition of goal-directedness.”
Perhaps it’d be useful to taboo the word “definition” here. We have this phenomenon, goal-directedness. Partly we think about it in cognitive terms; partly we think about it in behavioural terms. It sounds like you’re arguing that the former is less legitimate. But clearly we’re still going to think about it in both ways—they’re just different levels of abstraction for some pattern in the world. Or maybe you’re saying that it’ll be easier to decompose it when we think about it on a behavioural level? But the opposite seems true to me—we’re much better at reasoning about intentional systems than we are at abstractly categorising behaviour.
“What still stands for me is that given two systems with the same behavior, I want to give them the same goal-directedness.”
I don’t see how you can actually construct two generally intelligent systems which have this property, without them doing basically the same cognition. In theory, of course, you could do so using an infinite lookup table. But I claim that thinking about finite systems based on arguments about the infinite limit is often very misleading, for reasons I outline in this post. Here’s a (somewhat strained) analogy: suppose that I’m trying to build a rocket, and I have this concept “length”, which I’m using to make sure that the components are the right length. Now you approach me, and say “You’re assuming that this rocket engine is longer than this door handle. But if they’re both going at the speed of light, then they both become the same length! So in order to build a rocket, you need a concept of length which is robust to measuring things at light speed.”
To be more precise, my argument is: knowing that two AGIs have exactly the same behaviour but cognition which we evaluate as differently goal-directed is an epistemic situation that is so far removed from what we might ever experience that it shouldn’t inform our everyday concepts.
- adamShimi 10 Jun 2020 15:24 UTC
  LW: 2 AF: 1
  AF Parent
  Perhaps it’d be useful to taboo the word “definition” here. We have this phenomenon, goal-directedness. Partly we think about it in cognitive terms; partly we think about it in behavioural terms. It sounds like you’re arguing that the former is less legitimate. But clearly we’re still going to think about it in both ways—they’re just different levels of abstraction for some pattern in the world. Or maybe you’re saying that it’ll be easier to decompose it when we think about it on a behavioural level? But the opposite seems true to me—we’re much better at reasoning about intentional systems than we are at abstractly categorising behaviour.
  Rereading this comment and the ones before, I think we mean different things by “internal structure” or “cognitive terms”. What I mean is what’s inside the system (source code, physical brain states). What I think you mean is ascribing internal cognitive states to the system (in classic intentional stance fashion). Do you agree, or am I misunderstanding again?
  So I agree completely that we will need to ascribe intentional beliefs to the system. What I was pointing at is that searching a definition (sorry, used the taboo word) of goal-directedness in terms of the internal structure (that is, the source code for example), is misguided.
  What links here?
  - adamShimi's comment on Goal-directedness is behavioral, not structural by adamShimi (10 Jun 2020 15:32 UTC; 4 points)
  - Richard_Ngo 11 Jun 2020 1:12 UTC
    2 points
    AF Parent
    By “internal structure” or “cognitive terms” I also mean what’s inside the system, but usually at a higher level of abstraction than physical implementation. For instance, we can describe AlphaGo’s cognition as follows: it searches through a range of possible games, and selects moves that do well in a lot of those games. If we just take the value network by itself (which is still very good at Go) without MCTS, then it’s inaccurate to describe that network as searching over many possible games; it’s playing Go well using only a subset of the type of cognition the full system does.
    This differs from the intentional stance by paying more attention to what’s going on inside the system, as opposed to just making inferences from behaviour. It’d be difficult to tell that the full AlphaGo system and the value network alone are doing different types of cognition, just from observing their behaviour—yet knowing that they do different types of cognition is very useful for making predictions about their behaviour on unobserved board positions.
    What I was pointing at is that searching a definition (sorry, used the taboo word) of goal-directedness in terms of the internal structure (that is, the source code for example), is misguided.
    You can probably guess what I’m going to say here: I still don’t know what you mean by “definition”, or why we want to search for it.
    - adamShimi 11 Jun 2020 23:24 UTC
      LW: 4 AF: 1
      AF Parent
      After talking with Evan, I think I understand your point better. What I didn’t understand was that you seemed to argue that there was something else than the behavior that mattered for goal-directedness. But as I understand it now, what you’re saying is that, yes, the behavior is what matters, but extracting the relevant information from the behavior is really hard. And thus you believe that computing goal-directedness in any meaningful way will require normative assumptions about the cognition of the system, at an abstract level.
      If that’s right, then I would still disagree with you, but I think the case for my position is far less settled than I assumed. I believe there are lots of interesting parts of goal-directedness that can be extracted from the behavior only, while acknowledging that historically, it has been harder to compute most complex properties of a system from behavior alone.
      If that’s not right, then I propose that we schedule a call sometime, to clarify the disagreement with more bandwidth. Actually, even if it’s right, I can call to update you on the research.