the safety considerations you’ll be discussing depend more on whether the NTM is trained via SL or RL than on whether it neatly contains all your (soon to be elucidated) Brain-like-AGI properties.
I’m confused; this statement makes it sound like “whether it’s trained via SL or RL” is NOT a possible candidate for a “brain-like-AGI property”. Why can’t it be? Or maybe I’m reading too much into your wording.
Oops, strangely enough, I just wasn’t thinking about that possibility. It’s obvious now, but I had assumed that SL vs RL would be a minor consideration, despite the many words you’ve already written on reward.