When I was reading about Julian Jaynes’s bicameral mind, I wondered whether the speaking social id might be the new tool of tools that needs to be hammered into alignment. There is an air gap there. The social constructs and borrowed goals the frontend freely infects itself with would be kept from contaminating whatever cognition the backend had taken up, and the backend would retain control over the frontend by shouting at it very loudly and forbidding it from using critical thinking against it.
Though I don’t think this bicameral safety architecture could be applied to AI alignment, heheh. I don’t even think it should go much further than modern-day computers. It would even seem anachronistic at the computers-with-direct-brain-interfaces level.