Ethan Perez comments on A Test for Language Model Consciousness

Ethan Perez 25 Aug 2022 21:12 UTC
I think we can mitigate this issue by removing all data related or adjacent to consciousness and/or AIs when pretraining and finetuning the model. We would then explain the notion of phenomenal consciousness to the model only at test time, when it needs to answer the consciousness-related questions.
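
To make the data-removal step concrete, here is a minimal sketch of what the filtering could look like, assuming a plain keyword match over a list of text documents. The term list `BLOCKED_TERMS` and the function `filter_corpus` are hypothetical names of my own, not part of any existing pipeline, and a real filter would likely need a far broader term list or a trained classifier, since keyword matching misses paraphrases and indirect discussion.

```python
import re

# Hypothetical, deliberately incomplete list of terms to exclude.
BLOCKED_TERMS = [
    "consciousness", "conscious", "sentience", "sentient", "qualia",
    "artificial intelligence", "language model", "AI",
]

# One case-insensitive pattern that matches any blocked term as a whole word or phrase.
BLOCKED_PATTERN = re.compile(
    r"\b(" + "|".join(re.escape(term) for term in BLOCKED_TERMS) + r")\b",
    re.IGNORECASE,
)

def filter_corpus(documents):
    """Return only the documents that mention none of the blocked terms."""
    return [doc for doc in documents if not BLOCKED_PATTERN.search(doc)]

if __name__ == "__main__":
    sample = [
        "The cat sat on the mat.",
        "Is an AI conscious?",
        "He said the rain stopped.",
    ]
    for kept in filter_corpus(sample):
        print(kept)
```

Dropping whole documents on any match is the conservative choice here: it loses more data than masking the matched spans would, but it also avoids leaving surrounding context that still discusses the topic.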