leogao comments on Current AIs Provide Nearly No Data Relevant to AGI Alignment

leogao 17 Dec 2023 9:14 UTC
LW: 13 AF: 6
6
AF
I agree with the spirit of the post but not the kinda clickbaity title. I think a lot of people are over updating on single forward pass behavior of current LLMs. However, I think it is still possible to get evidence using current models with careful experiment design and being careful with what kinds of conclusions to draw.