Have you noticed anything interesting in the CoT that might explain the mechanism by which the threat reduces the model’s performance?