After having spent a few hours playing with Opus, I think “slightly better than best public gpt-4” seems qualitatively correct—both models tend to get tripped up on the same kinds of tasks, but Opus can inconsistently solve some tasks in my workflow that gpt-4 cannot.
And yeah, it seems likely that I will also swap to Claude over ChatGPT.
After having spent a few hours playing with Opus, I think “slightly better than best public gpt-4” seems qualitatively correct—both models tend to get tripped up on the same kinds of tasks, but Opus can inconsistently solve some tasks in my workflow that gpt-4 cannot.
And yeah, it seems likely that I will also swap to Claude over ChatGPT.