Can someone create a forecasting question about which model will score better in benchmarks, Claude 3.5 Opus or GPT-5?
Can someone create a forecasting question about which model will score better in benchmarks, Claude 3.5 Opus or GPT-5?