An empirical LLM evals preprint that seems to support these observations:Large Language Models are biased to overestimate profoundness by Herrera-Berg et al
An empirical LLM evals preprint that seems to support these observations:
Large Language Models are biased to overestimate profoundness by Herrera-Berg et al