I don’t know how to read ‘19% higher,’ I presume that means 19% less hallucinations but I can also think of several other things that could mean.
This might be referring to the “Internal factual eval by category” chart that showed accuracy going from ~50% to ~70% (i.e. ~19 percentage points, which means more like 40% reduction in hallucination).
This. If they had meant 19% less hallucinations they would have said 19% reduction in whatever, which is a common way to talk about relative improvements in ML.
This might be referring to the “Internal factual eval by category” chart that showed accuracy going from ~50% to ~70% (i.e. ~19 percentage points, which means more like 40% reduction in hallucination).
This. If they had meant 19% less hallucinations they would have said 19% reduction in whatever, which is a common way to talk about relative improvements in ML.