“[...] This is because there would be no general direction towards a truth-based belief domain or away from using human modeling in output generation.”
What do you mean by “human modeling in output generation”?
Modeling how a human would respond (“human modeling in output generation”)
Modeling what the ground-truth answer is
Eg. for common misconceptions, maybe most humans would hold a certain misconception (like that South America is west of Florida), but we want the LLM to realize that we want it to actually say how things are (given it likely does represent this fact somewhere)
“[...] This is because there would be no general direction towards a truth-based belief domain or away from using human modeling in output generation.”
What do you mean by “human modeling in output generation”?
I am contrasting generating an output by:
Modeling how a human would respond (“human modeling in output generation”)
Modeling what the ground-truth answer is
Eg. for common misconceptions, maybe most humans would hold a certain misconception (like that South America is west of Florida), but we want the LLM to realize that we want it to actually say how things are (given it likely does represent this fact somewhere)