Agreed, except for the small caveat of LLMs answers which can be easily verified as approximately correct. E.g. answers to math problems where the solution is hard but the verification is easy; or Python scripts you’ve tested yourself and whose output looks correct; or reformatted text (like plaintext → BBCode) if it looks correct on a word diff website.
Incidentally, are there any LLM services which can already this kind of verification in specific domains?
Agreed, except for the small caveat of LLMs answers which can be easily verified as approximately correct. E.g. answers to math problems where the solution is hard but the verification is easy; or Python scripts you’ve tested yourself and whose output looks correct; or reformatted text (like plaintext → BBCode) if it looks correct on a word diff website.
Incidentally, are there any LLM services which can already this kind of verification in specific domains?