It’s a convenient test-bed to investigate schemes for building an agent with access to a good universal distribution approximation—which is what we (at least, me) usually assume an LLM is!
Also see my PR.
It’s a convenient test-bed to investigate schemes for building an agent with access to a good universal distribution approximation—which is what we (at least, me) usually assume an LLM is!
Also see my PR.