r(q):=max(p1,…,pk)∈ϕ−1t(q)n∑k=1qk⋅r′k(pk).
What is qk? Also, we should allow adding some valid reward function of ~t.
kth element of q
What is qk? Also, we should allow adding some valid reward function of ~t.
kth element of q