and the whole point was to create tasks where you can’t make an automated reward function.
I was gesturing towards partially automatable—the whole can’t. (Specifically find water—unless that can be crafted now, or mods are in play. (Find ice might also work though.)) Handcrafted: move, don’t hold still.
Protip: when you find yourself doing this, consider opening a duplicate tab—one for reading and one for writing the comment.
I was refering to the ‘editing a comment process.’ Though thanks for the tip, I do use that a lot.
Pong
Yeah. I mostly found it interesting to think about because it seems like a simpler environment (and might be easier to train), but the results would be a lot less interesting. And at the limit of modification, Edited Pong becomes like Minecraft 2.
Arguably Minecraft is the sort of game that embedding a mini game (say via a primitive red stone controller going to some sort of arcade machine mockup), could kind of work within. (Agents might be uninterested in such a machine if it doesn’t affect its environment at all—and simple reward distribution setups could just be found by disassembling it.)
I was gesturing towards partially automatable—the whole can’t. (Specifically find water—unless that can be crafted now, or mods are in play. (Find ice might also work though.)) Handcrafted: move, don’t hold still.
I was refering to the ‘editing a comment process.’ Though thanks for the tip, I do use that a lot.
Yeah. I mostly found it interesting to think about because it seems like a simpler environment (and might be easier to train), but the results would be a lot less interesting. And at the limit of modification, Edited Pong becomes like Minecraft 2.
Arguably Minecraft is the sort of game that embedding a mini game (say via a primitive red stone controller going to some sort of arcade machine mockup), could kind of work within. (Agents might be uninterested in such a machine if it doesn’t affect its environment at all—and simple reward distribution setups could just be found by disassembling it.)