I would also add the same point I bought up when you presented it in-person: I don’t think the “generating diverse proposals and then selecting from them” is directly providing much of the value from this proposal.
Presumably a proposal for an advanced task is going to involve a lot of information, say kilobytes of information as a lower bound. The people evaluating the proposal presumably have to spend nontrivial amounts of time reading and thinking about the proposal. So as an upper bound, it seems unlikely that it is economical for them to read through more than, say, four billion proposals.
Which gives you 4 bytes of information about their preferences, out of kilobytes of information in a proposal.
So I would deemphasize the evaluation/selection step, or maybe replace it with a different component, e.g. evaluation/feedback (if instead of generating four billion proposals and selecting one of them, you iteratively generate two approaches and give feedback for which one is better, then after four billion approaches you’ve accumulated two billion bits of information, rather than 32 bits of information).
I would also add the same point I bought up when you presented it in-person: I don’t think the “generating diverse proposals and then selecting from them” is directly providing much of the value from this proposal.
Presumably a proposal for an advanced task is going to involve a lot of information, say kilobytes of information as a lower bound. The people evaluating the proposal presumably have to spend nontrivial amounts of time reading and thinking about the proposal. So as an upper bound, it seems unlikely that it is economical for them to read through more than, say, four billion proposals.
Which gives you 4 bytes of information about their preferences, out of kilobytes of information in a proposal.
So I would deemphasize the evaluation/selection step, or maybe replace it with a different component, e.g. evaluation/feedback (if instead of generating four billion proposals and selecting one of them, you iteratively generate two approaches and give feedback for which one is better, then after four billion approaches you’ve accumulated two billion bits of information, rather than 32 bits of information).