Humans have a history of making philosophical progress. We lack similar empirical evidence for AIs.
Hybrid philosophical discourse done by human-AI collaborations can be very good. For example, I feel that Janus has been doing very strong work in this sense with base models (so, not with the RLHF’d, Constitutional, or otherwise “lesioned” and “mode-collapsed” models we mostly use these days).
But, indeed, this does not tell us much about what AIs would do on their own.
Thanks, that’s very informative.