(I interpreted the bit about using llama-3 to involve fine-tuning for things other than just avoiding refusals, e.g., actually doing sufficiently high-quality debates.)
That’s indeed what I meant!