I think that could become a problem. I think that type of compression is useful but not really dramatic, but I’m not sure. So there would still be an alignment tax. But it might be small enough to not prevent people from paying it. Keeping internal processes in natural language will also make for easier debugging just to get the system to work well and do things you want.
People are still going have to want to make aligned systems. This approach might just make it a whole lot easier.
I doubt that any language less represented than English (or JS/Python) would be better since the amount of good data to ingest would be much less for them.
I’m not sure this point will stand because it might be cheaper to have them think in their own language: https://www.lesswrong.com/posts/bNCDexejSZpkuu3yz/you-can-use-gpt-4-to-create-prompt-injections-against-gpt-4
I think that could become a problem. I think that type of compression is useful but not really dramatic, but I’m not sure. So there would still be an alignment tax. But it might be small enough to not prevent people from paying it. Keeping internal processes in natural language will also make for easier debugging just to get the system to work well and do things you want.
People are still going have to want to make aligned systems. This approach might just make it a whole lot easier.
I doubt that any language less represented than English (or JS/Python) would be better since the amount of good data to ingest would be much less for them.