I’ve seen some presentations about how to do style-matching off of GitHub repos to pretty-confidently ID anonymous coders. While set-up requires a sizable amount of compute and data, the results have gotten quite impressive. There are ways to work against this (stuff that deliberately obscures your coding style, usually by rewriting your code), but they’re not that well known. And a similar thing can be done with writing style and writing samples.
Staying anonymous against high-effort attempts to discern your identity has gotten very hard, and is only likely to get harder.
At some point, all you can do is guard against the low-effort ones.
I’ve seen some presentations about how to do style-matching off of GitHub repos to pretty-confidently ID anonymous coders. While set-up requires a sizable amount of compute and data, the results have gotten quite impressive. There are ways to work against this (stuff that deliberately obscures your coding style, usually by rewriting your code), but they’re not that well known. And a similar thing can be done with writing style and writing samples.
Staying anonymous against high-effort attempts to discern your identity has gotten very hard, and is only likely to get harder.
At some point, all you can do is guard against the low-effort ones.