They put too much emphasis on high frequency features, suggesting a different inductive bias from humans.
Could you fix this part by adding high frequency noise to the images prior to training? Maybe lots of copies of each image with different noise patterns?
Could you fix this part by adding high frequency noise to the images prior to training? Maybe lots of copies of each image with different noise patterns?