RLHF is too complex for people starting in ML? But I’m interested by the link from the mnist demo if you have it?
Preference model : why not, but there is no clear metric. So we cannot easily determine the winner of the Hackathon.
Make an interface: this is a cool project idea. But generally, gradient based methods like The fast gradient sign lethod works very well. I have no clue what would an an adversarial GUI interface look like. So I’m not comfortable with the idea.
Interface to find the image activating the most an image classifier neuron? Cool idea but i think it’s too simple.
Thank you for your help
RLHF is too complex for people starting in ML? But I’m interested by the link from the mnist demo if you have it?
Preference model : why not, but there is no clear metric. So we cannot easily determine the winner of the Hackathon.
Make an interface: this is a cool project idea. But generally, gradient based methods like The fast gradient sign lethod works very well. I have no clue what would an an adversarial GUI interface look like. So I’m not comfortable with the idea.
Interface to find the image activating the most an image classifier neuron? Cool idea but i think it’s too simple.