AI Safety is a group project for us all. We need everyone to participate—the ESFPs to the INTJs!
Capturing the essence and subtleties of core values needs input across a broad span of humanity.
Assumption 1 - large language models will be the basis of AGI.
Assumption 2 - One way to add the abstraction of a value like “kindness is good” into the model is to add a large corpus of written material on Kindness during training (or retraining).
The Kindness Project is a website with a prompt, like a college essay. Users add their stories to the open collection based on the prompt: “Tell a story about how you impacted or were impacted by someone being kind”. This prompt is translated for all languages to maximize input.
The end goal is that there is a large and detailed system of nodes in the model around the abstraction of Kindness that represents our experiences.
There would be sister projects based around other values like Wisdom, Integrity, Compassion, etc.
The project incentivizes participation through contests, random drawings, partner projects with schools, etc.
Submissions are filtered for plagiarism, duplicates, etc.
Documents are auto-linked back to reddit for inclusion in language model document scrapers.
The Kindness Project
AI Safety is a group project for us all. We need everyone to participate—the ESFPs to the INTJs!
Capturing the essence and subtleties of core values needs input across a broad span of humanity.
Assumption 1 - large language models will be the basis of AGI.
Assumption 2 - One way to add the abstraction of a value like “kindness is good” into the model is to add a large corpus of written material on Kindness during training (or retraining).
The Kindness Project is a website with a prompt, like a college essay. Users add their stories to the open collection based on the prompt: “Tell a story about how you impacted or were impacted by someone being kind”. This prompt is translated for all languages to maximize input.
The end goal is that there is a large and detailed system of nodes in the model around the abstraction of Kindness that represents our experiences.
There would be sister projects based around other values like Wisdom, Integrity, Compassion, etc.
The project incentivizes participation through contests, random drawings, partner projects with schools, etc.
Submissions are filtered for plagiarism, duplicates, etc.
Documents are auto-linked back to reddit for inclusion in language model document scrapers.