Humans already do this, except we have made it politically incorrect to talk about the ways in which human-generated Goodharting makes the world worse (race, gender, politics, etc.)
Your examples are clearly visible. If your wrong alignment paradigm gets reinforced because of your attachment to a specific model of causality known to ten people in the entire world, you risk noticing this too late.
You’re thinking about this the wrong way. AGI governance will not operate like human governance.
Can you elaborate? I don’t understand where we disagree.