You mention that society may do too little of the safer types of RL. Can you clarify what you mean by this?
In brief: large amounts of high-quality process-based RL might make AIs more useful earlier (prior to their becoming much smarter). But this might be expensive and annoying (e.g., it might require huge amounts of high-quality human labor), such that by default labs invest less in it, relative to just scaling up models, than would be optimal from a safety perspective.
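
To make the distinction concrete, here is a minimal sketch (not from the original answer; all function names are hypothetical) of the difference between outcome-based and process-based reward assignment over a multi-step reasoning trace. The point it illustrates is why the process-based version demands so much more supervision: every intermediate step needs a judgment, which is where the expensive human labor comes in.

```python
# Illustrative sketch only: contrasting outcome-based vs. process-based
# reward assignment. The judge callables stand in for costly human (or
# human-trained) evaluation; none of these names come from a real library.

from typing import Callable, List


def outcome_based_rewards(trace: List[str],
                          judge_outcome: Callable[[str], float]) -> List[float]:
    """Reward only the final result; intermediate steps get no direct signal.

    Cheap to supervise, but the policy is free to learn arbitrary
    (possibly unendorsed) intermediate behavior so long as the outcome
    scores well.
    """
    rewards = [0.0] * len(trace)
    rewards[-1] = judge_outcome(trace[-1])
    return rewards


def process_based_rewards(trace: List[str],
                          judge_step: Callable[[str], float]) -> List[float]:
    """Reward every intermediate step on its own merits.

    Keeps optimization pressure on human-endorsed reasoning, but each
    step requires its own judgment -- the expensive part gestured at above.
    """
    return [judge_step(step) for step in trace]


if __name__ == "__main__":
    trace = ["restate the problem", "derive intermediate result", "final answer"]
    # Stub judges standing in for human evaluation.
    print(outcome_based_rewards(trace, judge_outcome=lambda s: 1.0))
    print(process_based_rewards(trace, judge_step=lambda s: 0.9))
```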