TL;DR: This post gives a good summary of how models get smarter over time, yet even while they become superhuman at some tasks, they can still be quite bad at others (see the chart comparing the Naive Scenario with actual performance). This is a central dynamic in the development of machine intelligence and deserves more attention. I'd love to hear others' thoughts on this; I just realized the post needed one more positive vote to end up in the official review.
In other words, current machine intelligence and human intelligence are complements, and human + AI organizations will be more productive than human-only or AI-only organizations (given the same amount of resources).
The post sparked a ton of follow-up questions for me, for example:
Will machine intelligence and human intelligence continue to be complements? Is there some evaluation we could design that tells us the degree to which they are complements?
Will there always be some tasks where AIs trip up? Why?
Which skills will future AIs become superhuman at first, and how could we leverage that for safety research?
When we look at AI progress, does it look like the AI steadily getting better at all tasks, or suddenly getting better at one task or another rather than across the board? And how would we even split up “tasks” in a way that’s meaningful?
I’ve wanted to do a deep dive into this for a while now but keep putting it off.
I think many others have made the point about an uneven machine intelligence frontier (at least when compared with the frontier of human intelligence), but this is the first time I’ve seen it presented so succinctly. I think this post warrants a place in the review, and if it makes it in, that will be a great motivator for me to write up my thoughts on the questions above!