Will the experiment be run?

What is the experiment? What is the question?

I take a vision or language model which was cutting edge in 2000, and run it with a similar amount of compute/data to what’s typically used today.
Guess A. Is the difference (between 2000 and today) modern compute?
I take a modern vision or language model, calculate how much money it costs to train, estimate the amount of compute I could have bought for that much money in 2000, then train it with that much compute
Guess B. Is the difference (between 2000 and today) modern compute costs?
But the experiment doesn’t seem to be about A or B. More likely it’s about both:
Which is more important to modern ML performance (and in what domain?*):
Typical compute (today versus then)?
Or typical compute cost (today versus then)?
(Minor technical note: when comparing results from the past to results today, it might be impossible to go back in time and run these tests with a control group, but rather than taking ‘things weren’t as good back then’ for granted, that should also be tested for comparison. (Replicate earlier results.**)
This does admit other hypotheses.
For example, ‘the difference between 2020 and 2000 is that training used to take a long time, so if people set things up wrong, they didn’t get feedback for a long time. Perhaps modern compute enables researchers to end up with correctly set-up ML programs even when the code isn’t written right the first time.’)
A and B can be rephrased as:
Do we use more compute today, but spend ‘the same amount’?
Do we spend ‘more’ on compute today?
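To make the two budgets concrete, here is a minimal back-of-the-envelope sketch in Python. Every number in it (the FLOP count of a modern run, its dollar cost, and 2000-era price-performance) is a placeholder assumption chosen only to illustrate the distinction, not an estimate of real hardware.

```python
# Back-of-the-envelope comparison of the two budgets ("same compute" vs "same cost").
# Every number here is a placeholder assumption for illustration, not a measurement.

MODERN_TRAINING_FLOP = 1e21   # assumed total compute of a modern training run (FLOP)
MODERN_COST_USD = 1e6         # assumed dollar cost of that run
FLOP_PER_USD_2000 = 1e12      # assumed 2000-era price-performance (FLOP per dollar)

flop_per_usd_today = MODERN_TRAINING_FLOP / MODERN_COST_USD  # implied modern price-performance

# Guess A: give the 2000-era model the same amount of compute used today.
compute_matched_flop = MODERN_TRAINING_FLOP
cost_in_2000_usd = compute_matched_flop / FLOP_PER_USD_2000  # what that compute would have cost in 2000

# Guess B: give the 2000-era model only the compute the same dollar budget bought in 2000.
cost_matched_flop = MODERN_COST_USD * FLOP_PER_USD_2000

print(f"A (compute-matched): {compute_matched_flop:.1e} FLOP "
      f"(~${cost_in_2000_usd:,.0f} at 2000 prices)")
print(f"B (cost-matched):    {cost_matched_flop:.1e} FLOP for ${MODERN_COST_USD:,.0f}")
print(f"A/B = improvement in FLOP per dollar since 2000: "
      f"{flop_per_usd_today / FLOP_PER_USD_2000:.0e}x")
```

Whatever numbers are plugged in, budget A exceeds budget B by exactly the factor by which FLOP per dollar has improved since 2000, which is why the two guesses are genuinely different experiments.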
*This might be intended as a more general question, but the post asks about:
vision or language model[s].
**The most extreme version would be getting/recreating old machines and then re-running old ML stuff on them.
The underlying question I want to answer is: ML performance is limited by both available algorithms and available compute. Both of those have (presumably) improved over time. Relatively speaking, how taut are those two constraints? Has progress come primarily from better algorithms, or from more/cheaper compute?