Actually >1e6 was my conservative guess even without reversible computing.
Claude claimed that Landauer Limit is 2e8 times more efficient than current GPUs (A100). (I didn’t check Claude, I just asked 3.5 sonnet “How far are modern GPUs from theoretical energy efficiency for non-reversible computing?” and got this answer.) Edit: if someone knows the actual answer, please let me know.
So, even without reversible computing, you can go very far. Idk how much further reversible computing gets you, though I think it is probably possible.
I get 1e7 using 16 bit-flips per bfloat16 operation, 300K operating temperature, and 312Tflop/s (from Nvidia’s spec sheet). My guess is that this is a little high because a float multiplication involves more operations than just flipping 16 bits, but it’s the right order-of-magnitude.
Actually >1e6 was my conservative guess even without reversible computing.
Claude claimed that Landauer Limit is 2e8 times more efficient than current GPUs (A100). (I didn’t check Claude, I just asked 3.5 sonnet “How far are modern GPUs from theoretical energy efficiency for non-reversible computing?” and got this answer.) Edit: if someone knows the actual answer, please let me know.
So, even without reversible computing, you can go very far. Idk how much further reversible computing gets you, though I think it is probably possible.
I get 1e7 using 16 bit-flips per bfloat16 operation, 300K operating temperature, and 312Tflop/s (from Nvidia’s spec sheet). My guess is that this is a little high because a float multiplication involves more operations than just flipping 16 bits, but it’s the right order-of-magnitude.