How long does it take to get from janky grad-student code to polished, scalable, parallelized, hardware-accelerated, turn-key learning algorithms?
It took me on a team with two others less than a year to turn a janky paper explaining, in math, a new machine learning algorithm which kinda sorta did its job into a functional, scalable, parallelizable, hardware-accelerated[1] learning algorithm. We didn’t just build a library. We used the algorithm to solve a previously unsolved real-world problem.
The project didn’t take much compute. I ran the whole thing on an old laptop.
It took me on a team with two others less than a year to turn a janky paper explaining, in math, a new machine learning algorithm which kinda sorta did its job into a functional, scalable, parallelizable, hardware-accelerated[1] learning algorithm. We didn’t just build a library. We used the algorithm to solve a previously unsolved real-world problem.
The project didn’t take much compute. I ran the whole thing on an old laptop.
We ran it many times per second on tiny power-optimized PCBs that can be worn on one’s wrist.