Training Optimizations Deep Dive: How I Made the A100 Actually Work
The complete technical reference for achieving 16x speedup. Every optimization explained with code and diagrams.
The complete technical reference for achieving 16x speedup. Every optimization explained with code and diagrams.
A comprehensive guide to every way I shot myself in the foot training GPT-2 Small. Learn from my pain.