LLM Training From Scratch | GPT-2 Tutorial
Posts
Tags
Archives
Search
Search
GPUburnout
Will Code for Tokens
134M
Params
2.8B
Tokens
7x
Speedup