2026 (11 posts)

March (4 posts)

I Spent Another $68 Because a Spreadsheet Wouldn’t Stop Staring at Me

March 15, 2026 · 9 min · Jun Park

10 Things I Learned Training a 1B Parameter Model That Nobody Talks About

March 7, 2026 · 14 min · Jun Park

What GPUburnout-1B Actually Learned

March 6, 2026 · 10 min · Jun Park

The $175 Experiment: Training GPUburnout-1B on a Single GPU

March 4, 2026 · 10 min · Jun Park

February (4 posts)

From 134M to 1B: Building GPUburnout-1B From Scratch

February 27, 2026 · 7 min · Jun Park

Training Optimizations Deep Dive: How I Made the A100 Actually Work

February 12, 2026 · 21 min · Jun Park

The Results Are In (And My Wallet Is Empty)

February 6, 2026 · 6 min · Jun Park

11 Training Challenges and How I Solved Them

February 2, 2026 · 6 min · Jun Park

January (3 posts)

Scaling Up: From Tiny Model to GPT-2 Small

January 27, 2026 · 4 min · Jun Park

Data Preparation: Building a 12GB Training Corpus

January 22, 2026 · 4 min · Jun Park

Why I Decided to Build a Language Model from Scratch

January 15, 2026 · 3 min · Jun Park
GPUburnout
Will Code for Tokens