Dpo on GPUburnout

Dpo on GPUburnout | Jun Parkhttps://gpuburnout.com/tags/dpo/Recent content in Dpo on GPUburnout | Jun ParkGPUburnout | Jun Parkhttps://gpuburnout.com/images/og-default.pnghttps://gpuburnout.com/images/og-default.pngHugo -- 0.155.2en-usSun, 29 Mar 2026 00:00:00 +00007 Out of 8 - How DPO Finally Workedhttps://gpuburnout.com/posts/s4-ch3-dpo-finally-worked/Sun, 29 Mar 2026 00:00:00 +0000https://gpuburnout.com/posts/s4-ch3-dpo-finally-worked/In which the same technique that failed nine times on the 1B succeeds on the 2B - and the reason why is the whole point of Season 4.Nine Experiments, Nine Funeralshttps://gpuburnout.com/posts/s3-ch3-garbage-survived-finetuning/Sat, 21 Mar 2026 00:00:00 +0000https://gpuburnout.com/posts/s3-ch3-garbage-survived-finetuning/A controlled experiment on why post-training alignment can't fix pretraining contamination - and what the data proves.