How the NanoGPT Speedrun WR dropped by 20% in 3 months lesswrong.com 2 points by frozenseven 15 hours ago