Let’s reproduce NanoGPT with JAX!(Half 1) | by Louis Wang | Jul, 2024
Impressed by Andrej Kapathy’s current youtube video on Let’s reproduce GPT-2 (124M), I’d prefer to rebuild it with many of the coaching optimizations in Jax. Jax is constructed for extremely...











