NanoGPT

Let’s reproduce NanoGPT with JAX!(Part 1)

Inspired by Andrej Kapathy’s recent youtube video on Let’s reproduce GPT-2 (124M), I’d wish to rebuild it with many of the training optimizations in Jax. Jax is built for highly efficient computation speed, and...

Recent posts

Popular categories

ASK ANA