Part 3: Implementing a GPT Model from Scratch

Main Code

01_main-code contains the main part code.

Bonus Materials

02_performance-analysis contains optional code analyzing the performance of the GPT model(s).
part_4/07_gpt_to_llama contains a step-by-step guide for converting a GPT architecture implementation to Llama 3.2 and loads pretrained weights from Meta AI