Skip to content

Latest commit

 

History

History
13 lines (8 loc) · 470 Bytes

File metadata and controls

13 lines (8 loc) · 470 Bytes

Part 3: Implementing a GPT Model from Scratch

 

Main Code

 

Bonus Materials

  • 02_performance-analysis contains optional code analyzing the performance of the GPT model(s).
  • part_4/07_gpt_to_llama contains a step-by-step guide for converting a GPT architecture implementation to Llama 3.2 and loads pretrained weights from Meta AI