Skip to content

Latest commit

 

History

History
17 lines (8 loc) · 997 Bytes

README.md

File metadata and controls

17 lines (8 loc) · 997 Bytes

Transformers

Here are two notebooks of transformer implementation walk-troughs:

And here are two notebooks of using pretrained GPT-2 model:

I also wrote a blog post about fine-tuning pre-trained GPT2 model on a jokes dataset.