🤖
Resources that helped us understand and implement the Transformer model:
- Attention Is All You Need
- Coding a Transformer from scratch on PyTorch, with full explanation, training and inference by Umar Jamil
- Visualizing Attention, a Transformer's Heart by 3Blue1Brown
- Understanding and Coding the Self-Attention Mechanism of Large Language Models From Scratch by Sebastian Raschka
- Self Attention in Transformer Neural Networks (with Code!) by CodeEmporium
- Visualizing attention matrices using BertViz
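The resources above walk through scaled dot-product self-attention in detail. As a minimal sketch of the core computation they cover (plain NumPy rather than PyTorch, with illustrative weight matrices, not our actual model code):

```python
import numpy as np

def self_attention(X, Wq, Wk, Wv):
    """Scaled dot-product self-attention over a sequence X of shape (seq_len, d_model)."""
    Q, K, V = X @ Wq, X @ Wk, X @ Wv
    d_k = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d_k)                   # (seq_len, seq_len) attention logits
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)    # row-wise softmax
    return weights @ V                                # each output is a weighted sum of values

# toy example: 3 tokens, model dimension 4
rng = np.random.default_rng(0)
X = rng.normal(size=(3, 4))
Wq, Wk, Wv = (rng.normal(size=(4, 4)) for _ in range(3))
out = self_attention(X, Wq, Wk, Wv)
print(out.shape)  # (3, 4)
```

This mirrors the single-head attention derivation in the linked tutorials; multi-head attention splits `d_model` across several such heads and concatenates the results.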