This tutorial implements the Transformer model architecture in one page of PyTorch, as simply and clearly as possible.
You should read the Transformer paper before coding. The original paper: Attention Is All You Need
Step 1: Implement the sub-modules:
- Embedding
- Positional Encoding (sketched after this list)
- Multi-Head Attention (the key module; sketched after this list)
- Position-wise Feed-Forward
- Encoder Layer
- Decoder Layer
- Generator
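
A minimal sketch of the positional-encoding module, assuming the sinusoidal formulation from the paper; the `max_len` default and the dropout placement are conventional choices, not fixed by the paper:

```python
import math
import torch
import torch.nn as nn

class PositionalEncoding(nn.Module):
    """Sinusoidal positional encoding (paper, Section 3.5)."""
    def __init__(self, d_model, max_len=5000, dropout=0.1):
        super().__init__()
        self.dropout = nn.Dropout(dropout)
        pe = torch.zeros(max_len, d_model)
        position = torch.arange(0, max_len, dtype=torch.float).unsqueeze(1)
        # Frequencies decay geometrically across the even dimensions
        div_term = torch.exp(
            torch.arange(0, d_model, 2).float() * (-math.log(10000.0) / d_model)
        )
        pe[:, 0::2] = torch.sin(position * div_term)  # even dims: sine
        pe[:, 1::2] = torch.cos(position * div_term)  # odd dims: cosine
        # Buffer, not parameter: saved with the model but never trained
        self.register_buffer("pe", pe.unsqueeze(0))  # (1, max_len, d_model)

    def forward(self, x):
        # x: (batch, seq_len, d_model); add the encoding for the first seq_len positions
        x = x + self.pe[:, : x.size(1)]
        return self.dropout(x)
```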
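Multi-head attention is the heart of the model. A minimal sketch, assuming batch-first tensors of shape (batch, seq_len, d_model); the class and argument names are illustrative, and the attention is computed by hand rather than with a library helper, since torch 1.13 has no built-in for it:

```python
import math
import torch
import torch.nn as nn

class MultiHeadAttention(nn.Module):
    """Scaled dot-product attention over num_heads heads (paper, Section 3.2.2)."""
    def __init__(self, d_model, num_heads, dropout=0.1):
        super().__init__()
        assert d_model % num_heads == 0, "d_model must be divisible by num_heads"
        self.d_k = d_model // num_heads
        self.num_heads = num_heads
        self.w_q = nn.Linear(d_model, d_model)
        self.w_k = nn.Linear(d_model, d_model)
        self.w_v = nn.Linear(d_model, d_model)
        self.w_o = nn.Linear(d_model, d_model)
        self.dropout = nn.Dropout(dropout)

    def forward(self, query, key, value, mask=None):
        batch = query.size(0)
        # Project, then split into heads: (batch, num_heads, seq_len, d_k)
        q = self.w_q(query).view(batch, -1, self.num_heads, self.d_k).transpose(1, 2)
        k = self.w_k(key).view(batch, -1, self.num_heads, self.d_k).transpose(1, 2)
        v = self.w_v(value).view(batch, -1, self.num_heads, self.d_k).transpose(1, 2)
        # Attention(Q, K, V) = softmax(QK^T / sqrt(d_k)) V
        scores = q @ k.transpose(-2, -1) / math.sqrt(self.d_k)
        if mask is not None:
            # Masked positions (mask == 0) get -inf so softmax zeroes them out
            scores = scores.masked_fill(mask == 0, float("-inf"))
        attn = self.dropout(scores.softmax(dim=-1))
        # Merge heads back: (batch, seq_len, d_model)
        out = (attn @ v).transpose(1, 2).contiguous().view(batch, -1, self.num_heads * self.d_k)
        return self.w_o(out)
```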
Step 2: Build the N× layer stacks (sketched after the list):
- Encoder
- Decoder
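
A sketch of the N× stacking, assuming the layer classes from Step 1. The `clones` helper is a common convention for deep-copying a layer N times, not part of PyTorch:

```python
import copy
import torch.nn as nn

def clones(module, n):
    """Produce n independent deep copies of a module."""
    return nn.ModuleList([copy.deepcopy(module) for _ in range(n)])

class Encoder(nn.Module):
    """Stack of n identical encoder layers with a final LayerNorm."""
    def __init__(self, layer, n, d_model):
        super().__init__()
        self.layers = clones(layer, n)
        self.norm = nn.LayerNorm(d_model)

    def forward(self, x, mask):
        for layer in self.layers:
            x = layer(x, mask)
        return self.norm(x)

class Decoder(nn.Module):
    """Stack of n identical decoder layers with a final LayerNorm."""
    def __init__(self, layer, n, d_model):
        super().__init__()
        self.layers = clones(layer, n)
        self.norm = nn.LayerNorm(d_model)

    def forward(self, x, memory, src_mask, tgt_mask):
        # memory is the encoder output attended to by cross-attention
        for layer in self.layers:
            x = layer(x, memory, src_mask, tgt_mask)
        return self.norm(x)
```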
Step 3: Assemble the full model (sketched below):
- Transformer
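
A sketch of the final assembly, wiring the Step 1 and Step 2 pieces together; the constructor signature here is one reasonable choice, not necessarily the file's exact one:

```python
import torch.nn as nn

class Transformer(nn.Module):
    """Encoder-decoder Transformer: embed -> encode -> decode -> generate."""
    def __init__(self, encoder, decoder, src_embed, tgt_embed, generator):
        super().__init__()
        self.encoder = encoder      # Encoder stack from Step 2
        self.decoder = decoder      # Decoder stack from Step 2
        self.src_embed = src_embed  # source Embedding + PositionalEncoding
        self.tgt_embed = tgt_embed  # target Embedding + PositionalEncoding
        self.generator = generator  # final Linear projecting to vocabulary logits

    def encode(self, src, src_mask):
        return self.encoder(self.src_embed(src), src_mask)

    def decode(self, memory, src_mask, tgt, tgt_mask):
        return self.decoder(self.tgt_embed(tgt), memory, src_mask, tgt_mask)

    def forward(self, src, tgt, src_mask, tgt_mask):
        memory = self.encode(src, src_mask)
        return self.decode(memory, src_mask, tgt, tgt_mask)
```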
Requirements:
- torch>=1.13
There are three tests at the end of the file:

```python
if __name__ == "__main__":
    test_multi_head_attention()
    test_positional_encoding()
    test_transformer()
```
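
For reference, a shape-checking test along these lines is one way such a test could look; this sketch assumes the `MultiHeadAttention` class drafted above rather than the file's actual implementation:

```python
import torch

def test_multi_head_attention():
    # Output must keep the input shape (batch, seq_len, d_model)
    batch, seq_len, d_model, num_heads = 2, 10, 512, 8
    mha = MultiHeadAttention(d_model, num_heads)
    x = torch.randn(batch, seq_len, d_model)
    out = mha(x, x, x)  # self-attention: query = key = value
    assert out.shape == (batch, seq_len, d_model)
    print("test_multi_head_attention passed")
```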
Run it as follows:

```bash
python transformer.py
```
Result: