Skip to content

Asthestarfall/annotated-transformer

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 

Repository files navigation

Annotated Transformer

This is an annotated paper of the Transformer architecture implemented in numpy to explain the main mechanisms of self-attention and multihead attention to the students of the attention seminar.

TODO:

  • add JAX/autograd

About

Annotated transformer architecture in numpy and jax

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages

  • Jupyter Notebook 100.0%