Explaining GPT visually
There is a great group of data scientists at Georgia Tech called the Polo Club. They have built a useful web-based explainer that lets you interact with a GPT model and see how it takes input text, splits it into tokens, converts those tokens into vectors, and then calculates its predictions for the next token. You can try it yourself: https://poloclub.github.io/transformer-explainer/
I love how simple and visual this tool is, since most people do not want to get into the weeds on matrix multiplication.
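
If you do want a small peek under the hood, here is a minimal sketch in Python of the steps the explainer walks through: text to tokens, tokens to vectors, vectors to next-token probabilities. This is not the Polo Club's code or a real GPT. The vocabulary, the `tokenize` and `predict_next` helpers, and the random embedding table are all made up for illustration, and the attention layers are replaced by a simple average of the token vectors, so the probabilities it produces are meaningless.

```python
import math
import random

# Hypothetical toy vocabulary; a real GPT uses a learned subword tokenizer
# with tens of thousands of entries.
VOCAB = ["data", "visualization", "empowers", "users", "to", "<unk>"]
EMBED_DIM = 4  # real models use hundreds or thousands of dimensions


def tokenize(text):
    """Map each whitespace-separated word to a token id (unknown words -> <unk>)."""
    unk = VOCAB.index("<unk>")
    return [VOCAB.index(w) if w in VOCAB else unk for w in text.lower().split()]


def softmax(scores):
    """Turn raw scores into probabilities that sum to 1."""
    m = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def predict_next(text, seed=0):
    """Return (token ids, probability for each vocabulary entry)."""
    rng = random.Random(seed)
    # Untrained, random embedding table: one vector per vocabulary entry.
    embeddings = [[rng.gauss(0, 1) for _ in range(EMBED_DIM)] for _ in VOCAB]

    ids = tokenize(text)
    vectors = [embeddings[i] for i in ids]

    # Stand-in for the transformer blocks (assumption): average the token vectors.
    context = [sum(col) / len(vectors) for col in zip(*vectors)]

    # Score every vocabulary entry with a dot product against the context
    # vector. This is the matrix multiplication step, written out by hand.
    logits = [sum(c * e for c, e in zip(context, emb)) for emb in embeddings]
    return ids, softmax(logits)


if __name__ == "__main__":
    ids, probs = predict_next("Data visualization empowers users to")
    print("token ids:", ids)
    for word, p in zip(VOCAB, probs):
        print(f"{word:>15}  {p:.3f}")
```

Running it prints the token ids for the input sentence and one probability per vocabulary word. The explainer shows the same flow for an actual trained model, with the attention steps this sketch skips.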