A breakdown how how transformer models work (AlexNet, an image classifier) [edit: more about CNNs then transformers, I misunderstood]

Open link in next tab

- YouTube

https://www.youtube.com/watch?v=UZDiGooFs54

Aproveite vídeos e músicas que você ama, envie e compartilhe conteúdo original com amigos, parentes e o mundo no YouTube.

A great, slightly more in depth (without being mathy) explanation of transformer models. Mostly talking about AlexNet, an image classifier from 2012. Goes over some history and has some very interesting looks under the hood.

He does use some personifying language for these models, but that's unfortunately the case for most information on the topic.