For a while now, the usual answer I've seen is to start with "Attention Is All You Need", the original Transformers paper. It's still pretty good, but over the past year I've led a few working sessions on the computational fundamentals of transformers, and those sessions turned up some later resources that simplify and clarify what's going on.
Part of the problem with self-studying this stuff is that it's hard to know which resources are good without already being at least conversant with the material.
You can quickly get overwhelmed by the million good resources out there so I'll keep it to these three. If you have a strong CS background, they'll take you a long way:
(1) Transformers from Scratch: https://peterbloem.nl/blog/transformers
(2) Attention Is All You Need: https://proceedings.neurips.cc/paper/2017/hash/3f5ee243547de...
(3) Formal Algorithms for Transformers: https://arxiv.org/abs/2207.09238
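If you want a concrete anchor before diving in, here's a minimal sketch of the scaled dot-product attention at the heart of all three resources, softmax(Q K^T / sqrt(d_k)) V. Plain NumPy, and the names, shapes, and toy data are my own illustration rather than anything taken from the papers:

    import numpy as np

    def softmax(x, axis=-1):
        # Numerically stable softmax over the last axis.
        x = x - x.max(axis=axis, keepdims=True)
        e = np.exp(x)
        return e / e.sum(axis=axis, keepdims=True)

    def attention(Q, K, V):
        # Q, K: (seq_len, d_k); V: (seq_len, d_v)
        d_k = Q.shape[-1]
        scores = Q @ K.T / np.sqrt(d_k)     # (seq_len, seq_len) similarity scores
        weights = softmax(scores, axis=-1)  # each row sums to 1
        return weights @ V                  # weighted mix of value vectors

    # Toy usage: a sequence of 4 tokens with 8-dimensional embeddings.
    rng = np.random.default_rng(0)
    x = rng.standard_normal((4, 8))
    # In a real transformer Q, K, V come from learned linear projections of x;
    # reusing x directly keeps the sketch short.
    print(attention(x, x, x).shape)  # (4, 8)

Once that one computation clicks, the rest of the architecture in the resources above is mostly bookkeeping around it: learned projections, multiple heads, residual connections, and feed-forward layers.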