Tokenization Blogs.
An easy-to-understand explanation of how large language models work, from training to testing, with examples and code snippets.