Everyone keeps telling me to read this. The promise is the one thing missing from most LLM tutorials - building it from scratch instead of importing a pretrained model and calling it research.
← All books
Build a Large Language Model (From Scratch)
by Sebastian Raschka
currently reading technical
Implements a GPT-style model end to end - tokenizer, attention, training loop, fine-tuning.