Generating Shakespeare with Transformers: From Attention to Elizabethan Prose
Building a decoder-only transformer from scratch to generate Shakespeare-style text, with a deep dive into multi-head attention, positional encoding, and the core ideas of "Attention Is All You Need".