Why this post exists
This is a technical writeup stored as a single MDX file under content/posts/. You can use normal markdown: headings, lists, and images. (Revision B: tightened intro after review.)
A little math
When we minimize a scalar loss , a vanilla update is:
Inline math works too: the learning rate controls step size.
Interactive diagram
Drag the slider and press Gradient step to watch move along a simple 1D loss curve.
Interactive diagram — adjust learning rate and take gradient steps
θ = -2.200
Writing process
Commit this file as you draft. The history control on this page reads git log for this path so you can scrub the timeline and see earlier wording.