Reading roadmap

seed#moc#literature#reading-list

Up: 50_Literature-MOC · plan

The tight path through everything collected here, sequenced to the 12-month plan. Read in service of doing (the next experiment), not completionism — but as a newcomer, read enough prior art to avoid reinventing solved wheels (Fast iteration beats parallelism). ✅ = open access.

Full lists: Foundations reading list · Clickstream foundational papers · Sequence modeling papers · Anomaly & drift papers · Uplift modeling papers.

▶ Start tomorrow (don't overthink it)

  1. MacKay ch. 2 (entropy/KL) → then do the by-hand exercise in Cross-entropy and KL divergence.
  2. Skim srivastava2000webusagemining (the WUM pipeline) — 30-min pass-1 only.
  3. Read laub2015hawkes §1–2 — you'll use it in the generator.

Phase 0 · Foundations & the lab habit (Q1)

Goal: enough math to build the synthetic generator and measure recovery honestly.

Phase 1 · Domain grounding (Q1→Q2)

Goal: don't reinvent sessionization / path modeling.

Phase 2 · First real track + skepticism reflex (Q2)

Goal: one shippable-or-killed result, with leakage/measurement caught. Read only the arm your track needs.

Phase 3 · The causal turn (Q3)

Goal: predict vs cause — uplift, not just propensity.

Phase 4 · Communicate & compound (Q4)