Commit Graph

2 Commits

Author SHA1 Message Date
Alex Verkhovsky d9accf6e4e docs: add QD2 skeleton eval report comparing QD2 vs old QD baseline
Metrics from JSONL log mining: 56% fewer human turns (18→8),
32% fewer API turns (72→49), token cost at parity (1.14x).
One-shot path identified as broken (P0 fix priority).
2026-02-22 21:37:37 -07:00
Alex Verkhovsky df8548b2d9 docs: add QD2 skeleton test findings from 3 run analyses
Cross-references 3 QD2 test runs against the redesign plan.
Classifies 24 observations as Works/Gap/Execution Gap/Plan Gap/Plumbing.
Key findings: one-shot path bypasses step architecture, terse step files
cause execution gaps, plan-code-review path works end-to-end.
2026-02-22 20:52:55 -07:00