Commit Graph

8 Commits

Author SHA1 Message Date
Alex Verkhovsky c8c05c1695 docs: add reusable workflow eval prompt with JSONL mining methodology
Captures lessons from the skeleton eval: workflow boundary scoping,
human turn filtering, token deduplication, context normalization,
and five common pitfalls to avoid.
2026-02-22 21:40:48 -07:00
Alex Verkhovsky d9accf6e4e docs: add QD2 skeleton eval report comparing QD2 vs old QD baseline
Metrics from JSONL log mining: 56% fewer human turns (18→8),
32% fewer API turns (72→49), token cost at parity (1.14x).
One-shot path identified as broken (P0 fix priority).
2026-02-22 21:37:37 -07:00
Alex Verkhovsky df8548b2d9 docs: add QD2 skeleton test findings from 3 run analyses
Cross-references 3 QD2 test runs against the redesign plan.
Classifies 24 observations as Works/Gap/Execution Gap/Plan Gap/Plumbing.
Key findings: one-shot path bypasses step architecture, terse step files
cause execution gaps, plan-code-review path works end-to-end.
2026-02-22 20:52:55 -07:00
Alex Verkhovsky 3126b9c4ee refactor(workflows): simplify QD2 steps and enforce ask-dont-fantasize
Strip task sharding, rigid approval menu, and VC backfill. Add
intent_gap-first classification cascade to review. Use slug-based
spec naming in plan step.
2026-02-22 20:42:53 -07:00
Alex Verkhovsky 1f33bd3395 docs: add QD2 run logs and reusable capture-run prompt
Second QD2 skeleton test run (add plan-review step to task-01) with
raw JSONL log and analysis. Add CAPTURE-RUN.md as a repeatable prompt
for post-run log capture and summarization.
2026-02-22 18:28:55 -07:00
Alex Verkhovsky 398d67bb89 feat: Introduce Spec-driven development (SDD) methodology with ideal Grok spec and initial experiment run logs. 2026-02-22 18:11:41 -07:00
Alex Verkhovsky 4dd5a0c871 docs: add roadmap task files for QD2 experiment
18 tasks: skeleton test/eval, then per-step tighten/test/eval
cycles, plus end-to-end eval. Each task file is a self-contained
intent expression that can be fed to QD2.
2026-02-22 13:02:06 -07:00
Alex Verkhovsky c2da172988 docs: add planning artifacts for quick-dev2 experiment
Reference specs from the design sessions: full redesign plan,
session state with all design decisions, and implementation roadmap.
2026-02-22 12:56:40 -07:00