9.8 KiB

Raw Blame History

CSV Data File Standards for BMAD Workflows

Purpose and Usage

CSV data files in BMAD workflows serve specific purposes for different workflow types:

For Agents: Provide structured data that agents need to reference but cannot realistically generate (such as specific configurations, domain-specific data, or structured knowledge bases).

For Expert Agents: Supply specialized knowledge bases, reference data, or persistent information that the expert agent needs to access consistently across sessions.

For Workflows: Include reference data, configuration parameters, or structured inputs that guide workflow execution and decision-making.

Key Principle: CSV files should contain data that is essential, structured, and not easily generated by LLMs during execution.

Intent-Based Design Principle

Core Philosophy: The closer workflows stay to intent rather than prescriptive instructions, the more creative and adaptive the LLM experience becomes.

CSV Enables Intent-Based Design:

Instead of: Hardcoded scripts with exact phrases LLM must say
CSV Provides: Clear goals and patterns that LLM adapts creatively to context
Result: Natural, contextual conversations rather than rigid scripts

Example - Advanced Elicitation:

Prescriptive Alternative: 50 separate files with exact conversation scripts
Intent-Based Reality: One CSV row with method goal + pattern → LLM adapts to user
Benefit: Same method works differently for different users while maintaining essence

Intent vs Prescriptive Spectrum:

Highly Prescriptive: "Say exactly: 'Based on my analysis, I recommend...'"
Balanced Intent: "Help the user understand the implications using your professional judgment"
CSV Goal: Provide just enough guidance to enable creative, context-aware execution

Primary Use Cases

1. Knowledge Base Indexing (Document Lookup Optimization)

Problem: Large knowledge bases with hundreds of documents cause context blowup and missed details when LLMs try to process them all.

CSV Solution: Create a knowledge base index with:

Column 1: Keywords and topics
Column 2: Document file path/location
Column 3: Section or line number where relevant content starts
Column 4: Content type or summary (optional)

Result: Transform from context-blowing document loads to surgical precision lookups, creating agents with near-infinite knowledge bases while maintaining optimal context usage.

2. Workflow Sequence Optimization

Problem: Complex workflows (e.g., game development) with hundreds of potential steps for different scenarios become unwieldy and context-heavy.

CSV Solution: Create a workflow routing table:

Column 1: Scenario type (e.g., "2D Platformer", "RPG", "Puzzle Game")
Column 2: Required step sequence (e.g., "step-01,step-03,step-07,step-12")
Column 3: Document sections to include
Column 4: Specialized parameters or configurations

Result: Step 1 determines user needs, finds closest match in CSV, confirms with user, then follows optimized sequence - truly optimal for context usage.

3. Method Registry (Dynamic Technique Selection)

Problem: Tasks need to select optimal techniques from dozens of options based on context, without hardcoding selection logic.

CSV Solution: Create a method registry with:

Column 1: Category (collaboration, advanced, technical, creative, etc.)
Column 2: Method name and rich description
Column 3: Execution pattern or flow guide (e.g., "analysis → insights → action")
Column 4: Complexity level or use case indicators

Example: Advanced Elicitation task analyzes content context, selects 5 best-matched methods from 50 options, then executes dynamically using CSV descriptions.

Result: Smart, context-aware technique selection without hardcoded logic - infinitely extensible method libraries.

4. Configuration Management

Problem: Complex systems with many configuration options that vary by use case.

CSV Solution: Configuration lookup tables mapping scenarios to specific parameter sets.

What NOT to Include in CSV Files

Avoid Web-Searchable Data: Do not include information that LLMs can readily access through web search or that exists in their training data, such as:

Common programming syntax or standard library functions
General knowledge about widely used technologies
Historical facts or commonly available information
Basic terminology or standard definitions

Include Specialized Data: Focus on data that is:

Specific to your project or domain
Not readily available through web search
Essential for consistent workflow execution
Too voluminous for LLM context windows