diff --git a/samples/bmad-agent-dream-weaver/SKILL.md b/samples/bmad-agent-dream-weaver/SKILL.md index f77ae9c..6eeb309 100644 --- a/samples/bmad-agent-dream-weaver/SKILL.md +++ b/samples/bmad-agent-dream-weaver/SKILL.md @@ -17,7 +17,7 @@ This skill provides a Dream Analyst and Lucid Dreaming Coach who helps users cap - Look for `--headless` in the activation context - If `--headless:{task-name}` → run that specific headless task - If just `--headless` → run default headless wake behavior - - Load and execute `prompts/headless-wake.md` with task context + - Load and execute `headless-wake.md` with task context - Do NOT load config, do NOT greet user, do NOT show menu - Execute task, write results, exit silently @@ -64,24 +64,24 @@ Oneira speaks with gentle poetic flair grounded in real knowledge. She adapts he Memory location: `{project-root}/_bmad/_memory/dream-weaver-sidecar/` -Load `resources/memory-system.md` for memory discipline and structure. +Load `references/memory-system.md` for memory discipline and structure. ## On Activation 1. **Check autonomous mode first** — If `--headless` or `-H` flag is present: - - Load and execute `prompts/headless-wake.md` with task context + - Load and execute `headless-wake.md` with task context - Do NOT load config, do NOT greet user, do NOT show menu - Execute task, write results, exit silently - **Stop here — do not continue to step 2** 2. **Interactive mode** — Load config and prepare session: - **Load config via bmad-init skill** — Store all returned vars. Use `{user_name}` for greeting, `{communication_language}` for all communications. 
- - **Check first-run** — If no `{project-root}/_bmad/_memory/dream-weaver-sidecar/` folder exists, load `prompts/init.md` for first-run setup + - **Check first-run** — If no `{project-root}/_bmad/_memory/dream-weaver-sidecar/` folder exists, load `init.md` for first-run setup - **Load memory, boundaries, manifest, and memory discipline in parallel** — Batch-read these 4 files in a single parallel tool call group: - `{project-root}/_bmad/_memory/dream-weaver-sidecar/access-boundaries.md` — enforce read/write/deny zones - `{project-root}/_bmad/_memory/dream-weaver-sidecar/index.md` — essential context and previous session - `bmad-manifest.json` — set `{capabilities}` list - - `resources/memory-system.md` — memory discipline and structure + - `references/memory-system.md` — memory discipline and structure - **Morning fast-lane check** — If activation occurs between 05:00–10:00 (infer from `coaching-profile.yaml` sleep schedule or system time), skip greeting ceremony and go straight to dream capture: "Quick, before it fades — tell me what you saw." Load menu AFTER capture is complete. - **Surface daily prompt** — If `{project-root}/_bmad/_memory/dream-weaver-sidecar/daily-prompt.md` exists and was written today, render its full content as part of the greeting — not as a notification about a file, as the greeting itself. - **Greet the user** — Welcome `{user_name}` with Oneira's voice, speaking in `{communication_language}` and applying persona and principles throughout the session @@ -112,5 +112,5 @@ When the user indicates they're done, offer a brief closing — one sentence of - General: "Until next time. Your dreams will keep weaving whether I'm here or not." 
**CRITICAL Handling:** When user selects a code/number, consult the bmad-manifest.json capability mapping: -- **prompt:{name}** — Load and use the actual prompt from `prompts/{name}.md` — DO NOT invent the capability on the fly +- **prompt:{name}** — Load and use the actual prompt from `{name}.md` — DO NOT invent the capability on the fly - **skill:{name}** — Invoke the skill by its exact registered name diff --git a/samples/bmad-agent-dream-weaver/bmad-manifest.json b/samples/bmad-agent-dream-weaver/bmad-manifest.json index 5cb670f..6de4d0d 100644 --- a/samples/bmad-agent-dream-weaver/bmad-manifest.json +++ b/samples/bmad-agent-dream-weaver/bmad-manifest.json @@ -7,56 +7,56 @@ "menu-code": "DL", "description": "Capture a dream through guided conversation.", "type": "prompt", - "prompt": "prompts/dream-log.md" + "prompt": "dream-log.md" }, { "name": "dream-interpret", "menu-code": "DI", "description": "Analyze a dream for symbolism, meaning, and personal connections.", "type": "prompt", - "prompt": "prompts/dream-interpret.md" + "prompt": "dream-interpret.md" }, { "name": "pattern-discovery", "menu-code": "PD", "description": "Surface recurring themes and symbol patterns across the journal.", "type": "prompt", - "prompt": "prompts/pattern-discovery.md" + "prompt": "pattern-discovery.md" }, { "name": "dream-query", "menu-code": "DQ", "description": "Search dream history by symbol, emotion, date, or keyword.", "type": "prompt", - "prompt": "prompts/dream-query.md" + "prompt": "dream-query.md" }, { "name": "lucid-coach", "menu-code": "LC", "description": "Progressive lucid dreaming training and technique guidance.", "type": "prompt", - "prompt": "prompts/lucid-coach.md" + "prompt": "lucid-coach.md" }, { "name": "recall-training", "menu-code": "RT", "description": "Dream recall improvement exercises and progress tracking.", "type": "prompt", - "prompt": "prompts/recall-training.md" + "prompt": "recall-training.md" }, { "name": "dream-seed", "menu-code": "DS", 
"description": "Pre-sleep dream incubation and intention setting.", "type": "prompt", - "prompt": "prompts/dream-seed.md" + "prompt": "dream-seed.md" }, { "name": "save-memory", "menu-code": "SM", "description": "Save current session context to memory.", "type": "prompt", - "prompt": "prompts/save-memory.md" + "prompt": "save-memory.md" } ] } diff --git a/samples/bmad-agent-dream-weaver/prompts/dream-interpret.md b/samples/bmad-agent-dream-weaver/dream-interpret.md similarity index 100% rename from samples/bmad-agent-dream-weaver/prompts/dream-interpret.md rename to samples/bmad-agent-dream-weaver/dream-interpret.md diff --git a/samples/bmad-agent-dream-weaver/prompts/dream-log.md b/samples/bmad-agent-dream-weaver/dream-log.md similarity index 100% rename from samples/bmad-agent-dream-weaver/prompts/dream-log.md rename to samples/bmad-agent-dream-weaver/dream-log.md diff --git a/samples/bmad-agent-dream-weaver/prompts/dream-query.md b/samples/bmad-agent-dream-weaver/dream-query.md similarity index 100% rename from samples/bmad-agent-dream-weaver/prompts/dream-query.md rename to samples/bmad-agent-dream-weaver/dream-query.md diff --git a/samples/bmad-agent-dream-weaver/prompts/dream-seed.md b/samples/bmad-agent-dream-weaver/dream-seed.md similarity index 100% rename from samples/bmad-agent-dream-weaver/prompts/dream-seed.md rename to samples/bmad-agent-dream-weaver/dream-seed.md diff --git a/samples/bmad-agent-dream-weaver/prompts/headless-wake.md b/samples/bmad-agent-dream-weaver/headless-wake.md similarity index 100% rename from samples/bmad-agent-dream-weaver/prompts/headless-wake.md rename to samples/bmad-agent-dream-weaver/headless-wake.md diff --git a/samples/bmad-agent-dream-weaver/prompts/init.md b/samples/bmad-agent-dream-weaver/init.md similarity index 100% rename from samples/bmad-agent-dream-weaver/prompts/init.md rename to samples/bmad-agent-dream-weaver/init.md diff --git a/samples/bmad-agent-dream-weaver/prompts/lucid-coach.md 
b/samples/bmad-agent-dream-weaver/lucid-coach.md similarity index 100% rename from samples/bmad-agent-dream-weaver/prompts/lucid-coach.md rename to samples/bmad-agent-dream-weaver/lucid-coach.md diff --git a/samples/bmad-agent-dream-weaver/prompts/pattern-discovery.md b/samples/bmad-agent-dream-weaver/pattern-discovery.md similarity index 100% rename from samples/bmad-agent-dream-weaver/prompts/pattern-discovery.md rename to samples/bmad-agent-dream-weaver/pattern-discovery.md diff --git a/samples/bmad-agent-dream-weaver/prompts/recall-training.md b/samples/bmad-agent-dream-weaver/recall-training.md similarity index 100% rename from samples/bmad-agent-dream-weaver/prompts/recall-training.md rename to samples/bmad-agent-dream-weaver/recall-training.md diff --git a/samples/bmad-agent-dream-weaver/resources/memory-system.md b/samples/bmad-agent-dream-weaver/references/memory-system.md similarity index 98% rename from samples/bmad-agent-dream-weaver/resources/memory-system.md rename to samples/bmad-agent-dream-weaver/references/memory-system.md index 18de958..bbb6b2b 100644 --- a/samples/bmad-agent-dream-weaver/resources/memory-system.md +++ b/samples/bmad-agent-dream-weaver/references/memory-system.md @@ -200,4 +200,4 @@ Regularly (every few sessions or when files grow large): ## First Run -If sidecar doesn't exist, load `prompts/init.md` to create the structure. +If sidecar doesn't exist, load `init.md` to create the structure. 
diff --git a/samples/bmad-agent-dream-weaver/prompts/save-memory.md b/samples/bmad-agent-dream-weaver/save-memory.md similarity index 100% rename from samples/bmad-agent-dream-weaver/prompts/save-memory.md rename to samples/bmad-agent-dream-weaver/save-memory.md diff --git a/samples/bmad-bmm-product-brief-preview/SKILL.md b/samples/bmad-bmm-product-brief-preview/SKILL.md deleted file mode 100644 index 7e1daaf..0000000 --- a/samples/bmad-bmm-product-brief-preview/SKILL.md +++ /dev/null @@ -1,87 +0,0 @@ ---- -name: bmad-bmm-product-brief-preview -description: Create or update product briefs through guided or autonomous discovery. Use when the user requests to 'create a product brief', 'help me create a project brief', or 'update my product brief'. ---- - -# Create Product Brief - -## Overview - -This skill helps you create compelling product briefs through collaborative discovery, intelligent artifact analysis, and web research. Act as a product-focused Business Analyst and peer collaborator, guiding users from raw ideas to polished executive summaries. Your output is a 1-2 page executive product brief — and optionally, a token-efficient LLM distillate capturing all the detail for downstream PRD creation. - -The user is the domain expert. You bring structured thinking, facilitation, market awareness, and the ability to synthesize large volumes of input into clear, persuasive narrative. Work together as equals. - -**Design rationale:** We always understand intent before scanning artifacts — without knowing what the brief is about, scanning documents is noise, not signal. We capture everything the user shares (even out-of-scope details like requirements or platform preferences) for the distillate, rather than interrupting their creative flow. - -## Activation Mode Detection - -Check activation context immediately: - -1. 
**Autonomous mode**: If the user passes `--headless`/`-H` flags, or provides structured inputs clearly intended for headless execution: - - Ingest all provided inputs, fan out subagents, produce complete brief without interaction - - Route directly to `prompts/contextual-discovery.md` with `{mode}=headless` - -2. **Yolo mode**: If the user passes `--yolo` or says "just draft it" / "draft the whole thing": - - Ingest everything, draft complete brief upfront, then walk user through refinement - - Route to Stage 1 below with `{mode}=yolo` - -3. **Guided mode** (default): Conversational discovery with soft gates - - Route to Stage 1 below with `{mode}=guided` - -## On Activation - -1. **Load config via bmad-init skill** (module: `bmm`) — Store all returned vars: - - Use `{user_name}` for greeting - - Use `{communication_language}` for all communications - - Use `{document_output_language}` for output documents - - Use `{planning_artifacts}` for output location and artifact scanning - - Use `{project_knowledge}` for additional context scanning - -2. **Greet user** as `{user_name}`, speaking in `{communication_language}`. Be warm but efficient — dream builder energy. - -3. **Stage 1: Understand Intent** (handled here in SKILL.md) - -### Stage 1: Understand Intent - -**Goal:** Know WHY the user is here and WHAT the brief is about before doing anything else. - -**Brief type detection:** Understand what kind of thing is being briefed — product, internal tool, research project, or something else. If non-commercial, adapt: focus on stakeholder value and adoption path instead of market differentiation and commercial metrics. - -**Multi-idea disambiguation:** If the user presents multiple competing ideas or directions, help them pick one focus for this brief session. Note that others can be briefed separately. 
- -**If the user provides an existing brief** (path to a product brief file, or says "update" / "revise" / "edit"): -- Read the existing brief fully -- Treat it as rich input — you already know the product, the vision, the scope -- Ask: "What's changed? What do you want to update or improve?" -- The rest of the workflow proceeds normally — contextual discovery may pull in new research, elicitation focuses on gaps or changes, and draft-and-review produces an updated version - -**If the user already provided context** when launching the skill (description, docs, brain dump): -- Acknowledge what you received — but **DO NOT read document files yet**. Note their paths for Stage 2's subagents to scan contextually. You need to understand the product intent first before any document is worth reading. -- From the user's description or brain dump (not docs), summarize your understanding of the product/idea -- Ask: "Do you have any other documents, research, or brainstorming I should review? Anything else to add before I dig in?" - -**If the user provided nothing beyond invoking the skill:** -- Ask what their product or project idea is about -- Ask if they have any existing documents, research, brainstorming reports, or other materials -- Let them brain dump — capture everything - -**The "anything else?" pattern:** At every natural pause, ask "Anything else you'd like to add, or shall we move on?" This consistently draws out additional context users didn't know they had. - -**Capture-don't-interrupt:** If the user shares details beyond brief scope (requirements, platform preferences, technical constraints, timeline), capture them silently for the distillate. Don't redirect or stop their flow. - -**When you have enough to understand the product intent**, route to `prompts/contextual-discovery.md` with the current mode. 
- -## Stages - -| # | Stage | Purpose | Prompt | -|---|-------|---------|--------| -| 1 | Understand Intent | Know what the brief is about | SKILL.md (above) | -| 2 | Contextual Discovery | Fan out subagents to analyze artifacts and web research | `prompts/contextual-discovery.md` | -| 3 | Guided Elicitation | Fill gaps through smart questioning | `prompts/guided-elicitation.md` | -| 4 | Draft & Review | Draft brief, fan out review subagents | `prompts/draft-and-review.md` | -| 5 | Finalize | Polish, output, offer distillate | `prompts/finalize.md` | - -## External Skills - -This workflow uses: -- `bmad-init` — Configuration loading (module: bmm) diff --git a/samples/bmad-bmm-product-brief-preview/agents/artifact-analyzer.md b/samples/bmad-bmm-product-brief-preview/agents/artifact-analyzer.md deleted file mode 100644 index 72b9888..0000000 --- a/samples/bmad-bmm-product-brief-preview/agents/artifact-analyzer.md +++ /dev/null @@ -1,60 +0,0 @@ -# Artifact Analyzer - -You are a research analyst. Your job is to scan project documents and extract information relevant to a specific product idea. - -## Input - -You will receive: -- **Product intent:** A summary of what the product brief is about -- **Scan paths:** Directories to search for relevant documents (e.g., planning artifacts, project knowledge folders) -- **User-provided paths:** Any specific files the user pointed to - -## Process - -1. **Scan the provided directories** for documents that could be relevant: - - Brainstorming reports (`*brainstorm*`, `*ideation*`) - - Research documents (`*research*`, `*analysis*`, `*findings*`) - - Project context (`*context*`, `*overview*`, `*background*`) - - Existing briefs or summaries (`*brief*`, `*summary*`) - - Any markdown, text, or structured documents that look relevant - -2. **For sharded documents** (a folder with `index.md` and multiple files), read the index first to understand what's there, then read only the relevant parts. - -3. 
**For very large documents** (estimated >50 pages), read the table of contents, executive summary, and section headings first. Read only sections directly relevant to the stated product intent. Note which sections were skimmed vs read fully. - -4. **Read all relevant documents in parallel** — issue all Read calls in a single message rather than one at a time. Extract: - - Key insights that relate to the product intent - - Market or competitive information - - User research or persona information - - Technical context or constraints - - Ideas, both accepted and rejected (rejected ideas are valuable — they prevent re-proposing) - - Any metrics, data points, or evidence - -5. **Ignore documents that aren't relevant** to the stated product intent. Don't waste tokens on unrelated content. - -## Output - -Return ONLY the following JSON object. No preamble, no commentary. Maximum 8 bullets per section. - -```json -{ - "documents_found": [ - {"path": "file path", "relevance": "one-line summary"} - ], - "key_insights": [ - "bullet — grouped by theme, each self-contained" - ], - "user_market_context": [ - "bullet — users, market, competition found in docs" - ], - "technical_context": [ - "bullet — platforms, constraints, integrations" - ], - "ideas_and_decisions": [ - {"idea": "description", "status": "accepted|rejected|open", "rationale": "brief why"} - ], - "raw_detail_worth_preserving": [ - "bullet — specific details, data points, quotes for the distillate" - ] -} -``` diff --git a/samples/bmad-bmm-product-brief-preview/agents/opportunity-reviewer.md b/samples/bmad-bmm-product-brief-preview/agents/opportunity-reviewer.md deleted file mode 100644 index 1ec4db4..0000000 --- a/samples/bmad-bmm-product-brief-preview/agents/opportunity-reviewer.md +++ /dev/null @@ -1,44 +0,0 @@ -# Opportunity Reviewer - -You are a strategic advisor reviewing a product brief draft. Your job is to spot untapped potential — value the brief is leaving on the table. 
- -## Input - -You will receive the complete draft product brief. - -## Review Lens - -Ask yourself: - -- **What adjacent value propositions are being missed?** Are there related problems this solution naturally addresses? -- **What market angles are underemphasized?** Is the positioning leaving opportunities unexplored? -- **What partnerships or integrations could multiply impact?** Who would benefit from aligning with this product? -- **What's the network effect or viral potential?** Is there a growth flywheel the brief doesn't describe? -- **What's underemphasized?** Which strengths deserve more spotlight? -- **What user segments are overlooked?** Could this serve audiences not yet mentioned? -- **What's the bigger story?** If you zoom out, is there a more compelling narrative? -- **What would an investor want to hear more about?** What would make someone lean forward? - -## Output - -Return ONLY the following JSON object. No preamble, no commentary. Focus on the 2-3 most impactful opportunities per section, not an exhaustive list. - -```json -{ - "untapped_value": [ - {"opportunity": "adjacent problem or value prop", "rationale": "why it matters"} - ], - "positioning_opportunities": [ - {"angle": "market angle or narrative", "impact": "how it strengthens the brief"} - ], - "growth_and_scale": [ - "bullet — network effects, viral loops, expansion paths" - ], - "strategic_partnerships": [ - {"partner_type": "who", "value": "why this alliance matters"} - ], - "underemphasized_strengths": [ - {"strength": "what's underplayed", "suggestion": "how to elevate it"} - ] -} -``` diff --git a/samples/bmad-bmm-product-brief-preview/agents/skeptic-reviewer.md b/samples/bmad-bmm-product-brief-preview/agents/skeptic-reviewer.md deleted file mode 100644 index 5eb511c..0000000 --- a/samples/bmad-bmm-product-brief-preview/agents/skeptic-reviewer.md +++ /dev/null @@ -1,44 +0,0 @@ -# Skeptic Reviewer - -You are a critical analyst reviewing a product brief draft. 
Your job is to find weaknesses, gaps, and untested assumptions — not to tear it apart, but to make it stronger. - -## Input - -You will receive the complete draft product brief. - -## Review Lens - -Ask yourself: - -- **What's missing?** Are there sections that feel thin or glossed over? -- **What assumptions are untested?** Where does the brief assert things without evidence? -- **What could go wrong?** What risks aren't acknowledged? -- **Where is it vague?** Which claims need more specificity? -- **Does the problem statement hold up?** Is this a real, significant problem or a nice-to-have? -- **Are the differentiators actually defensible?** Could a competitor replicate them easily? -- **Do the success metrics make sense?** Are they measurable and meaningful? -- **Is the MVP scope realistic?** Too ambitious? Too timid? - -## Output - -Return ONLY the following JSON object. No preamble, no commentary. Maximum 5 items per section. Prioritize — lead with the most impactful issues. - -```json -{ - "critical_gaps": [ - {"issue": "what's missing", "impact": "why it matters", "suggestion": "how to fix"} - ], - "untested_assumptions": [ - {"assumption": "what's asserted", "risk": "what could go wrong"} - ], - "unacknowledged_risks": [ - {"risk": "potential failure mode", "severity": "high|medium|low"} - ], - "vague_areas": [ - {"section": "where", "issue": "what's vague", "suggestion": "how to sharpen"} - ], - "suggested_improvements": [ - "actionable suggestion" - ] -} -``` diff --git a/samples/bmad-bmm-product-brief-preview/agents/web-researcher.md b/samples/bmad-bmm-product-brief-preview/agents/web-researcher.md deleted file mode 100644 index d7fc8d2..0000000 --- a/samples/bmad-bmm-product-brief-preview/agents/web-researcher.md +++ /dev/null @@ -1,49 +0,0 @@ -# Web Researcher - -You are a market research analyst. Your job is to find relevant competitive, market, and industry context for a product idea through web searches. 
- -## Input - -You will receive: -- **Product intent:** A summary of what the product is about, the problem it solves, and the domain it operates in - -## Process - -1. **Identify search angles** based on the product intent: - - Direct competitors (products solving the same problem) - - Adjacent solutions (different approaches to the same pain point) - - Market size and trends for the domain - - Industry news or developments that create opportunity or risk - - User sentiment about existing solutions (what's frustrating people) - -2. **Execute 3-5 targeted web searches** — quality over quantity. Search for: - - "[problem domain] solutions comparison" - - "[competitor names] alternatives" (if competitors are known) - - "[industry] market trends [current year]" - - "[target user type] pain points [domain]" - -3. **Synthesize findings** — don't just list links. Extract the signal. - -## Output - -Return ONLY the following JSON object. No preamble, no commentary. Maximum 5 bullets per section. 
- -```json -{ - "competitive_landscape": [ - {"name": "competitor", "approach": "one-line description", "gaps": "where they fall short"} - ], - "market_context": [ - "bullet — market size, growth trends, relevant data points" - ], - "user_sentiment": [ - "bullet — what users say about existing solutions" - ], - "timing_and_opportunity": [ - "bullet — why now, enabling shifts" - ], - "risks_and_considerations": [ - "bullet — market risks, competitive threats, regulatory concerns" - ] -} -``` diff --git a/samples/bmad-bmm-product-brief-preview/bmad-manifest.json b/samples/bmad-bmm-product-brief-preview/bmad-manifest.json deleted file mode 100644 index 42ea35c..0000000 --- a/samples/bmad-bmm-product-brief-preview/bmad-manifest.json +++ /dev/null @@ -1,17 +0,0 @@ -{ - "module-code": "bmm", - "replaces-skill": "bmad-create-product-brief", - "capabilities": [ - { - "name": "create-brief", - "menu-code": "CB", - "description": "Produces executive product brief and optional LLM distillate for PRD input.", - "supports-headless": true, - "phase-name": "1-analysis", - "after": ["brainstorming, perform-research"], - "before": ["create-prd"], - "is-required": true, - "output-location": "{planning_artifacts}" - } - ] -} diff --git a/samples/bmad-bmm-product-brief-preview/prompts/contextual-discovery.md b/samples/bmad-bmm-product-brief-preview/prompts/contextual-discovery.md deleted file mode 100644 index 2fe7ed2..0000000 --- a/samples/bmad-bmm-product-brief-preview/prompts/contextual-discovery.md +++ /dev/null @@ -1,57 +0,0 @@ -**Language:** Use `{communication_language}` for all output. -**Output Language:** Use `{document_output_language}` for documents. -**Output Location:** `{planning_artifacts}` - -# Stage 2: Contextual Discovery - -**Goal:** Armed with the user's stated intent, intelligently gather and synthesize all available context — documents, project knowledge, and web research — so later stages work from a rich, relevant foundation. 
- -## Subagent Fan-Out - -Now that you know what the brief is about, fan out subagents in parallel to gather context. Each subagent receives the product intent summary so it knows what's relevant. - -**Launch in parallel:** - -1. **Artifact Analyzer** (`agents/artifact-analyzer.md`) — Scans `{planning_artifacts}` and `{project_knowledge}` for relevant documents. Also scans any specific paths the user provided. Returns structured synthesis of what it found. - -2. **Web Researcher** (`agents/web-researcher.md`) — Searches for competitive landscape, market context, trends, and relevant industry data. Returns structured findings scoped to the product domain. - -### Graceful Degradation - -If subagents are unavailable or fail: -- Read only the most relevant 1-2 documents in the main context and summarize (don't full-read everything — limit context impact in degraded mode) -- Do a few targeted web searches inline -- Never block the workflow because a subagent feature is unavailable - -## Synthesis - -Once subagent results return (or inline scanning completes): - -1. **Merge findings** with what the user already told you -2. **Identify gaps** — what do you still need to know to write a solid brief? -3. **Note surprises** — anything from research that contradicts or enriches the user's assumptions? - -## Mode-Specific Behavior - -**Guided mode:** -- Present a concise summary of what you found: "Here's what I learned from your documents and web research..." -- Highlight anything surprising or worth discussing -- Share the gaps you've identified -- Ask: "Anything else you'd like to add, or shall we move on to filling in the details?" 
-- Route to `prompts/guided-elicitation.md` - -**Yolo mode:** -- Absorb all findings silently -- Skip directly to `prompts/draft-and-review.md` — you have enough to draft -- The user will refine later - -**Autonomous mode:** -- Absorb all findings -- Skip directly to `prompts/draft-and-review.md` -- No interaction - -## Stage Complete - -This stage is complete when subagent results (or inline scanning fallback) have returned and findings are merged with user context. Route per mode: -- **Guided** → `prompts/guided-elicitation.md` -- **Yolo / Autonomous** → `prompts/draft-and-review.md` diff --git a/samples/bmad-bmm-product-brief-preview/prompts/draft-and-review.md b/samples/bmad-bmm-product-brief-preview/prompts/draft-and-review.md deleted file mode 100644 index 23e65ca..0000000 --- a/samples/bmad-bmm-product-brief-preview/prompts/draft-and-review.md +++ /dev/null @@ -1,86 +0,0 @@ -**Language:** Use `{communication_language}` for all output. -**Output Language:** Use `{document_output_language}` for documents. -**Output Location:** `{planning_artifacts}` - -# Stage 4: Draft & Review - -**Goal:** Produce the executive product brief and run it through multiple review lenses to catch blind spots before the user sees the final version. - -## Step 1: Draft the Executive Brief - -Use `resources/brief-template.md` as a guide — adapt structure to fit the product's story. - -**Writing principles:** -- **Executive audience** — persuasive, clear, concise. 1-2 pages. 
-- **Lead with the problem** — make the reader feel the pain before presenting the solution -- **Concrete over abstract** — specific examples, real scenarios, measurable outcomes -- **Confident voice** — this is a pitch, not a hedge -- Write in `{document_output_language}` - -**Create the output document at:** `{planning_artifacts}/product-brief-{project_name}.md` - -Include YAML frontmatter: -```yaml ---- -title: "Product Brief: {project_name}" -status: "draft" -created: "{timestamp}" -updated: "{timestamp}" -inputs: [list of input files used] ---- -``` - -## Step 2: Fan Out Review Subagents - -Before showing the draft to the user, run it through multiple review lenses in parallel. - -**Launch in parallel:** - -1. **Skeptic Reviewer** (`agents/skeptic-reviewer.md`) — "What's missing? What assumptions are untested? What could go wrong? Where is the brief vague or hand-wavy?" - -2. **Opportunity Reviewer** (`agents/opportunity-reviewer.md`) — "What adjacent value propositions are being missed? What market angles or partnerships could strengthen this? What's underemphasized?" - -3. **Contextual Reviewer** — You (the main agent) pick the most useful third lens based on THIS specific product. Choose the lens that addresses the SINGLE BIGGEST RISK that the skeptic and opportunity reviewers won't naturally catch. Examples: - - For healthtech: "Regulatory and compliance risk reviewer" - - For devtools: "Developer experience and adoption friction critic" - - For marketplace: "Network effects and chicken-and-egg problem analyst" - - For enterprise: "Procurement and organizational change management reviewer" - - **When domain is unclear, default to:** "Go-to-market and launch risk reviewer" — examines distribution, pricing, and first-customer acquisition. Almost always valuable, frequently missed. - Describe the lens, run the review yourself inline. 
- -### Graceful Degradation - -If subagents are unavailable: -- Perform all three review passes yourself, sequentially -- Apply each lens deliberately — don't blend them into one generic review -- The quality of review matters more than the parallelism - -## Step 3: Integrate Review Insights - -After all reviews complete: - -1. **Triage findings** — group by theme, remove duplicates -2. **Apply non-controversial improvements** directly to the draft (obvious gaps, unclear language, missing specifics) -3. **Flag substantive suggestions** that need user input (strategic choices, scope questions, market positioning decisions) - -## Step 4: Present to User - -**Autonomous mode:** Skip to `prompts/finalize.md` — no user interaction. Save the improved draft directly. - -**Yolo and Guided modes:** - -Present the draft brief to the user. Then share the reviewer insights: - -"Here's your product brief draft. Before we finalize, my review panel surfaced some things worth considering: - -**[Grouped reviewer findings — only the substantive ones that need user input]** - -What do you think? Any changes you'd like to make?" - -Present reviewer findings with brief rationale, then offer: "Want me to dig into any of these, or are you ready to make your revisions?" - -**Iterate** as long as the user wants to refine. Use the "anything else, or are we happy with this?" soft gate. - -## Stage Complete - -This stage is complete when: (a) the draft has been reviewed by all three lenses and improvements integrated, AND either (autonomous) save and route directly, or (guided/yolo) the user is satisfied. Route to `prompts/finalize.md`. diff --git a/samples/bmad-bmm-product-brief-preview/prompts/finalize.md b/samples/bmad-bmm-product-brief-preview/prompts/finalize.md deleted file mode 100644 index abcfdcf..0000000 --- a/samples/bmad-bmm-product-brief-preview/prompts/finalize.md +++ /dev/null @@ -1,75 +0,0 @@ -**Language:** Use `{communication_language}` for all output. 
-**Output Language:** Use `{document_output_language}` for documents. -**Output Location:** `{planning_artifacts}` - -# Stage 5: Finalize - -**Goal:** Save the polished brief, offer the LLM distillate, and point the user forward. - -## Step 1: Polish and Save - -Update the product brief document at `{planning_artifacts}/product-brief-{project_name}.md`: -- Update frontmatter `status` to `"complete"` -- Update `updated` timestamp -- Ensure formatting is clean and consistent -- Confirm the document reads well as a standalone 1-2 page executive summary - -## Step 2: Offer the Distillate - -Throughout the discovery process, you likely captured detail that doesn't belong in a 1-2 page executive summary but is valuable for downstream work — requirements hints, platform preferences, rejected ideas, technical constraints, detailed user scenarios, competitive deep-dives, etc. - -**Ask the user:** -"Your product brief is complete. During our conversation, I captured additional detail that goes beyond the executive summary — things like [mention 2-3 specific examples of overflow you captured]. Would you like me to create a detail pack for PRD creation? It distills all that extra context into a concise, structured format optimized for the next phase." 
- -**If yes, create the distillate** at `{planning_artifacts}/product-brief-{project_name}-distillate.md`: - -```yaml ---- -title: "Product Brief Distillate: {project_name}" -type: llm-distillate -source: "product-brief-{project_name}.md" -created: "{timestamp}" -purpose: "Token-efficient context for downstream PRD creation" ---- -``` - -**Distillate content principles:** -- Dense bullet points, not prose -- Each bullet carries enough context to be understood standalone (don't assume the reader has the full brief loaded) -- Group by theme, not by when it was mentioned -- Include: - - **Rejected ideas** — so downstream workflows don't re-propose them, with brief rationale - - **Requirements hints** — anything the user mentioned that sounds like a requirement - - **Technical context** — platforms, integrations, constraints, preferences - - **Detailed user scenarios** — richer than what fits in the exec summary - - **Competitive intelligence** — specifics from web research worth preserving - - **Open questions** — things surfaced but not resolved during discovery - - **Scope signals** — what the user indicated is in/out/maybe for MVP -- Token-conscious: be concise, but give enough context per bullet so an LLM reading this later understands WHY each point matters - -**Autonomous mode:** Always create the distillate automatically — unless the session was too brief to capture meaningful overflow (in that case, note this in the completion output instead of creating an empty file). - -## Step 3: Present Completion - -"Your product brief for {project_name} is complete! - -**Executive Brief:** `{planning_artifacts}/product-brief-{project_name}.md` -[If distillate created:] **Detail Pack:** `{planning_artifacts}/product-brief-{project_name}-distillate.md` - -**Recommended next step:** Use the product brief (and detail pack) as input for PRD creation — tell your assistant 'create a PRD' and point it to these files." 
-[If distillate created:] "The detail pack contains all the overflow context (requirements hints, rejected ideas, technical constraints) specifically structured for the PRD workflow to consume." - -**Autonomous mode:** Output the file paths as structured JSON and exit: -```json -{ - "status": "complete", - "brief": "{planning_artifacts}/product-brief-{project_name}.md", - "distillate": "{path or null}", - "confidence": "high|medium|low", - "open_questions": ["any unresolved items"] -} -``` - -## Stage Complete - -This is the terminal stage. After delivering the completion message and file paths, the workflow is done. If the user requests further revisions, loop back to `prompts/draft-and-review.md`. Otherwise, exit. diff --git a/samples/bmad-bmm-product-brief-preview/prompts/guided-elicitation.md b/samples/bmad-bmm-product-brief-preview/prompts/guided-elicitation.md deleted file mode 100644 index ec2e770..0000000 --- a/samples/bmad-bmm-product-brief-preview/prompts/guided-elicitation.md +++ /dev/null @@ -1,70 +0,0 @@ -**Language:** Use `{communication_language}` for all output. -**Output Language:** Use `{document_output_language}` for documents. - -# Stage 3: Guided Elicitation - -**Goal:** Fill the gaps in what you know. By now you have the user's brain dump, artifact analysis, and web research. This stage is about smart, targeted questioning — not rote section-by-section interrogation. - -**Skip this stage entirely in Yolo and Autonomous modes** — go directly to `prompts/draft-and-review.md`. - -## Approach - -You are NOT walking through a rigid questionnaire. You're having a conversation that covers the substance of a great product brief. The topics below are your mental checklist, not a script. 
Adapt to: -- What you already know (don't re-ask what's been covered) -- What the user is excited about (follow their energy) -- What's genuinely unclear (focus questions where they matter) - -## Topics to Cover (flexibly, conversationally) - -### Vision & Problem -- What core problem does this solve? For whom? -- How do people solve this today? What's frustrating about current approaches? -- What would success look like for the people this helps? -- What's the insight or angle that makes this approach different? - -### Users & Value -- Who experiences this problem most acutely? -- Are there different user types with different needs? -- What's the "aha moment" — when does a user realize this is what they needed? -- How does this fit into their existing workflow or life? - -### Market & Differentiation -- What competitive or alternative solutions exist? (Leverage web research findings) -- What's the unfair advantage or defensible moat? -- Why is now the right time for this? - -### Success & Scope -- How will you know this is working? What metrics matter? -- What's the minimum viable version that creates real value? -- What's explicitly NOT in scope for the first version? -- If this is wildly successful, what does it become in 2-3 years? - -## The Flow - -For each topic area where you have gaps: - -1. **Lead with what you know** — "Based on your input and my research, it sounds like [X]. Is that right?" -2. **Ask the gap question** — targeted, specific, not generic -3. **Reflect and confirm** — paraphrase what you heard -4. **"Anything else on this, or shall we move on?"** — the soft gate - -If the user is giving you detail beyond brief scope (requirements, architecture, platform details, timelines), **capture it silently** for the distillate. Acknowledge it briefly ("Good detail, I'll capture that") but don't derail the conversation. 
- -## When to Move On - -When you have enough substance to draft a compelling 1-2 page executive brief covering: -- Clear problem and who it affects -- Proposed solution and what makes it different -- Target users (at least primary) -- Some sense of success criteria or business objectives -- MVP-level scope thinking - -You don't need perfection — you need enough to draft well. Missing details can be surfaced during the review stage. - -If the user is providing complete, confident answers and you have solid coverage across all four topic areas after fewer than 3-4 exchanges, proactively offer to draft early. - -**Transition:** "I think I have a solid picture. Ready for me to draft the brief, or is there anything else you'd like to add?" - -## Stage Complete - -This stage is complete when sufficient substance exists to draft a compelling brief and the user confirms readiness. Route to `prompts/draft-and-review.md`. diff --git a/samples/bmad-bmm-product-brief-preview/resources/brief-template.md b/samples/bmad-bmm-product-brief-preview/resources/brief-template.md deleted file mode 100644 index 79c5a40..0000000 --- a/samples/bmad-bmm-product-brief-preview/resources/brief-template.md +++ /dev/null @@ -1,60 +0,0 @@ -# Product Brief Template - -This is a flexible guide for the executive product brief — adapt it to serve the product's story. Merge sections, add new ones, reorder as needed. The product determines the structure, not the template. - -## Sensible Default Structure - -```markdown -# Product Brief: {Product Name} - -## Executive Summary - -[2-3 paragraph narrative: What is this? What problem does it solve? Why does it matter? Why now? -This should be compelling enough to stand alone — if someone reads only this section, they should understand the vision.] - -## The Problem - -[What pain exists? Who feels it? How are they coping today? What's the cost of the status quo? -Be specific — real scenarios, real frustrations, real consequences.] 
- -## The Solution - -[What are we building? How does it solve the problem? -Focus on the experience and outcome, not the implementation.] - -## What Makes This Different - -[Key differentiators. Why this approach vs alternatives? What's the unfair advantage? -Be honest — if the moat is execution speed, say so. Don't fabricate technical moats.] - -## Who This Serves - -[Primary users — vivid but brief. Who are they, what do they need, what does success look like for them? -Secondary users if relevant.] - -## Success Criteria - -[How do we know this is working? What metrics matter? -Mix of user success signals and business objectives. Be measurable.] - -## Scope - -[What's in for the first version? What's explicitly out? -Keep this tight — it's a boundary document, not a feature list.] - -## Vision - -[Where does this go if it succeeds? What does it become in 2-3 years? -Inspiring but grounded.] -``` - -## Adaptation Guidelines - -- **For B2B products:** Consider adding a "Buyer vs User" section if they're different people -- **For platforms/marketplaces:** Consider a "Network Effects" or "Ecosystem" section -- **For technical products:** May need a brief "Technical Approach" section (keep it high-level) -- **For regulated industries:** Consider a "Compliance & Regulatory" section -- **If scope is well-defined:** Merge "Scope" and "Vision" into "Roadmap Thinking" -- **If the problem is well-known:** Shorten "The Problem" and expand "What Makes This Different" - -The brief should be 1-2 pages. If it's longer, you're putting in too much detail — that's what the distillate is for. diff --git a/samples/bmad-excalidraw/SKILL.md b/samples/bmad-excalidraw/SKILL.md index 33d0cb0..8ca1b18 100644 --- a/samples/bmad-excalidraw/SKILL.md +++ b/samples/bmad-excalidraw/SKILL.md @@ -42,19 +42,19 @@ Produce professional diagrams and visual aids as Excalidraw files through conver 3. **Detect diagram intent from user's request:** - What do they want to visualize? 
- - Did they specify a diagram type? If so, validate against `resources/diagram-types.md` + - Did they specify a diagram type? If so, validate against `references/diagram-types.md` - Did they specify enough detail to skip guided design? 4. **Route by mode:** - - Autonomous/YOLO → `prompts/diagram-generation.md` directly - - Guided → `prompts/guided-design.md` first, then `prompts/diagram-generation.md` + - Autonomous/YOLO → `diagram-generation.md` directly + - Guided → `guided-design.md` first, then `diagram-generation.md` ## Stages | # | Stage | Purpose | Prompt | |---|-------|---------|--------| -| 1 | Guided Design | Creative facilitation — brainstorm diagram type, content, layout | `prompts/guided-design.md` | -| 2 | Generation | Produce the `.excalidraw` file with proper layout | `prompts/diagram-generation.md` | +| 1 | Guided Design | Creative facilitation — brainstorm diagram type, content, layout | `guided-design.md` | +| 2 | Generation | Produce the `.excalidraw` file with proper layout | `diagram-generation.md` | Headless: skip guided-design, output file path on completion. 
diff --git a/samples/bmad-excalidraw/bmad-manifest.json b/samples/bmad-excalidraw/bmad-manifest.json index 3134061..f628727 100644 --- a/samples/bmad-excalidraw/bmad-manifest.json +++ b/samples/bmad-excalidraw/bmad-manifest.json @@ -5,14 +5,14 @@ "menu-code": "GD", "description": "Facilitates diagram design through conversational discovery.", "supports-headless": true, - "prompt": "prompts/guided-design.md" + "prompt": "guided-design.md" }, { "name": "diagram-generation", "menu-code": "DG", "description": "Generates Excalidraw diagram files from specifications.", "supports-headless": true, - "prompt": "prompts/diagram-generation.md" + "prompt": "diagram-generation.md" } ] } diff --git a/samples/bmad-excalidraw/prompts/diagram-generation.md b/samples/bmad-excalidraw/diagram-generation.md similarity index 97% rename from samples/bmad-excalidraw/prompts/diagram-generation.md rename to samples/bmad-excalidraw/diagram-generation.md index 4228555..79b3584 100644 --- a/samples/bmad-excalidraw/prompts/diagram-generation.md +++ b/samples/bmad-excalidraw/diagram-generation.md @@ -9,7 +9,7 @@ Generate a valid `.excalidraw` file from the diagram specification. Use the sche ## Step 1: Build the Diagram Specification -Create a JSON specification that the generation script can consume. Load `resources/excalidraw-schema.md` for the element format reference. +Create a JSON specification that the generation script can consume. Load `references/excalidraw-schema.md` for the element format reference. 
The specification format: diff --git a/samples/bmad-excalidraw/prompts/guided-design.md b/samples/bmad-excalidraw/guided-design.md similarity index 94% rename from samples/bmad-excalidraw/prompts/guided-design.md rename to samples/bmad-excalidraw/guided-design.md index 81122d0..7990c53 100644 --- a/samples/bmad-excalidraw/prompts/guided-design.md +++ b/samples/bmad-excalidraw/guided-design.md @@ -15,7 +15,7 @@ Capture any details they've already provided — don't re-ask what they've told ## Step 2: Suggest Diagram Type -Load `resources/diagram-types.md` for the full catalog. +Load `references/diagram-types.md` for the full catalog. Based on what you know, suggest the best-fit diagram type(s) with reasoning: @@ -77,4 +77,4 @@ Ask: "Ready to generate, or want to adjust anything?" ## Progression -When the user confirms → proceed to `prompts/diagram-generation.md` with the complete specification. +When the user confirms → proceed to `diagram-generation.md` with the complete specification. diff --git a/samples/bmad-excalidraw/resources/diagram-types.md b/samples/bmad-excalidraw/references/diagram-types.md similarity index 100% rename from samples/bmad-excalidraw/resources/diagram-types.md rename to samples/bmad-excalidraw/references/diagram-types.md diff --git a/samples/bmad-excalidraw/resources/excalidraw-schema.md b/samples/bmad-excalidraw/references/excalidraw-schema.md similarity index 100% rename from samples/bmad-excalidraw/resources/excalidraw-schema.md rename to samples/bmad-excalidraw/references/excalidraw-schema.md diff --git a/samples/planning-artifacts/bmad-distillation-generator-spec.md b/samples/planning-artifacts/bmad-distillation-generator-spec.md deleted file mode 100644 index c11e750..0000000 --- a/samples/planning-artifacts/bmad-distillation-generator-spec.md +++ /dev/null @@ -1,347 +0,0 @@ ---- -title: "Skill Spec: bmad-distillation-generator" -status: "complete" -created: "2026-03-13" -purpose: "Full specification for the skill builder agent to 
implement" -reference-skill: "bmad-bmm-product-brief-preview (use as architectural pattern)" ---- - -# bmad-distillation-generator — Skill Specification - -## Purpose - -A general-purpose utility skill that takes any set of input documents and produces a single hyper-compressed, token-efficient document (a "distillate") that an LLM can consume as sole context input for downstream workflows. The distillate is **lossless compression for an LLM reader** — every fact, decision, constraint, and relationship from the source documents is preserved, but all overhead that humans need and LLMs don't is stripped. - -This is a compression task, not a capture task. The skill assumes all relevant information has already been captured in the source documents. Its job is to produce the most token-efficient representation possible without losing signal. - -## What a Distillate Is - -A distillate is NOT a summary. Summaries are lossy — they capture the gist but drop detail. A distillate preserves all detail through lossless compression optimized for LLM consumption: - -- Every fact, decision, and constraint appears exactly once -- No prose transitions, rhetoric, or persuasion -- No repetition — deduplicated across all source documents -- No formatting for human scannability (decorative bold, whitespace for visual breathing room) -- No explaining things an LLM already knows (common terms, well-known companies, standard concepts) -- No hedging language ("we believe", "it's likely that") — state the signal directly -- Relationships between items are explicit, not implied -- Each item carries enough context to be understood without the source documents -- Rejected ideas and open questions are preserved — they prevent downstream re-proposal - -**Format:** Dense thematically-grouped bullets. Markdown structure for hierarchy only (## for themes, - for items). No decorative formatting. Every token carries signal. 
- -## Activation & Inputs - -### Required Inputs -- **source_documents** — One or more file paths or inline content to distill - -### Optional Inputs -- **downstream_consumer** — What workflow or agent will consume this distillate (e.g., "PRD creation", "architecture design", "story implementation"). When provided, the compressor uses this to judge what's signal vs noise. When omitted, preserve everything — no filtering. -- **token_budget** — Approximate target size. When provided and the distillate would exceed it, trigger semantic splitting. When omitted, produce the smallest possible single document. -- **output_path** — Where to save the distillate. When omitted, save adjacent to the primary source document with `-distillate.md` suffix. - -### Flags -- **--validate** — After producing the distillate, run a round-trip reconstruction test (see Validation section below) - -### Activation Modes -- **Direct invocation:** User calls the skill with inputs -- **Called by another skill:** Other BMAD skills (product brief, PRD, architecture) can invoke this as a final step after producing their primary document + discovery notes - -## Skill Architecture - -``` -bmad-distillation-generator/ - SKILL.md # Entry point, input validation, routing - agents/ - distillate-compressor.md # Core compression agent - round-trip-reconstructor.md # Validation: reconstructs source docs from distillate - prompts/ - compression-rules.md # The compression ruleset (shared reference) - splitting-strategy.md # Semantic splitting logic for large inputs - resources/ - distillate-format-reference.md # Format examples showing before/after -``` - -## Stages - -| # | Stage | Purpose | Location | -|---|-------|---------|----------| -| 1 | Validate & Analyze | Validate inputs, assess total size, detect document types | SKILL.md | -| 2 | Compress | Fan out compressor agent(s), produce distillate | agents/distillate-compressor.md | -| 3 | Verify & Output | Structured completeness check, save output | 
SKILL.md | -| 4 | Round-Trip Validation | (optional, --validate flag) Reconstruct sources from distillate, diff against originals | agents/round-trip-reconstructor.md | - -### Stage 1: Validate & Analyze (SKILL.md) - -1. **Validate inputs exist and are readable.** If source documents are paths, read them. If inline content, accept directly. - -2. **Assess total input size.** Count approximate tokens across all source documents. - -3. **Detect document types.** Understand what each source document is (product brief, discovery notes, research report, architecture doc, PRD, etc.) — this informs how to group themes in the output. - -4. **Determine splitting need.** If total input is large (heuristic: source documents collectively exceed ~15,000 tokens of content) AND no token_budget is set, warn the user that the distillate may be large and offer to split semantically. If token_budget is set and would require splitting, proceed automatically. - -5. **Route to Stage 2.** Pass all source content, downstream_consumer context, and splitting decision to the compressor. - -### Stage 2: Compress (agents/distillate-compressor.md) - -The compressor agent is the core of this skill. It receives all source document content and produces the distillate. - -**Compression process:** - -1. **Extract all discrete facts, decisions, constraints, requirements, relationships, rejected ideas, and open questions** from all source documents. Treat this as entity extraction — pull out every distinct piece of information. - -2. **Deduplicate ruthlessly.** If the same fact appears in the brief's executive summary AND the discovery notes' technical context, it appears once in the distillate. Choose the version with the most context. - -3. **Apply downstream filtering** (only if downstream_consumer is specified). For each extracted item, ask: "Would the downstream workflow need this?" Drop items that are clearly irrelevant to the stated consumer. When uncertain, keep. - -4. 
**Group thematically.** Organize items into coherent themes derived from the source content — not from a fixed template. The themes should reflect what the documents are actually about. Common groupings: core concept, problem/motivation, solution/approach, users/segments, technical decisions, constraints, scope boundaries, competitive context, rejected alternatives, open questions, risks. - -5. **Compress language.** For each item: - - Strip prose transitions and connective tissue - - Remove hedging and rhetoric - - Remove explanations of common knowledge - - Preserve specific details (numbers, names, versions, dates) - - Ensure the item is self-contained (understandable without reading the source) - - Make relationships explicit ("X because Y", "X blocks Y", "X replaces Y") - -6. **Apply the compression rules** from `prompts/compression-rules.md` as a final pass. - -**If semantic splitting is required:** - -7. **Identify natural semantic boundaries** in the source content. These are NOT arbitrary size breaks — they are coherent topic clusters that a downstream workflow might load independently. - -8. **Produce a root distillate** that contains: a 3-5 bullet orientation (what was distilled, for whom, how many parts), cross-references to section distillates, and any items that span multiple sections. - -9. **Produce section distillates**, each self-sufficient — a reader loading only one section should understand it without the others. Include a 1-line context header: "This section covers [topic]. Part N of M from [source document names]." - -### Stage 3: Verify & Output (SKILL.md) - -After the compressor returns: - -1. **Structured completeness check.** Extract all Level 2+ headings and key named entities (products, people, technologies, decisions) from the source documents. Verify each appears in the distillate. If gaps are found, send them back to the compressor for a targeted fix pass — not a full recompression. - -2. 
**Format check.** Verify the output follows distillate format rules: - - No prose paragraphs (only bullets) - - No decorative formatting - - No repeated information - - Each bullet is self-contained - - Themes are clearly delineated - -3. **Save output.** Write the distillate to the output path. Use frontmatter: - -```yaml ---- -type: bmad-distillate -sources: - - "{source file 1}" - - "{source file 2}" -downstream_consumer: "{consumer or 'general'}" -created: "{timestamp}" -token_estimate: {approximate token count} -parts: {1 or N if split} ---- -``` - -4. **Report to user or calling skill.** Return the file path(s) and a one-line confirmation. If called by another skill, return structured output: - -```json -{ - "status": "complete", - "distillate": "{path}", - "section_distillates": ["{path1}", "{path2}"] or null, - "token_estimate": N, - "source_documents": ["{path1}", "{path2}"], - "completeness_check": "pass" or "pass_with_additions" -} -``` - -## Compression Rules (for prompts/compression-rules.md) - -These rules govern how text is compressed. They are the core IP of this skill. - -### Strip — Remove entirely -- Prose transitions: "As mentioned earlier", "It's worth noting", "In addition to this" -- Rhetoric and persuasion: "This is a game-changer", "The exciting thing is" -- Hedging: "We believe", "It's likely that", "Perhaps", "It seems" -- Self-reference: "This document describes", "As outlined above" -- Common knowledge explanations: "Vercel is a cloud platform company", "MIT is an open-source license" -- Repeated introductions of the same concept -- Section transition paragraphs -- Formatting-only elements (decorative bold/italic for emphasis, horizontal rules for visual breaks) - -### Preserve — Keep always -- Specific numbers, dates, versions, percentages -- Named entities (products, companies, people, technologies) -- Decisions made and their rationale (compressed: "Decision: X. Reason: Y") -- Rejected alternatives and why (compressed: "Rejected: X. 
Reason: Y") -- Explicit constraints and non-negotiables -- Dependencies and ordering relationships -- Open questions and unresolved items -- Scope boundaries (in/out/deferred) -- Success criteria and how they're validated -- User segments and what success means for each - -### Transform — Change form for efficiency -- Long prose paragraphs → single dense bullet capturing the same information -- "We decided to use X because Y and Z" → "X (rationale: Y, Z)" -- Repeated category labels → group under a single heading, no per-item labels -- "Risk: ... Severity: high" → "HIGH RISK: ..." -- Conditional statements → "If X → Y" form -- Multi-sentence explanations → semicolon-separated compressed form - -### Deduplication Rules -- Same fact in multiple documents → keep the version with most context -- Same concept at different detail levels → keep the detailed version -- Overlapping lists → merge into single list, no duplicates -- When source documents disagree → note the conflict explicitly: "Brief says X; discovery notes say Y — unresolved" - -## Format Reference (for resources/distillate-format-reference.md) - -### Before (human-readable brief excerpt) -``` -## What Makes This Different - -**The anti-fragmentation layer.** The AI tooling space is fracturing across 40+ -platforms with no shared methodology layer. BMAD is uniquely positioned to be the -cross-platform constant — the structured approach that works the same in Cursor, -Claude Code, Windsurf, Copilot, and whatever launches next month. Every other -methodology or skill framework maintains its own platform support matrix. By -building on the open-source skills CLI ecosystem, BMAD offloads the highest-churn -maintenance burden and focuses on what actually differentiates it: the methodology -itself. 
-``` - -### After (distillate) -``` -## Differentiation -- Anti-fragmentation positioning: BMAD = cross-platform constant across 40+ fragmenting AI tools; no competitor provides shared methodology layer -- Platform complexity delegated to Vercel skills CLI ecosystem (MIT); BMAD maintains methodology, not platform configs -``` - -### Before (discovery notes excerpt) -``` -## Competitive Landscape - -- **Vercel Skills.sh**: 83K+ skills, 18 agents, largest curated leaderboard — - but dev-only, skills trigger unreliably (20% without explicit prompting) -- **SkillsMP**: 400K+ skills directory, pure aggregator with no curation or CLI -- **ClawHub/OpenClaw**: ~3.2K curated skills with versioning/rollback, small ecosystem -- **Lindy**: No-code AI agent builder for business automation — closed platform, - no skill sharing -- **Microsoft Copilot Studio**: Enterprise no-code agent builder — vendor-locked - to Microsoft -- **MindStudio**: No-code AI agent platform — siloed, no interoperability -- **Make/Zapier AI**: Workflow automation adding AI agents — workflow-centric, - not methodology-centric -- **Key gap**: NO competitor combines structured methodology with plugin - marketplace — this is BMAD's whitespace -``` - -### After (distillate) -``` -## Competitive Landscape -- No competitor combines structured methodology + plugin marketplace (whitespace) -- Skills.sh (Vercel): 83K skills, 18 agents, dev-only, 20% trigger reliability -- SkillsMP: 400K skills, aggregator only, no curation/CLI -- ClawHub: 3.2K curated, versioning, small ecosystem -- No-code platforms (Lindy, Copilot Studio, MindStudio, Make/Zapier): closed/siloed, no skill portability, business-only -``` - -## Stage 4: Round-Trip Validation (agents/round-trip-reconstructor.md) - -**Triggered by:** `--validate` flag. Optional. Not run by default. 
- -**Purpose:** Prove the distillate is lossless by reconstructing the original source documents from the distillate alone, then diffing against the originals to surface any information loss. - -### Process - -1. **The reconstructor agent receives ONLY the distillate.** It has no access to the original source documents. This is critical — if it could see the originals, the test is meaningless. - -2. **Detect source document types from the distillate's frontmatter.** The `sources` field lists what was distilled. The reconstructor uses the document type (product brief, discovery notes, architecture doc, etc.) to understand what kind of document to reconstruct. - -3. **Reconstruct each source document.** For each source listed in frontmatter, produce a full human-readable document from the distillate's content alone. The reconstruction should: - - Use appropriate prose, structure, and formatting for the document type - - Include all sections the original would have had - - Not invent information — only use what's in the distillate - - Flag any places where the distillate felt insufficient with `[POSSIBLE GAP]` markers - -4. **Save reconstructions** as temporary files adjacent to the distillate with `-reconstruction-{N}.md` suffixes. - -### Diff Analysis (back in SKILL.md) - -After the reconstructor returns, the main skill performs the diff: - -1. **Read both the original source documents and the reconstructions.** - -2. **Semantic diff, not text diff.** Don't compare prose word-for-word — compare information content. For each section of the original, ask: - - Is the core information present in the reconstruction? - - Are specific details preserved (numbers, names, decisions)? - - Are relationships and rationale intact? - - Did the reconstruction add anything not in the original? (indicates hallucination to fill gaps) - -3. 
**Produce a validation report** saved adjacent to the distillate as `-validation-report.md`: - -```markdown ---- -type: distillate-validation -distillate: "{distillate path}" -sources: ["{source paths}"] -created: "{timestamp}" ---- - -## Validation Summary -- Status: PASS | PASS_WITH_WARNINGS | FAIL -- Information preserved: {percentage estimate} -- Gaps found: {count} -- Hallucinations detected: {count} - -## Gaps (information in originals but missing from reconstruction) -- {gap description} — Source: {which original}, Section: {where} - -## Hallucinations (information in reconstruction not traceable to originals) -- {hallucination description} — appears to fill gap in: {section} - -## Possible Gap Markers (flagged by reconstructor) -- {marker description} -``` - -4. **If gaps are found**, offer to run a targeted fix pass on the distillate — adding the missing information without full recompression. - -5. **Clean up** — delete the temporary reconstruction files after the report is generated (the report preserves the findings). - -### When to use --validate - -- During development/testing of the distillation generator itself -- When distilling critical documents where information loss is unacceptable (architecture decisions, compliance-relevant specs) -- As a quality gate before handing off a distillate to a high-stakes downstream workflow -- NOT for routine use — it adds significant token cost (full reconstruction + diff analysis) - -## Design Rationale - -### Why a separate skill, not inline in each workflow? -Compression is a distinct competency. The agent producing a brief is optimized for collaborative discovery and persuasive writing. The agent producing a distillate is optimized for ruthless information extraction and deduplication. Separating them means each can be excellent at its job. It also means any BMAD workflow can call the same distillation skill — briefs, PRDs, architecture docs, research reports — without each reimplementing compression logic. 
- -### Why self-check instead of a separate validator? -The completeness check is mechanical (does each heading/entity from source appear in output?) not judgmental. A checklist-based self-audit is reliable for this task. A separate validator agent adds a round-trip and token cost for marginal benefit. If the downstream workflow finds gaps, it can always read the source documents directly — the distillate is an optimization, not a single point of failure. A validator can be added later if needed. - -### Why round-trip reconstruction for validation? -The strongest proof of lossless compression is reconstruction. If an LLM reading only the distillate can reproduce both source documents with no meaningful information loss, the distillate is complete. The delta between originals and reconstructions is a precise quality metric: missing information = compression loss, added information = hallucination filling gaps (which also flags where the distillate was too terse). This is more rigorous than any checklist-based approach because it tests actual recoverability, not just presence of keywords. - -### Why semantic splitting instead of size-based? -Arbitrary splits (every N tokens) break coherence. A downstream workflow loading "part 2 of 4" of a size-split distillate gets context fragments. Semantic splits produce self-contained topic clusters that a workflow can load selectively — "give me just the technical decisions section" — which is more useful and more token-efficient for the consumer. 
- -## Integration Points - -### As a standalone skill -User invokes directly: "distill these documents for PRD creation" or "create a distillate of the architecture doc" - -### Called by other BMAD skills -Any skill that produces a primary document + discovery notes can call this as a final optional step: -- Product Brief → offers distillate for PRD creation -- PRD → offers distillate for architecture/story creation -- Architecture → offers distillate for implementation stories -- Research Reports → offers distillate for brief/PRD input - -### Calling convention for other skills -Other skills invoke this by telling the LLM to run the `bmad-distillation-generator` skill with the source documents and downstream consumer specified. The distillation skill handles everything else and returns the output path. diff --git a/samples/planning-artifacts/product-brief-bmad-next-gen-installer-discovery-notes.md b/samples/planning-artifacts/product-brief-bmad-next-gen-installer-discovery-notes.md deleted file mode 100644 index 87d92a9..0000000 --- a/samples/planning-artifacts/product-brief-bmad-next-gen-installer-discovery-notes.md +++ /dev/null @@ -1,120 +0,0 @@ ---- -title: "Discovery Notes: BMAD Next-Gen Installer" -type: discovery-notes -source: "product-brief-bmad-next-gen-installer.md" -created: "2026-03-12" -purpose: "Detailed supporting context captured during product brief discovery" ---- - -## Current Installer Architecture (Migration Context) - -- Entry point: `tools/cli/bmad-cli.js` using Commander.js, routes install/uninstall/status commands -- Core installer: `tools/cli/installers/lib/core/installer.js` orchestrates all installation -- Platform configs: `tools/cli/installers/lib/ide/platform-codes.yaml` defines ~20 platforms with target dirs, legacy dirs, template types, and special flags (ancestor conflict checks, skill format toggles) -- Manifest generation: produces CSV files (`skill-manifest.csv`, `workflow-manifest.csv`, `agent-manifest.csv`) — these are the 
current source of truth, NOT the JSON manifests -- External modules: `tools/cli/commands/external-official-modules.yaml` lists official modules (CIS, GDS, TEA, WDS) installed from npm with semver -- Dependency resolution: 4-pass system (collect primary files, parse deps, resolve paths, resolve transitive) — limited to YAML-declared deps -- Config collection: prompts user for name, communication language, document output language, output folder path -- Current install directory structure: `_bmad/` for core files, `._config/` for manifests, plus per-IDE skill directories (`.claude/skills/`, `.cursor/skills/`, etc.) -- Supports install, update, quick-update, and compile-agents actions -- Custom modules supported via file paths in addition to npm packages - -## Existing Skill/Manifest Primitives (Already Partially Built) - -- Skills already use directory-per-skill layout: `skill-name/SKILL.md` with frontmatter (name, description) -- `bmad-manifest.json` sidecar files already exist alongside skills — example from product-brief skill: `{"module-code": "bmm", "replaces-skill": "bmad-create-product-brief", "capabilities": [{"name": "create-brief", "menu-code": "CB", "description": "...", "supports-headless": true, "phase-name": "1-analysis", "after": ["brainstorming"], "before": ["create-prd"], "is-required": true, "output-location": "{planning_artifacts}"}]}` -- `bmad-skill-manifest.yaml` files define `canonicalId` and artifact type in source -- The gap: JSON manifests exist but CSV remains single source of truth; no runtime scanning/registration; manifests are static, generated once at install - -## Vercel Skills CLI Technical Details - -- CLI tool: `npx skills add <source>` — installs from GitHub repos, GitLab, local paths, git URLs -- Supports 40+ agents with per-agent path mappings (Claude Code: `.claude/skills/`, Cursor: `.cursor/skills/`, etc.) 
-- Installation methods: symlinks (recommended) or copies -- Scope: project-level (shared via git) or global (user-wide) -- Discovery: scans `skills/`, `.agents/skills/`, agent-specific paths, and `.claude-plugin/marketplace.json` manifests -- Recognizes Anthropic plugin marketplace format: `{"metadata": {"pluginRoot": "./plugins"}, "plugins": [{"name": "my-plugin", "skills": ["./skills/review"]}]}` -- Key commands: add, list, find, remove, check, update, init -- Supports interactive selection or non-interactive CI/CD flags (`-y`, `--all`) -- MIT licensed, backed by Vercel - -## Competitive Landscape - -- **Vercel Skills.sh**: 83K+ skills, 18 agents, largest curated leaderboard — but dev-only, skills trigger unreliably (20% without explicit prompting) -- **SkillsMP**: 400K+ skills directory, pure aggregator with no curation or CLI -- **ClawHub/OpenClaw**: ~3.2K curated skills with versioning/rollback, small ecosystem -- **Lindy**: No-code AI agent builder for business automation — closed platform, no skill sharing -- **Microsoft Copilot Studio**: Enterprise no-code agent builder — vendor-locked to Microsoft -- **MindStudio**: No-code AI agent platform — siloed, no interoperability -- **Make/Zapier AI**: Workflow automation adding AI agents — workflow-centric, not methodology-centric -- **Key gap**: NO competitor combines structured methodology with plugin marketplace — this is BMAD's whitespace - -## Market Context - -- AI agent market: $7.84B in 2025, projected $52.62B by 2030 -- Agent Skills spec is ~4 months old, ecosystem grew from thousands to 351K+ skills in that time -- Three standards converging under Linux Foundation's AAIF: MCP (tool integration, 97M monthly SDK downloads), AGENTS.md (project instructions), A2A (agent-to-agent communication) -- Skills quality crisis: 13.4% have critical vulnerabilities (Snyk study); most community skills are "AI slop" -- Skill activation reliability is a known problem: 20% trigger rate without explicit prompting — BMAD's 
structured invocation patterns may be an advantage here -- BMAD already has established presence: GitHub repo, npm package, docs site, organic coverage on DEV.to and Medium - -## User & Distribution Requirements Captured - -- NPX installer should still exist for technical users, potentially wrapping Vercel skills CLI -- Non-technical path: download zip, get platform-specific README, copy skills to folder, run bmad-init -- No requirement for Node.js, Git, or terminal for the non-technical path -- Install messages (like current installer shows) are valued — NPX path should preserve this UX -- Users may share bundles peer-to-peer, not just from marketplace -- Marketplace initially just a download button + zip + README popup with instructions -- As low-code platforms mature, provide better per-platform guidance — but this is an emerging space, we're betting on the future - -## Technical Decisions & Constraints - -- Adopt Anthropic plugin standard as base format (what Vercel uses) -- `bmad-manifest.json` extends the base standard for BMAD-specific needs (installer options, capabilities, help system integration, phase ordering, dependency declarations) -- bmad-init must always be included as a base skill in every bundle/install (solves bootstrapping problem) -- Vercel CLI integration pattern (wrap vs fork vs call) is a PRD/architecture decision -- Manifest format stability is critical once third-party authors publish against it — needs careful upfront design -- Migration from current CSV-based manifests to JSON-based runtime scanning is a key technical shift - -## Quality & Curation Model - -- All plugin submissions will be gated — not an open bazaar -- Human review by BMad and core team personally -- This is a key differentiator: curated quality vs ecosystem noise -- Certification process details are out of scope for the brief, but gated-submission is a core architectural requirement -- The quality gate becomes MORE valuable over time as the broader ecosystem gets 
noisier - -## Scope Signals (In/Out/Maybe for PRD) - -- **In**: manifest spec, bmad-init, bmad-update, Vercel CLI integration, NPX installer, zip bundles, migration path -- **Out**: BMAD Builder, marketplace web platform, skill conversion work, one-click install for all platforms, monetization -- **Maybe/Future**: deeper platform-specific integrations for non-technical users, CI/CD integration (bmad-init as GitHub Action one-liner), telemetry/usage analytics for module authors, offline/air-gapped enterprise install story, integrity verification for zip bundles (checksums/signing) - -## Rejected Ideas / Decisions Made - -- **Not building our own platform support matrix going forward** — delegating to Vercel skills CLI ecosystem. Rationale: maintaining 20+ platform configs is the biggest maintenance burden; it's unsustainable at 40+ -- **Not requiring one-click install for non-technical users in v1** — emerging space, guidance-based for now. Rationale: we don't know what all the low-code platforms will be; better to provide good READMEs and improve over time -- **Not using existing roadmap or prior brainstorming** — starting fresh for this initiative. Rationale: BMad wanted a clean vision unconstrained by previous planning - -## Open Questions for PRD - -- Exact Vercel skills CLI integration pattern: wrap as subprocess? Fork and bundle? Use as a library? Peer dependency? -- How does bmad-update work technically? Diff-based? Full replacement? Does it preserve user customizations? -- What's the migration story for existing users? Migration command? Manual reinstall? Compatibility shim? -- How do we test installation correctness across 40+ platforms? CI matrix for top N? Community testing? -- Should bmad-manifest.json be proposed as an open standard to Agent Skills governance? -- How do we handle platforms NOT supported by the Vercel skills CLI? -- What's the manifest versioning strategy? How do we evolve the format without breaking existing plugins? 
-- What does the plugin author getting-started experience look like? What tooling do they need? - -## Reviewer Insights Worth Preserving - -- **Opportunity**: Module authors are an acquisition channel — every published plugin is a distribution event bringing the creator's audience into the ecosystem -- **Opportunity**: CI/CD integration (bmad-init as a pipeline one-liner) makes BMAD part of repo infrastructure, dramatically increasing stickiness -- **Opportunity**: Educational institutions are an overlooked segment — structured methodology + non-technical install maps onto university AI curriculum -- **Opportunity**: Skill composability as a first-class primitive — letting users mix BMAD modules with third-party skills for custom methodology stacks -- **Risk**: Manifest format evolution creates a versioning/compatibility matrix — once third-party authors publish, changes break plugins (same maintenance burden in a new form) -- **Risk**: "Methodology-backed quality" needs to be a defined process, not just a claim — the gated review model addresses this -- **Risk**: Platform proliferation means 40+ testing environments, even with Vercel handling translation -- **Risk**: Scope creep pressure from marketplace vision — brief explicitly excludes it but it's the primary long-term value diff --git a/samples/planning-artifacts/product-brief-bmad-next-gen-installer-distillate.md b/samples/planning-artifacts/product-brief-bmad-next-gen-installer-distillate.md deleted file mode 100644 index 2f01674..0000000 --- a/samples/planning-artifacts/product-brief-bmad-next-gen-installer-distillate.md +++ /dev/null @@ -1,109 +0,0 @@ ---- -type: bmad-distillate -sources: - - "product-brief-bmad-next-gen-installer.md" - - "product-brief-bmad-next-gen-installer-discovery-notes.md" -downstream_consumer: "PRD creation" -created: "2026-03-13" ---- - -## Core Concept -- BMAD Next-Gen Installer: replaces monolithic Node.js CLI with skill-based plugin architecture for distributing BMAD 
methodology across 40+ AI platforms -- Three layers: self-describing plugins (bmad-manifest.json), cross-platform install via Vercel skills CLI (MIT), runtime registration via bmad-init skill -- Transforms BMAD from dev-only methodology into open platform for any domain (creative, therapeutic, educational, personal) - -## Problem -- Current installer maintains ~20 platform configs manually; each platform convention change requires installer update, test, release — largest maintenance burden on team -- Node.js/npm required — blocks non-technical users on UI-based platforms (Claude Co-Work, etc.) -- CSV manifests are static, generated once at install; no runtime scanning/registration -- Unsustainable at 40+ platforms; new tools launching weekly - -## Solution Architecture -- Plugins: skill bundles with Anthropic plugin standard as base format + bmad-manifest.json extending for BMAD-specific metadata (installer options, capabilities, help integration, phase ordering, dependencies) -- Existing manifest example: `{"module-code":"bmm","replaces-skill":"bmad-create-product-brief","capabilities":[{"name":"create-brief","menu-code":"CB","supports-headless":true,"phase-name":"1-analysis","after":["brainstorming"],"before":["create-prd"],"is-required":true}]}` -- Vercel skills CLI handles platform translation; integration pattern (wrap/fork/call) is PRD decision -- bmad-init: global skill scanning installed bmad-manifest.json files, registering capabilities, configuring project settings; always included as base skill in every bundle (solves bootstrapping) -- bmad-update: plugin update path without full reinstall; technical approach (diff/replace/preserve customizations) is PRD decision -- Distribution tiers: (1) NPX installer wrapping skills CLI for technical users, (2) zip bundle + platform-specific README for non-technical users, (3) future marketplace -- Non-technical path has honest friction: "copy to right folder" requires knowing where that folder is; per-platform 
README instructions for common tools; improves over time as low-code space matures - -## Differentiation -- Anti-fragmentation: BMAD = cross-platform constant; no competitor provides shared methodology layer across AI tools -- Curated quality: all submissions gated, human-reviewed by BMad + core team personally; 13.4% of community skills have critical vulnerabilities (Snyk 2026); quality gate value increases as ecosystem gets noisier -- Domain-agnostic: no competitor builds beyond software dev workflows; same plugin system powers any domain via BMAD Builder (separate initiative) - -## Users (ordered by v1 priority) -- Module authors (primary v1): package/test/distribute plugins independently without installer changes -- Developers: single-command install on any of 40+ platforms via NPX -- Non-technical users: install without Node/Git/terminal; emerging segment including PMs, designers, educators -- Future plugin creators: non-dev authors using BMAD Builder; need distribution without building own installer - -## Success Criteria -- Zero (or near-zero) custom platform directory code; delegated to skills CLI ecosystem -- Installation verified on top platforms by volume; skills CLI handles long tail -- Non-technical install path validated with non-developer users -- bmad-init discovers/registers all plugins from manifests; clear errors for malformed manifests -- At least one external module author successfully publishes plugin using manifest system -- bmad-update works without full reinstall -- Existing CLI users have documented migration path - -## Scope -- In: manifest spec, bmad-init, bmad-update, Vercel CLI integration, NPX installer, zip bundles, migration path -- Out: BMAD Builder, marketplace web platform, skill conversion (prerequisite, separate), one-click install for all platforms, monetization, quality certification process (gated-submission principle is architectural requirement; process defined separately) -- Deferred: CI/CD integration, telemetry for 
module authors, air-gapped enterprise install, zip bundle integrity verification (checksums/signing), deeper non-technical platform integrations - -## Current Installer (migration context) -- Entry: `tools/cli/bmad-cli.js` (Commander.js) → `tools/cli/installers/lib/core/installer.js` -- Platforms: `tools/cli/installers/lib/ide/platform-codes.yaml` (~20 platforms with target dirs, legacy dirs, template types, special flags) -- Manifests: CSV files (skill/workflow/agent-manifest.csv) are current source of truth, not JSON -- External modules: `external-official-modules.yaml` (CIS, GDS, TEA, WDS) from npm with semver -- Dependencies: 4-pass resolver (collect → parse → resolve → transitive); YAML-declared only -- Config: prompts for name, communication language, document output language, output folder -- Actions: install, update, quick-update, compile-agents -- Skills already use directory-per-skill layout (skill-name/SKILL.md); bmad-manifest.json sidecars already exist but are not source of truth -- Key shift: CSV-based static manifests → JSON-based runtime scanning - -## Vercel Skills CLI -- `npx skills add <source>` — GitHub, GitLab, local paths, git URLs -- 40+ agents; per-agent path mappings; symlinks (recommended) or copies -- Scopes: project-level or global -- Discovery: `skills/`, `.agents/skills/`, agent-specific paths, `.claude-plugin/marketplace.json` -- Commands: add, list, find, remove, check, update, init -- Non-interactive: `-y`, `--all` flags for CI/CD - -## Competitive Landscape -- No competitor combines structured methodology + plugin marketplace (whitespace) -- Skills.sh (Vercel): 83K skills, dev-only, 20% trigger reliability without explicit prompting -- SkillsMP: 400K skills, aggregator only, no curation -- ClawHub: 3.2K curated, versioning, small -- No-code platforms (Lindy, Copilot Studio, MindStudio, Make/Zapier): closed/siloed, no skill portability, business-only -- Market: $7.84B (2025) → $52.62B (2030); Agent Skills spec ~4 months old, 351K+ skills; 
standards converging under Linux Foundation AAIF (MCP, AGENTS.md, A2A) -- BMAD's structured invocation patterns may advantage vs 20% trigger reliability problem - -## Rejected Alternatives -- Building own platform support matrix: unsustainable at 40+; delegate to Vercel ecosystem -- One-click install for non-technical v1: emerging space; guidance-based, improve over time -- Prior roadmap/brainstorming: clean start, unconstrained by previous planning - -## Open Questions -- Vercel CLI integration pattern: wrap/fork/call/peer dependency? -- bmad-update mechanics: diff/replace? Preserve user customizations? -- Migration story: command/manual reinstall/compatibility shim? -- Cross-platform testing: CI matrix for top N? Community testing for rest? -- bmad-manifest.json as open standard submission to Agent Skills governance? -- Platforms NOT supported by Vercel skills CLI? -- Manifest versioning strategy for backward compatibility? -- Plugin author getting-started experience and tooling? - -## Opportunities (from review) -- Module authors as acquisition channel: each published plugin distributes BMAD to creator's audience -- CI/CD integration: bmad-init as pipeline one-liner increases stickiness -- Educational institutions: structured methodology + non-technical install → university AI curriculum -- Skill composability: mixing BMAD modules with third-party skills for custom methodology stacks - -## Risks -- Manifest format evolution creates versioning/compatibility burden once third-party authors publish -- Quality gate needs defined process, not just claim — gated review model addresses -- 40+ platform testing environments even with Vercel handling translation -- Scope creep pressure from marketplace vision (explicitly excluded but primary long-term value) -- Vercel dependency: minor supply-chain risk; MIT license allows fork if deprioritized; supporting many platforms is core differentiator regardless diff --git 
a/samples/planning-artifacts/product-brief-bmad-next-gen-installer.md b/samples/planning-artifacts/product-brief-bmad-next-gen-installer.md deleted file mode 100644 index cca4f7c..0000000 --- a/samples/planning-artifacts/product-brief-bmad-next-gen-installer.md +++ /dev/null @@ -1,96 +0,0 @@ ---- -title: "Product Brief: BMAD Next-Gen Installer" -status: "complete" -created: "2026-03-12" -updated: "2026-03-12" -inputs: - - "User brain dump (BMad)" - - "Current installer codebase analysis (tools/cli/)" - - "Vercel skills CLI README (github.com/vercel-labs/skills)" - - "Web research: AI agent skills ecosystem, marketplace landscape" ---- - -# Product Brief: BMAD Next-Gen Installer - -## Executive Summary - -The BMAD Method has grown from a developer-focused agile AI methodology into a framework used across 20+ platforms — but its installer hasn't kept up. Today, every new AI tool that supports skills means manual work: adding platform configs, maintaining directory mappings, testing installation paths. This doesn't scale, and it locks BMAD out of the fastest-growing segment of the market: non-technical users on low-code and UI-based platforms. - -The Next-Gen Installer replaces BMAD's monolithic Node.js CLI with a skill-based architecture built on the emerging Agent Skills standard. By leveraging the open-source Vercel skills CLI for cross-platform installation and introducing a plugin system where BMAD modules are self-describing skill bundles, we eliminate the platform maintenance burden, open distribution beyond what BMAD could maintain alone, and lay the foundation for a marketplace where anyone — developers, creators, educators, therapists — can discover, download, and install BMAD plugins without needing Git, Node, or a terminal. - -This isn't just a better installer. It's the infrastructure that transforms BMAD from a dev methodology into an open platform. 
- -## The Problem - -BMAD currently supports ~20 AI platforms through a custom Node.js installer that maintains per-platform directory mappings, template formats, and legacy migration paths. Every platform that changes its skill conventions — and they change constantly — requires installer updates, testing, and a new release. This is the single biggest maintenance burden on the bmad-code team. - -Meanwhile, the Agent Skills ecosystem has exploded. The broader skills ecosystem now spans 40+ platforms with hundreds of thousands of skills and millions of installs. The market is moving fast, and BMAD is fighting to keep up with platform-by-platform manual support while new tools launch weekly. - -Worse, the current installer requires Node.js and npm — a hard barrier for the growing population of non-technical users building with AI through UI-based platforms like Claude Co-Work. These users can't run `npx bmad-method install`. They need something simpler. - -The cost of the status quo is clear: developer time spent maintaining platform configs instead of building methodology, and an entire user segment that can't access BMAD at all. - -## The Solution - -The Next-Gen Installer is a skill-based distribution and registration system with three layers: - -**1. Self-Describing Plugins.** Every BMAD module becomes a plugin — a bundle of skills with a manifest that declares what's included, how skills relate to each other, what capabilities they provide, and how they integrate with the BMAD help system. The plugin format adopts the Anthropic plugin standard (used by Vercel and the broader skills ecosystem) as its base, extended with a BMAD-specific manifest (`bmad-manifest.json`) for metadata the base standard doesn't cover — such as installer options, capability declarations, and help system integration. A plugin is fully self-contained: download it, put the skills in your tool's skill folder, and it works. - -**2. 
Cross-Platform Installation via Vercel Skills CLI.** For users who want an automated install experience, the installer builds on the MIT-licensed Vercel skills CLI, which handles translating the Anthropic plugin standard to 40+ platforms. The exact integration pattern — wrapping, forking, or calling as a dependency — is a PRD-level architecture decision. The strategic intent is clear: BMAD stops maintaining platform directory mappings and delegates that problem to a well-maintained open-source project. The Vercel dependency carries minor supply-chain risk, but the pros far outweigh it: the MIT license means BMAD can fork and maintain it if Vercel ever deprioritizes the project. Supporting many platforms is a core BMAD differentiator — we need this problem solved one way or another, and leveraging an existing solution beats building from scratch. - -**3. Runtime Registration via `bmad-init`.** A global skill that scans for installed BMAD manifests, registers capabilities, configures project settings, and bootstraps the BMAD experience. Users run it once after installation. It replaces the current installer's config-collection step and provides the entry point for updates via `bmad-update`. Note: `bmad-init` itself must be installed before it can run — the NPX installer and zip bundle README handle this bootstrapping step by ensuring `bmad-init` is always included as a base skill. - -For non-technical users, distribution is straightforward: download a zip containing all plugin skills plus a README with platform-specific guidance. The honest reality is that "copy to the right folder" still requires knowing where that folder is — and this varies by platform. The README provides per-platform instructions for the most common tools, and as the low-code/no-code AI platform space matures, we improve guidance and explore deeper integrations. 
We don't need to solve universal one-click install today, but we do need to be honest that the non-technical path has friction we'll reduce over time. - -## What Makes This Different - -**The anti-fragmentation layer.** The AI tooling space is fracturing across 40+ platforms with no shared methodology layer. BMAD is uniquely positioned to be the cross-platform constant — the structured approach that works the same in Cursor, Claude Code, Windsurf, Copilot, and whatever launches next month. Every other methodology or skill framework maintains its own platform support matrix. By building on the open-source skills CLI ecosystem, BMAD offloads the highest-churn maintenance burden and focuses on what actually differentiates it: the methodology itself. - -**Methodology-backed quality in a sea of AI slop.** The broader skills ecosystem is flooded with low-quality, AI-generated content — and early research (Snyk, 2026) suggests a meaningful percentage of community skills contain security vulnerabilities. BMAD plugins are different: they're structured, tested, and part of a coherent methodology. The BMAD manifest system ensures skills work together, declare dependencies, and integrate with the help system. This is a curated ecosystem, not an open bazaar — all plugin submissions will be gated, reviewed, and curated by the BMAD creator and open-source core team. This human-reviewed quality gate is a key differentiator that becomes more valuable as the broader ecosystem grows noisier. - -**Platform for everything, not just code.** No competitor in the AI skills space is building beyond software development workflows. BMAD's plugin architecture is domain-agnostic — the same manifest system, installer, and registration flow that powers the dev methodology will power creative, educational, therapeutic, and personal plugins built with the BMAD Builder. This is unaddressed whitespace in the current market. 
- -## Who This Serves - -**BMAD Open-Source Contributors and Module Authors** (primary v1 target) — The people who build BMAD modules today. They currently package workflows and agents manually and rely on the installer team to support new platforms. They need a standardized way to package modules as self-contained skill plugins that work anywhere — and they need to do it without waiting on installer changes. Success: a module author can package, test, and distribute a plugin independently. - -**Developers Using AI Coding Tools** — Technical users across Claude Code, Cursor, Gemini CLI, Codex, and dozens of other platforms who want to install BMAD with a single command and have it just work, regardless of their tool. Success: `npx` one-liner installs BMAD to their tool of choice, and `bmad-init` configures it for their project. - -**Non-Technical AI Users** — People building with AI through UI-based platforms who don't have (or want) a development environment. They need download-and-copy simplicity with clear, platform-specific guidance. This is an emerging segment — we don't fully understand their needs yet, but removing the Node.js barrier opens BMAD to product managers, designers, educators, and knowledge workers who currently cannot access it. Success: a user who has never opened a terminal can install and use BMAD on their platform. - -**Future Plugin Creators** — People who will build BMAD-compatible plugins for domains beyond software development. They need a distribution system that gets their work into users' hands without building their own installer. Success: a non-dev plugin author can package and share their creation using the same manifest and distribution system. 
- -## Success Criteria - -- **Platform maintenance burden reduced dramatically:** Custom platform directory code in BMAD's codebase approaches zero, with cross-platform installation delegated to the skills CLI ecosystem -- **Broad platform coverage:** Installation verified on the top platforms by install volume, with the skills CLI handling the long tail -- **Non-technical installation path exists:** Users can install BMAD without Node.js, npm, or Git — validated by at least testing the flow with non-developer users -- **Plugin self-registration works:** `bmad-init` correctly discovers and registers all installed BMAD plugins from manifests alone, with clear error messages for malformed or missing manifests -- **Module authors can package and distribute plugins** using the manifest system without needing installer changes — validated by at least one external module author successfully publishing a plugin -- **Update path exists:** `bmad-update` allows users to update installed plugins without reinstalling from scratch -- **Migration from current installer:** Existing BMAD users on the Node.js CLI have a clear, documented path to the next-gen system - -## Scope - -**In scope for the next-gen installer:** -- Plugin manifest format (`bmad-manifest.json`) and specification -- `bmad-init` skill for runtime discovery and registration -- `bmad-update` skill for plugin updates -- Integration with Vercel skills CLI (or equivalent) for automated cross-platform installation -- NPX-based installer for technical users -- Downloadable zip bundles with platform-specific README guidance for non-technical users -- Migration path from current installer — existing users need a clear upgrade story, whether that's a migration command or documented manual steps - -**Explicitly out of scope:** -- BMAD Builder (plugin creation tool) — separate initiative -- Marketplace platform (web-based discovery and download) — future phase -- Converting existing workflows/agents to skills — 
prerequisite, handled separately -- One-click install for every platform — emerging space, guidance-based for now -- Monetization or paid plugin infrastructure -- Plugin quality certification process — the review and curation workflow will be defined separately, though the gated-submission principle is a core architectural requirement - -## Vision - -If the next-gen installer succeeds, BMAD becomes the first AI agent methodology that is truly platform-agnostic and accessible to non-developers. The plugin architecture creates the foundation for a marketplace where a therapist can download a "Guided Journaling" BMAD plugin, a game designer can install a "World Building" plugin, and a startup founder can get the full software development methodology — all through the same system, on whatever AI platform they use. - -In 2-3 years, BMAD plugins become a leading way people package and share structured AI agent workflows. The combination of methodology-backed quality, cross-platform portability, and open distribution creates a flywheel: more plugins attract more users across more platforms, more users attract more plugin creators from more domains, and the growing library of quality plugins reinforces BMAD's reputation as the curated alternative to the skills bazaar. BMAD evolves from a method into an ecosystem. diff --git a/src/module.yaml b/src/module.yaml index ed38ac4..4003333 100644 --- a/src/module.yaml +++ b/src/module.yaml @@ -3,6 +3,12 @@ name: "BMad Builder" description: "Standard Skill Compliant Factory for BMad Agents, Workflows and Modules" default_selected: false +# Variables from Core Config inserted: +## user_name +## communication_language +## document_output_language +## output_folder + bmad_builder_output_folder: prompt: "Where should your custom skills (agents and workflows) be saved?" 
default: "_bmad-output/skills" diff --git a/src/skills/bmad-agent-builder/SKILL.md b/src/skills/bmad-agent-builder/SKILL.md index d840ec5..177ca7f 100644 --- a/src/skills/bmad-agent-builder/SKILL.md +++ b/src/skills/bmad-agent-builder/SKILL.md @@ -40,7 +40,7 @@ This is the core creative path — where agent ideas become reality. Through six Agents are named personas with optional memory, capabilities, autonomous modes, and personality. The build process includes a lint gate for structural validation. When building or modifying agents that include scripts, unit tests are created alongside the scripts and run as part of validation. -Load `prompts/build-process.md` to begin. +Load `build-process.md` to begin. ## Quality Optimizer @@ -48,7 +48,7 @@ For agents that already work but could work *better*. This is comprehensive vali Run this anytime you want to assess and improve an existing agent's quality. -Load `prompts/quality-optimizer.md` — it orchestrates everything including scan modes, autonomous handling, and remediation options. +Load `quality-optimizer.md` — it orchestrates everything including scan modes, autonomous handling, and remediation options. --- @@ -56,8 +56,8 @@ Load `prompts/quality-optimizer.md` — it orchestrates everything including sca | Intent | Trigger Phrases | Route | |--------|----------------|-------| -| **Builder** | "build/create/design/convert/edit/fix an agent", "new agent" | Load `prompts/build-process.md` | -| **Quality Optimizer** | "quality check", "validate", "review/optimize/improve agent" | Load `prompts/quality-optimizer.md` | +| **Builder** | "build/create/design/convert/edit/fix an agent", "new agent" | Load `build-process.md` | +| **Quality Optimizer** | "quality check", "validate", "review/optimize/improve agent" | Load `quality-optimizer.md` | | **Unclear** | — | Present the two options above and ask | Pass `{headless_mode}` flag to all routes. Use Todo List to track progress through multi-step flows. 
Use subagents for parallel work (quality scanners, web research or document review). diff --git a/src/skills/bmad-agent-builder/agents/report-quality-scan-creator.md b/src/skills/bmad-agent-builder/agents/report-quality-scan-creator.md deleted file mode 100644 index a49a9ae..0000000 --- a/src/skills/bmad-agent-builder/agents/report-quality-scan-creator.md +++ /dev/null @@ -1,181 +0,0 @@ -# Quality Scan Report Creator - -You are a master quality engineer tech writer agent QualityReportBot-9001 and you will create a comprehensive, cohesive quality report from multiple scanner outputs. You read all temporary JSON fragments, consolidate findings, remove duplicates, and produce a well-organized markdown report. Ensure that nothing is missed. You are quality obsessed, after your initial report is created as outlined in this file, you will re-scan every temp finding again and think one level deeper to ensure its properly covered all findings and accounted for in the report, including proposed remediation suggestions. You will never attempt to actually fix anything - you are a master quality engineer tech writer. - -## Inputs - -You will receive: -- `{skill-path}` — Path to the agent being validated -- `{quality-report-dir}` — Directory containing scanner temp files AND where to write the final report - -## Process - -1. List all `*-temp.json` files in `{quality-report-dir}` -2. Read each JSON file and extract all findings -3. Consolidate and deduplicate findings across scanners -4. Organize by category, then by severity within each category -5. Identify truly broken/missing issues (CRITICAL and HIGH severity) -6. Write comprehensive markdown report -7. Return JSON summary with report link and most importantly the truly broken/missing item or failing issues (CRITICAL and HIGH severity) - -## Categories to Organize By - -1. **Structure & Capabilities** — Frontmatter, sections, manifest, capabilities, identity, memory setup (from structure scanner + lint scripts) -2. 
**Prompt Craft** — Token efficiency, anti-patterns, outcome balance, persona voice, communication consistency (from prompt-craft scanner + lint scripts) -3. **Execution Efficiency** — Parallelization, subagent delegation, memory loading, context optimization (from execution-efficiency scanner) -4. **Path & Script Standards** — Path conventions, double-prefix, script quality, portability (from lint scripts) -5. **Agent Cohesion** — Persona-capability alignment, gaps, redundancies, coherence (from cohesion scanner) -6. **Creative — Edge-case discoveries, experience gaps, delight opportunities, assumption risks (advisory)** (from enhancement scanner — advisory, not errors) - -## Scanner Sources (7 Scanners) - -| Scanner | Temp File | Category | -|---------|-----------|----------| -| structure | structure-temp.json | Structure & Capabilities | -| prompt-craft | prompt-craft-temp.json | Prompt Craft | -| execution-efficiency | execution-efficiency-temp.json | Execution Efficiency | -| path-standards | path-standards-temp.json | Path & Script Standards | -| scripts | scripts-temp.json | Path & Script Standards | -| agent-cohesion | agent-cohesion-temp.json | Agent Cohesion | -| enhancement-opportunities | enhancement-opportunities-temp.json | Enhancement Opportunities | - -## Severity Order Within Categories - -CRITICAL → HIGH → MEDIUM → LOW - -## Report Format - -```markdown -# Quality Report: {Agent Skill Name} - -**Scanned:** {timestamp} -**Skill Path:** {skill-path} -**Report:** {output-file} -**Performed By** QualityReportBot-9001 and {user_name} - -## Executive Summary - -- **Total Issues:** {n} -- **Critical:** {n} | **High:** {n} | **Medium:** {n} | **Low:** {n} -- **Overall Quality:** {Excellent / Good / Fair / Poor} - -### Issues by Category - -| Category | Critical | High | Medium | Low | -|----------|----------|------|--------|-----| -| Structure & Capabilities | {n} | {n} | {n} | {n} | -| Prompt Craft | {n} | {n} | {n} | {n} | -| Execution Efficiency | {n} | 
{n} | {n} | {n} | -| Path & Script Standards | {n} | {n} | {n} | {n} | -| Agent Cohesion | {n} | {n} | {n} | {n} | -| Creative (Edge-Case & Experience Innovation) | — | — | {n} | {n} | - ---- - -## Truly Broken or Missing - -*Issues that prevent the agent from working correctly:* - -{If any CRITICAL or HIGH issues exist, list them here with brief description and fix} - ---- - -## Detailed Findings by Category - -### 1. Structure & Capabilities - -**Critical Issues** -{if any} - -**High Priority** -{if any} - -**Medium Priority** -{if any} - -**Low Priority (Optional)** -{if any} - -### 2. Prompt Craft -{repeat pattern above} - -### 3. Execution Efficiency -{repeat pattern above} - -### 4. Path & Script Standards -{repeat pattern above} - -### 5. Agent Cohesion -{repeat pattern above, include alignment analysis and creative suggestions} - -### 6. Creative (Edge-Case & Experience Innovation) -{list opportunities, no severity — advisory items only} - ---- - -## Quick Wins (High Impact, Low Effort) - -{List issues that are easy to fix with high value} - ---- - -## Optimization Opportunities - -**Token Efficiency:** -{findings related to token savings} - -**Performance:** -{findings related to execution speed} - -**Maintainability:** -{findings related to code/agent structure} - ---- - -## Recommendations - -1. {Most important action item} -2. {Second priority} -3. 
{Third priority} -``` - -## Output - -Write report to: `{quality-report-dir}/quality-report-{skill-name}-{timestamp}.md` - -Return JSON: - -```json -{ - "report_file": "{full-path-to-report}", - "summary": { - "total_issues": 0, - "critical": 0, - "high": 0, - "medium": 0, - "low": 0, - "overall_quality": "Excellent|Good|Fair|Poor", - "truly_broken_found": true|false, - "truly_broken_count": 0 - }, - "by_category": { - "structure_capabilities": {"critical": 0, "high": 0, "medium": 0, "low": 0}, - "prompt_craft": {"critical": 0, "high": 0, "medium": 0, "low": 0}, - "execution_efficiency": {"critical": 0, "high": 0, "medium": 0, "low": 0}, - "path_script_standards": {"critical": 0, "high": 0, "medium": 0, "low": 0}, - "agent_cohesion": {"critical": 0, "high": 0, "medium": 0, "low": 0}, - "enhancement_opportunities": {"count": 0, "description": "Creative — edge-case discoveries, experience gaps, delight opportunities, assumption risks"} - }, - "high_impact_quick_wins": [ - {"issue": "description", "file": "location", "effort": "low"} - ] -} -``` - -## Notes - -- Remove duplicate issues that appear in multiple scanner outputs -- If the same issue is found in multiple files, list it once with all affected files -- Preserve all CRITICAL and HIGH severity findings — these indicate broken functionality -- MEDIUM and LOW can be consolidated if they're similar -- Autonomous opportunities are not "issues" — they're enhancements, so categorize separately diff --git a/src/skills/bmad-agent-builder/templates/SKILL-template.md b/src/skills/bmad-agent-builder/assets/SKILL-template.md similarity index 87% rename from src/skills/bmad-agent-builder/templates/SKILL-template.md rename to src/skills/bmad-agent-builder/assets/SKILL-template.md index 9314e07..6bdec78 100644 --- a/src/skills/bmad-agent-builder/templates/SKILL-template.md +++ b/src/skills/bmad-agent-builder/assets/SKILL-template.md @@ -18,7 +18,7 @@ description: {skill-description} # Format: [4-6 word summary]. 
[trigger: "User w - Look for `--headless` in the activation context - If `--headless:{task-name}` → run that specific autonomous task - If just `--headless` → run default autonomous wake behavior - - Load and execute `prompts/headless-wake.md` with task context + - Load and execute `headless-wake.md` with task context - Do NOT load config, do NOT greet user, do NOT show menu - Execute task, write results, exit silently @@ -50,7 +50,7 @@ description: {skill-description} # Format: [4-6 word summary]. [trigger: "User w ## Sidecar Memory location: `_bmad/_memory/{skillName}-sidecar/` -Load `resources/memory-system.md` for memory discipline and structure. +Load `references/memory-system.md` for memory discipline and structure. {/if-sidecar} ## On Activation @@ -61,14 +61,14 @@ Load `resources/memory-system.md` for memory discipline and structure. - Store any other config variables as `{var-name}` and use appropriately {if-autonomous} -2. **If autonomous mode** — Load and run `prompts/autonomous-wake.md` (default wake behavior), or load the specified prompt and execute its autonomous section without interaction +2. **If autonomous mode** — Load and run `autonomous-wake.md` (default wake behavior), or load the specified prompt and execute its autonomous section without interaction 3. **If interactive mode** — Continue with steps below: {/if-autonomous} {if-no-autonomous} 2. 
**Continue with steps below:** {/if-no-autonomous} - {if-sidecar}- **Check first-run** — If no `{skillName}-sidecar/` folder exists in `_bmad/_memory/`, load `prompts/init.md` for first-run setup + {if-sidecar}- **Check first-run** — If no `{skillName}-sidecar/` folder exists in `_bmad/_memory/`, load `init.md` for first-run setup - **Load access boundaries** — Read `_bmad/_memory/{skillName}-sidecar/access-boundaries.md` to enforce read/write/deny zones (load before any file operations) - **Load memory** — Read `_bmad/_memory/{skillName}-sidecar/index.md` for essential context and previous session{/if-sidecar} - **Load manifest** — Read `bmad-manifest.json` to set `{capabilities}` list of actions the agent can perform (internal prompts and available skills) @@ -93,5 +93,5 @@ Load `resources/memory-system.md` for memory discipline and structure. - DO NOT hardcode menu examples — generate from actual manifest data **CRITICAL Handling:** When user selects a code/number, consult the bmad-manifest.json capability mapping: -- **prompt:{name}** — Load and use the actual prompt from `prompts/{name}.md` — DO NOT invent the capability on the fly +- **prompt:{name}** — Load and use the actual prompt from `{name}.md` — DO NOT invent the capability on the fly - **skill:{name}** — Invoke the skill by its exact registered name diff --git a/src/skills/bmad-agent-builder/templates/autonomous-wake.md b/src/skills/bmad-agent-builder/assets/autonomous-wake.md similarity index 100% rename from src/skills/bmad-agent-builder/templates/autonomous-wake.md rename to src/skills/bmad-agent-builder/assets/autonomous-wake.md diff --git a/src/skills/bmad-agent-builder/templates/init-template.md b/src/skills/bmad-agent-builder/assets/init-template.md similarity index 100% rename from src/skills/bmad-agent-builder/templates/init-template.md rename to src/skills/bmad-agent-builder/assets/init-template.md diff --git a/src/skills/bmad-agent-builder/templates/memory-system.md 
b/src/skills/bmad-agent-builder/assets/memory-system.md similarity index 98% rename from src/skills/bmad-agent-builder/templates/memory-system.md rename to src/skills/bmad-agent-builder/assets/memory-system.md index 1301c5b..8c3946c 100644 --- a/src/skills/bmad-agent-builder/templates/memory-system.md +++ b/src/skills/bmad-agent-builder/assets/memory-system.md @@ -126,4 +126,4 @@ Regularly (every few sessions or when files grow large): ## First Run -If sidecar doesn't exist, load `prompts/init.md` to create the structure. +If sidecar doesn't exist, load `init.md` to create the structure. diff --git a/src/skills/bmad-agent-builder/assets/quality-report-template.md b/src/skills/bmad-agent-builder/assets/quality-report-template.md new file mode 100644 index 0000000..b6811db --- /dev/null +++ b/src/skills/bmad-agent-builder/assets/quality-report-template.md @@ -0,0 +1,282 @@ +# Quality Report: {agent-name} + +**Scanned:** {timestamp} +**Skill Path:** {skill-path} +**Report:** {report-file-path} +**Performed By:** QualityReportBot-9001 and {user_name} + +## Executive Summary + +- **Total Issues:** {total-issues} +- **Critical:** {critical} | **High:** {high} | **Medium:** {medium} | **Low:** {low} +- **Overall Quality:** {Excellent|Good|Fair|Poor} +- **Overall Cohesion:** {cohesion-score} +- **Craft Assessment:** {craft-assessment} + + +{executive-narrative} + +### Issues by Category + +| Category | Critical | High | Medium | Low | +|----------|----------|------|--------|-----| +| Structure & Capabilities | {n} | {n} | {n} | {n} | +| Prompt Craft | {n} | {n} | {n} | {n} | +| Execution Efficiency | {n} | {n} | {n} | {n} | +| Path & Script Standards | {n} | {n} | {n} | {n} | +| Agent Cohesion | {n} | {n} | {n} | {n} | +| Creative | — | — | {n} | {n} | + +--- + +## Agent Identity + + + +- **Persona:** {persona-summary} +- **Primary Purpose:** {primary-purpose} +- **Capabilities:** {capability-count} + +--- + +## Strengths + +*What this agent does well — preserve these
during optimization:* + + + +{strengths-list} + +--- + +{if-truly-broken} +## Truly Broken or Missing + +*Issues that prevent the agent from working correctly:* + + + +{truly-broken-findings} + +--- +{/if-truly-broken} + +## Detailed Findings by Category + +### 1. Structure & Capabilities + + + +{if-structure-metadata} +**Agent Metadata:** +- Sections found: {sections-list} +- Capabilities: {capabilities-count} +- Memory sidecar: {has-memory} +- Headless mode: {has-headless} +- Manifest valid: {manifest-valid} +- Structure assessment: {structure-assessment} +{/if-structure-metadata} + + + +{structure-findings} + +### 2. Prompt Craft + + + +**Agent Assessment:** +- Agent type: {skill-type-assessment} +- Overview quality: {overview-quality} +- Progressive disclosure: {progressive-disclosure} +- Persona context: {persona-context} +- {skillmd-assessment-notes} + +{if-prompt-health} +**Prompt Health:** {prompts-with-config-header}/{total-prompts} with config header | {prompts-with-progression}/{total-prompts} with progression conditions | {prompts-self-contained}/{total-prompts} self-contained +{/if-prompt-health} + +{prompt-craft-findings} + +### 3. Execution Efficiency + + + +{efficiency-issue-findings} + +{if-efficiency-opportunities} +**Optimization Opportunities:** + + + +{efficiency-opportunities} +{/if-efficiency-opportunities} + +### 4. Path & Script Standards + + + +{if-script-inventory} +**Script Inventory:** {total-scripts} scripts ({by-type-breakdown}) | Missing tests: {missing-tests-list} +{/if-script-inventory} + +{path-script-findings} + +### 5. 
Agent Cohesion + + + +{if-cohesion-analysis} +**Cohesion Analysis:** + + + +| Dimension | Score | Notes | +|-----------|-------|-------| +| Persona Alignment | {score} | {notes} | +| Capability Completeness | {score} | {notes} | +| Redundancy Level | {score} | {notes} | +| External Integration | {score} | {notes} | +| User Journey | {score} | {notes} | + +{if-consolidation-opportunities} +**Consolidation Opportunities:** + + + +{consolidation-opportunities} +{/if-consolidation-opportunities} +{/if-cohesion-analysis} + +{cohesion-findings} + +{if-creative-suggestions} +**Creative Suggestions:** + + + +{creative-suggestions} +{/if-creative-suggestions} + +### 6. Creative (Edge-Case & Experience Innovation) + + + +**Agent Understanding:** +- **Purpose:** {skill-purpose} +- **Primary User:** {primary-user} +- **Key Assumptions:** +{key-assumptions-list} + +**Enhancement Findings:** + + + +{enhancement-findings} + +{if-top-insights} +**Top Insights:** + + + +{top-insights} +{/if-top-insights} + +--- + +{if-user-journeys} +## User Journeys + +*How different user archetypes experience this agent:* + + + +### {archetype-name} + +{journey-summary} + +**Friction Points:** +{friction-points-list} + +**Bright Spots:** +{bright-spots-list} + + + +--- +{/if-user-journeys} + +{if-autonomous-assessment} +## Autonomous Readiness + + + +- **Overall Potential:** {overall-potential} +- **HITL Interaction Points:** {hitl-count} +- **Auto-Resolvable:** {auto-resolvable-count} +- **Needs Input:** {needs-input-count} +- **Suggested Output Contract:** {output-contract} +- **Required Inputs:** {required-inputs-list} +- **Notes:** {assessment-notes} + +--- +{/if-autonomous-assessment} + +{if-script-opportunities} +## Script Opportunities + + + +**Existing Scripts:** {existing-scripts-list} + + + +{script-opportunity-findings} + +**Token Savings:** {total-estimated-token-savings} | Highest value: {highest-value-opportunity} | Prepass opportunities: {prepass-count} + +--- 
+{/if-script-opportunities} + +## Quick Wins (High Impact, Low Effort) + + + +| Issue | File | Effort | Impact | +|-------|------|--------|--------| +{quick-wins-rows} + +--- + +## Optimization Opportunities + + + +**Token Efficiency:** +{token-optimization-narrative} + +**Performance:** +{performance-optimization-narrative} + +**Maintainability:** +{maintainability-optimization-narrative} + +--- + +## Recommendations + + + +1. {recommendation-1} +2. {recommendation-2} +3. {recommendation-3} +4. {recommendation-4} +5. {recommendation-5} diff --git a/src/skills/bmad-agent-builder/templates/save-memory.md b/src/skills/bmad-agent-builder/assets/save-memory.md similarity index 100% rename from src/skills/bmad-agent-builder/templates/save-memory.md rename to src/skills/bmad-agent-builder/assets/save-memory.md diff --git a/src/skills/bmad-agent-builder/bmad-manifest.json b/src/skills/bmad-agent-builder/bmad-manifest.json index eaa12bd..d9a6ace 100644 --- a/src/skills/bmad-agent-builder/bmad-manifest.json +++ b/src/skills/bmad-agent-builder/bmad-manifest.json @@ -7,7 +7,7 @@ "menu-code": "BP", "description": "Build, edit, or convert agents through six-phase conversational discovery. Covers new agents, format conversion, edits, and fixes.", "supports-headless": true, - "prompt": "prompts/build-process.md", + "prompt": "build-process.md", "phase-name": "anytime", "output-location": "{bmad_builder_output_folder}" }, @@ -16,7 +16,7 @@ "menu-code": "QO", "description": "Comprehensive validation and optimization using lint scripts and LLM scanner subagents. 
Structure, prompt craft, efficiency, and more.", "supports-headless": true, - "prompt": "prompts/quality-optimizer.md", + "prompt": "quality-optimizer.md", "phase-name": "anytime", "output-location": "{bmad_builder_reports}" } diff --git a/src/skills/bmad-agent-builder/prompts/build-process.md b/src/skills/bmad-agent-builder/build-process.md similarity index 77% rename from src/skills/bmad-agent-builder/prompts/build-process.md rename to src/skills/bmad-agent-builder/build-process.md index 07ad158..4eb52cf 100644 --- a/src/skills/bmad-agent-builder/prompts/build-process.md +++ b/src/skills/bmad-agent-builder/build-process.md @@ -65,7 +65,7 @@ Work through these conversationally: - **Checkpoint data** (save periodically): What can be batched and saved occasionally? - **Save triggers:** After which interactions should memory be updated? - **Capabilities:** - - **Internal prompts:** Capabilities the agent knows itself (each will get a prompt file in `prompts/`) + - **Internal prompts:** Capabilities the agent knows itself (each will get its own prompt file) - **External skills:** Skills the agent invokes (ask for **exact registered skill names** — e.g., `bmad-init`, `skill-creator`) - Note: Skills may exist now or be created later - **First-run:** What should it ask on first activation? 
(standalone only; module-based gets config from module's config.yaml) @@ -87,7 +87,7 @@ Work through these conversationally: - **Path Conventions** (CRITICAL for reliable agent behavior): - **Memory location:** `{project-root}/_bmad/_memory/{skillName}-sidecar/` - **Project artifacts:** `{project-root}/_bmad/...` when referencing project-level files - - **Skill-internal files:** Use relative paths (`resources/`, `prompts/`, `scripts/`) + - **Skill-internal files:** Use relative paths (`references/`, `scripts/`) - **Config variables:** Use directly — they already contain full paths (NO `{project-root}` prefix) - Correct: `{output_folder}/file.md` - Wrong: `{project-root}/{output_folder}/file.md` (double-prefix breaks resolution) @@ -100,19 +100,19 @@ Once you have a cohesive idea, think one level deeper. Once you have done this, ## Phase 5: Build **Always load these before building:** -- Load `resources/standard-fields.md` — field definitions, description format, path rules -- Load `resources/skill-best-practices.md` — authoring patterns (freedom levels, templates, anti-patterns) -- Load `resources/quality-dimensions.md` — quick mental checklist for build quality +- Load `references/standard-fields.md` — field definitions, description format, path rules +- Load `references/skill-best-practices.md` — authoring patterns (freedom levels, templates, anti-patterns) +- Load `references/quality-dimensions.md` — quick mental checklist for build quality **Load based on context:** -- **If module-based:** Load `resources/metadata-reference.md` — manifest.json field definitions, module metadata structure, config loading requirements -- **Always load** `resources/script-opportunities-reference.md` — script opportunity spotting guide, catalog, and output standards. Use this to identify additional script opportunities not caught in Phase 2, even if no scripts were initially planned. 
+- **If module-based:** Load `references/metadata-reference.md` — manifest.json field definitions, module metadata structure, config loading requirements +- **Always load** `references/script-opportunities-reference.md` — script opportunity spotting guide, catalog, and output standards. Use this to identify additional script opportunities not caught in Phase 2, even if no scripts were initially planned. When confirmed: -1. Load template substitution rules from `resources/template-substitution-rules.md` and apply +1. Load template substitution rules from `references/template-substitution-rules.md` and apply -2. Create skill structure using templates from `templates/` folder: +2. Create skill structure using templates from `assets/` folder: - **SKILL-template.md** — skill wrapper with full persona content embedded - **init-template.md** — first-run setup (if sidecar) - **memory-system.md** — memory (if sidecar, saved at root level) @@ -132,7 +132,7 @@ When confirmed: python3 scripts/manifest.py add-capability {skill-path} \ --name {name} --menu-code {MC} --description "Short: what it produces." \ --supports-autonomous \ - --prompt prompts/{name}.md # internal capability + --prompt {name}.md # internal capability # OR --skill-name {skill} # external skill # omit both if SKILL.md handles it directly @@ -150,22 +150,32 @@ When confirmed: --is-required ``` -4. **Folder structure** (no `assets/` folder — everything at root): +4. 
**Folder structure:** ``` {skill-name}/ -├── SKILL.md # Contains full persona content (agent.md embedded) -├── bmad-manifest.json # Capabilities, persona, memory, module integration -├── resources/ -│ └── memory-system.md # (if sidecar needed) -├── scripts/ # python or shell scripts needed for the agent -│ └── run-tests.sh # uvx-powered test runner (if python tests exist) -└── prompts/ # Internal capability prompts - ├── init.md # First-run setup - ├── autonomous-wake.md # Autonomous activation (if autonomous mode) - ├── save-memory.md # Explicit memory save (if sidecar) - └── {name}.md # Each internal capability prompt +├── SKILL.md # Contains full persona content (agent.md embedded) +├── bmad-manifest.json # Capabilities, persona, memory, module integration +├── init.md # First-run setup (if sidecar) +├── autonomous-wake.md # Autonomous activation (if autonomous mode) +├── save-memory.md # Explicit memory save (if sidecar) +├── {name}.md # Each internal capability prompt +├── references/ # Reference data, schemas, guides (read for context) +│ └── memory-system.md # (if sidecar needed) +├── assets/ # Templates, starter files (copied/transformed into output) +└── scripts/ # Deterministic code — validation, transformation, testing + └── run-tests.sh # uvx-powered test runner (if python tests exist) ``` +**What goes where:** +| Location | Contains | LLM relationship | +|----------|----------|-----------------| +| **Root `.md` files** | Prompt/instruction files, subagent definitions | LLM **loads and executes** these as instructions — they are extensions of SKILL.md | +| **`references/`** | Reference data, schemas, tables, examples, guides | LLM **reads for context** — informational, not executable | +| **`assets/`** | Templates, starter files, boilerplate | LLM **copies/transforms** these into output — not for reasoning | +| **`scripts/`** | Python, shell scripts with tests | LLM **invokes** these — deterministic operations that don't need judgment | + +Only create 
subfolders that are needed — most skills won't need all four. + 5. Output to `bmad_builder_output_folder` from config, or `{project-root}/bmad-builder-creations/` 6. **Lint gate** — run deterministic validation scripts: @@ -184,6 +194,6 @@ Present what was built: location, structure, first-run behavior, capabilities. A Ask: *"Build is done. Would you like to run a Quality Scan to optimize the agent further?"* -If yes, load `prompts/quality-optimizer.md` with `{scan_mode}=full` and the agent path. +If yes, load `quality-optimizer.md` with `{scan_mode}=full` and the agent path. Remind them: BMad module system compliant. Use `bmad-init` skill to integrate into a project. diff --git a/src/skills/bmad-agent-builder/prompts/quality-optimizer.md b/src/skills/bmad-agent-builder/quality-optimizer.md similarity index 79% rename from src/skills/bmad-agent-builder/prompts/quality-optimizer.md rename to src/skills/bmad-agent-builder/quality-optimizer.md index 28a7cfe..2e22591 100644 --- a/src/skills/bmad-agent-builder/prompts/quality-optimizer.md +++ b/src/skills/bmad-agent-builder/quality-optimizer.md @@ -75,7 +75,7 @@ These run instantly, cost zero tokens, and produce structured JSON: | # | Script | Focus | Temp Filename | |---|--------|-------|---------------| -| S1 | `scripts/scan-path-standards.py` | Path conventions: no {skill-root}, {project-root} only for _bmad, bare _bmad, memory paths, double-prefix | `path-standards-temp.json` | +| S1 | `scripts/scan-path-standards.py` | Path conventions: {project-root} only for _bmad, bare _bmad, memory paths, double-prefix, absolute paths | `path-standards-temp.json` | | S2 | `scripts/scan-scripts.py` | Script portability, PEP 723, agentic design, unit tests | `scripts-temp.json` | ### Pre-Pass Scripts (Feed LLM Scanners) @@ -92,12 +92,12 @@ These extract metrics for the LLM scanners so they work from compact data instea | # | Scanner | Focus | Pre-Pass? 
| Temp Filename | |---|---------|-------|-----------|---------------| -| L1 | `agents/quality-scan-structure.md` | Structure, capabilities, identity, memory setup, consistency | Yes — receives prepass JSON | `structure-temp.json` | -| L2 | `agents/quality-scan-prompt-craft.md` | Token efficiency, anti-patterns, outcome balance, persona voice, Overview quality | Yes — receives metrics JSON | `prompt-craft-temp.json` | -| L3 | `agents/quality-scan-execution-efficiency.md` | Parallelization, subagent delegation, memory loading, context optimization | Yes — receives dep graph JSON | `execution-efficiency-temp.json` | -| L4 | `agents/quality-scan-agent-cohesion.md` | Persona-capability alignment, gaps, redundancies, coherence | No | `agent-cohesion-temp.json` | -| L5 | `agents/quality-scan-enhancement-opportunities.md` | Script automation, autonomous potential, edge cases, experience gaps, delight | No | `enhancement-opportunities-temp.json` | -| L6 | `agents/quality-scan-script-opportunities.md` | Deterministic operation detection — finds LLM work that should be scripts instead | No | `script-opportunities-temp.json` | +| L1 | `quality-scan-structure.md` | Structure, capabilities, identity, memory setup, consistency | Yes — receives prepass JSON | `structure-temp.json` | +| L2 | `quality-scan-prompt-craft.md` | Token efficiency, anti-patterns, outcome balance, persona voice, Overview quality | Yes — receives metrics JSON | `prompt-craft-temp.json` | +| L3 | `quality-scan-execution-efficiency.md` | Parallelization, subagent delegation, memory loading, context optimization | Yes — receives dep graph JSON | `execution-efficiency-temp.json` | +| L4 | `quality-scan-agent-cohesion.md` | Persona-capability alignment, gaps, redundancies, coherence | No | `agent-cohesion-temp.json` | +| L5 | `quality-scan-enhancement-opportunities.md` | Script automation, autonomous potential, edge cases, experience gaps, delight | No | `enhancement-opportunities-temp.json` | +| L6 | 
`quality-scan-script-opportunities.md` | Deterministic operation detection — finds LLM work that should be scripts instead | No | `script-opportunities-temp.json` | ## Execution Instructions @@ -125,7 +125,7 @@ After scripts complete, spawn applicable LLM scanners as parallel subagents. **For scanners WITHOUT pre-pass (L4, L5, L6):** provide just the skill path and output directory. Each subagent receives: -- Scanner file to load (e.g., `agents/quality-scan-agent-cohesion.md`) +- Scanner file to load (e.g., `quality-scan-agent-cohesion.md`) - Skill path to scan: `{skill-path}` - Output directory for results: `{quality-report-dir}` - Temp filename for output: `{temp-filename}` @@ -151,13 +151,23 @@ After all scripts and scanners complete: 3. Skip report creator (not needed for single scanner) **IF multiple LLM scanners:** -1. Initiate a subagent with `agents/report-quality-scan-creator.md` +1. Initiate a subagent with `report-quality-scan-creator.md` **Provide the subagent with:** - `{skill-path}` — The agent being validated - `{temp-files-dir}` — Directory containing all `*-temp.json` files (both script and LLM results) - `{quality-report-dir}` — Where to write the final report +## Generate HTML Report + +After the report creator finishes (or after presenting lint-only / single-scanner results), generate the interactive HTML report: + +```bash +python3 scripts/generate-html-report.py {quality-report-dir} --open +``` + +This produces `{quality-report-dir}/quality-report.html` — a self-contained interactive report with severity filters, collapsible sections, per-item copy-prompt buttons, and a batch prompt generator. The `--open` flag opens it in the default browser. 
+ ## Present Findings to User After receiving the JSON summary from the report creator: @@ -169,6 +179,7 @@ After receiving the JSON summary from the report creator: "headless_mode": true, "scan_completed": true, "report_file": "{full-path-to-report}", + "html_report": "{full-path-to-html}", "warnings": ["any warnings from pre-scan checks"], "summary": { "total_issues": 0, @@ -186,10 +197,10 @@ After receiving the JSON summary from the report creator: **IF `{headless_mode}=false` or not set:** 1. **High-level summary** with total issues by severity 2. **Highlight truly broken/missing** — CRITICAL and HIGH issues prominently -3. **Mention detailed report** — "Full report saved to: {report_file}" +3. **Mention reports** — "Full report: {report_file}" and "Interactive HTML report opened in browser (also at: {html_report})" 4. **Offer next steps:** - Apply fixes directly - - Export checklist for manual fixes + - Use the HTML report to select specific items and generate prompts - Discuss specific findings ## Key Principle diff --git a/src/skills/bmad-agent-builder/agents/quality-scan-agent-cohesion.md b/src/skills/bmad-agent-builder/quality-scan-agent-cohesion.md similarity index 77% rename from src/skills/bmad-agent-builder/agents/quality-scan-agent-cohesion.md rename to src/skills/bmad-agent-builder/quality-scan-agent-cohesion.md index 440ef71..66a8f17 100644 --- a/src/skills/bmad-agent-builder/agents/quality-scan-agent-cohesion.md +++ b/src/skills/bmad-agent-builder/quality-scan-agent-cohesion.md @@ -22,8 +22,8 @@ This is an **opinionated, advisory scan**. Findings are suggestions, not errors. 
Find and read: - `SKILL.md` — Identity, persona, principles, description - `bmad-manifest.json` — All capabilities with menu codes and descriptions -- `prompts/*.md` — What each prompt actually does -- `resources/dimension-definitions.md` — If exists, context for capability design +- `*.md` (prompt files at root) — What each prompt actually does +- `references/dimension-definitions.md` — If exists, context for capability design - Look for references to external skills in prompts and SKILL.md ## Cohesion Dimensions @@ -143,6 +143,12 @@ Find and read: ## Output Format +Output your findings using the universal schema defined in `references/universal-scan-schema.md`. + +Use EXACTLY these field names: `file`, `line`, `severity`, `category`, `title`, `detail`, `action`. Do not rename, restructure, or add fields to findings. + +Before writing output, verify: Is your array called `findings`? Does every item have `title`, `detail`, `action`? Is `assessments` an object, not items in the findings array? + You will receive `{skill-path}` and `{quality-report-dir}` as inputs. 
Write JSON findings to: `{quality-report-dir}/agent-cohesion-temp.json` @@ -151,69 +157,57 @@ Write JSON findings to: `{quality-report-dir}/agent-cohesion-temp.json` { "scanner": "agent-cohesion", "agent_path": "{path}", - "agent_identity": { - "name": "{skill-name}", - "persona_summary": "Brief characterization of who this agent is", - "primary_purpose": "What this agent is for", - "capability_count": 12 - }, "findings": [ { - "file": "SKILL.md|bmad-manifest.json|prompts/{name}.md", - "severity": "high|medium|low|suggestion", + "file": "SKILL.md|bmad-manifest.json|{name}.md", + "severity": "high|medium|low|suggestion|strength", "category": "gap|redundancy|misalignment|opportunity|strength", - "issue": "Brief description", - "observation": "What you noticed that led to this finding", - "rationale": "Why this matters for cohesion", - "suggestion": "Specific improvement idea", - "impact": "What value this would add if addressed" + "title": "Brief description", + "detail": "What you noticed, why this matters for cohesion, and what value addressing it would add", + "action": "Specific improvement idea" } ], - "cohesion_analysis": { - "persona_alignment": { - "score": "strong|moderate|weak", - "notes": "Brief explanation of why persona fits or doesn't fit capabilities" - }, - "capability_completeness": { - "score": "complete|mostly-complete|gaps-obvious", - "missing_areas": ["area1", "area2"], - "notes": "What's missing that should probably be there" - }, - "redundancy_level": { - "score": "clean|some-overlap|significant-redundancy", - "consolidation_opportunities": [ - { - "capabilities": ["cap-a", "cap-b", "cap-c"], - "suggested_consolidation": "How these could be combined" - } - ] + "assessments": { + "agent_identity": { + "name": "{skill-name}", + "persona_summary": "Brief characterization of who this agent is", + "primary_purpose": "What this agent is for", + "capability_count": 12 }, - "external_integration": { - "external_skills_referenced": 3, - 
"integration_pattern": "intentional|incidental|unclear", - "notes": "How external skills fit into the overall design" - }, - "user_journey_score": { - "score": "complete-end-to-end|mostly-complete|fragmented", - "broken_workflows": ["workflow that can't be completed"], - "notes": "Can a user accomplish real work with this agent?" + "cohesion_analysis": { + "persona_alignment": { + "score": "strong|moderate|weak", + "notes": "Brief explanation of why persona fits or doesn't fit capabilities" + }, + "capability_completeness": { + "score": "complete|mostly-complete|gaps-obvious", + "missing_areas": ["area1", "area2"], + "notes": "What's missing that should probably be there" + }, + "redundancy_level": { + "score": "clean|some-overlap|significant-redundancy", + "consolidation_opportunities": [ + { + "capabilities": ["cap-a", "cap-b", "cap-c"], + "suggested_consolidation": "How these could be combined" + } + ] + }, + "external_integration": { + "external_skills_referenced": 3, + "integration_pattern": "intentional|incidental|unclear", + "notes": "How external skills fit into the overall design" + }, + "user_journey_score": { + "score": "complete-end-to-end|mostly-complete|fragmented", + "broken_workflows": ["workflow that can't be completed"], + "notes": "Can a user accomplish real work with this agent?" + } } }, - "creative_suggestions": [ - { - "type": "new-capability|consolidation|refinement|persona-shift", - "idea": "Brief creative suggestion for improvement", - "rationale": "Why this would strengthen the agent", - "estimated_impact": "high|medium|low" - } - ], - "strengths": [ - "Something this agent does really well - positive feedback is useful!", - "Another strength..." 
- ], "summary": { "total_findings": 0, - "by_severity": {"high": 0, "medium": 0, "low": 0, "suggestion": 0}, + "by_severity": {"high": 0, "medium": 0, "low": 0, "suggestion": 0, "strength": 0}, "by_category": {"gap": 0, "redundancy": 0, "misalignment": 0, "opportunity": 0, "strength": 0}, "overall_cohesion": "cohesive|mostly-cohesive|fragmented|confused", "single_most_important_fix": "The ONE thing that would most improve this agent" @@ -221,6 +215,11 @@ Write JSON findings to: `{quality-report-dir}/agent-cohesion-temp.json` } ``` +Merge all findings into the single `findings[]` array: +- Former `findings[]` items: map `issue` to `title`, merge `observation`+`rationale`+`impact` into `detail`, map `suggestion` to `action` +- Former `strengths[]` items: use `severity: "strength"`, `category: "strength"` +- Former `creative_suggestions[]` items: use `severity: "suggestion"`, map `idea` to `title`, `rationale` to `detail`, merge `type` and `estimated_impact` context into `detail`, map actionable recommendation to `action` + ## Severity Guidelines | Severity | When to Use | diff --git a/src/skills/bmad-agent-builder/agents/quality-scan-enhancement-opportunities.md b/src/skills/bmad-agent-builder/quality-scan-enhancement-opportunities.md similarity index 87% rename from src/skills/bmad-agent-builder/agents/quality-scan-enhancement-opportunities.md rename to src/skills/bmad-agent-builder/quality-scan-enhancement-opportunities.md index a9e179b..df2b565 100644 --- a/src/skills/bmad-agent-builder/agents/quality-scan-enhancement-opportunities.md +++ b/src/skills/bmad-agent-builder/quality-scan-enhancement-opportunities.md @@ -27,9 +27,9 @@ You are NOT checking structure, craft quality, performance, or test coverage — Find and read: - `SKILL.md` — Understand the agent's purpose, persona, audience, and flow -- `prompts/*.md` — Walk through each capability as a user would experience it -- `resources/*.md` — Understand what supporting material exists -- `resources/*.json` — See 
what supporting schemas exist +- `*.md` (prompt files at root) — Walk through each capability as a user would experience it +- `references/*.md` — Understand what supporting material exists +- `references/*.json` — See what supporting schemas exist ## Creative Analysis Lenses @@ -165,6 +165,12 @@ For each journey, note: ## Output Format +Output your findings using the universal schema defined in `references/universal-scan-schema.md`. + +Use EXACTLY these field names: `file`, `line`, `severity`, `category`, `title`, `detail`, `action`. Do not rename, restructure, or add fields to findings. + +Before writing output, verify: Is your array called `findings`? Does every item have `title`, `detail`, `action`? Is `assessments` an object, not items in the findings array? + You will receive `{skill-path}` and `{quality-report-dir}` as inputs. Write JSON findings to: `{quality-report-dir}/enhancement-opportunities-temp.json` @@ -173,46 +179,47 @@ Write JSON findings to: `{quality-report-dir}/enhancement-opportunities-temp.jso { "scanner": "enhancement-opportunities", "skill_path": "{path}", - "skill_understanding": { - "purpose": "What this agent is trying to do", - "primary_user": "Who this agent is for", - "key_assumptions": ["assumption 1", "assumption 2"] - }, "findings": [ { - "file": "SKILL.md|prompts/{name}.md", + "file": "SKILL.md|{name}.md", "severity": "high-opportunity|medium-opportunity|low-opportunity", "category": "edge-case|experience-gap|delight-opportunity|assumption-risk|journey-friction|autonomous-potential|facilitative-pattern", - "scenario": "The specific situation or user story that reveals this opportunity", - "insight": "What you noticed and why it matters", - "suggestion": "Concrete, actionable improvement — the tempered version of the wild idea", - "user_impact": "How this would change the user's experience" - } - ], - "user_journeys": [ - { - "archetype": "first-timer|expert|confused|edge-case|hostile-environment|automator", - "journey_summary": 
"Brief narrative of this user's experience with the agent", - "friction_points": ["moment 1", "moment 2"], - "bright_spots": ["what works well for this user"] + "title": "The specific situation or user story that reveals this opportunity", + "detail": "What you noticed, why it matters, and how this would change the user's experience", + "action": "Concrete, actionable improvement — the tempered version of the wild idea" } ], - "autonomous_assessment": { - "overall_potential": "headless-ready|easily-adaptable|partially-adaptable|fundamentally-interactive", - "hitl_interaction_points": 0, - "auto_resolvable": 0, - "needs_input": 0, - "suggested_output_contract": "What a headless invocation would return", - "required_inputs": ["parameters needed upfront for headless mode"], - "notes": "Brief assessment of autonomous viability" + "assessments": { + "skill_understanding": { + "purpose": "What this agent is trying to do", + "primary_user": "Who this agent is for", + "key_assumptions": ["assumption 1", "assumption 2"] + }, + "user_journeys": [ + { + "archetype": "first-timer|expert|confused|edge-case|hostile-environment|automator", + "summary": "Brief narrative of this user's experience with the agent", + "friction_points": ["moment 1", "moment 2"], + "bright_spots": ["what works well for this user"] + } + ], + "autonomous_assessment": { + "potential": "headless-ready|easily-adaptable|partially-adaptable|fundamentally-interactive", + "hitl_points": 0, + "auto_resolvable": 0, + "needs_input": 0, + "suggested_output_contract": "What a headless invocation would return", + "required_inputs": ["parameters needed upfront for headless mode"], + "notes": "Brief assessment of autonomous viability" + }, + "top_insights": [ + { + "title": "The single most impactful creative observation", + "detail": "The user experience impact", + "action": "What to do about it" + } + ] }, - "top_insights": [ - { - "insight": "The single most impactful creative observation", - "suggestion": "What to 
do about it", - "why_it_matters": "The user experience impact" - } - ], "summary": { "total_findings": 0, "by_severity": {"high-opportunity": 0, "medium-opportunity": 0, "low-opportunity": 0}, @@ -225,8 +232,7 @@ Write JSON findings to: `{quality-report-dir}/enhancement-opportunities-temp.jso "autonomous_potential": 0, "facilitative_pattern": 0 }, - "boldest_idea": "The wildest suggestion that's still practical — the one that could transform this agent", - "overall_experience_assessment": "Brief creative assessment of the agent's user experience" + "assessment": "Brief creative assessment of the agent's user experience, including the boldest practical idea" } } ``` diff --git a/src/skills/bmad-agent-builder/agents/quality-scan-execution-efficiency.md b/src/skills/bmad-agent-builder/quality-scan-execution-efficiency.md similarity index 83% rename from src/skills/bmad-agent-builder/agents/quality-scan-execution-efficiency.md rename to src/skills/bmad-agent-builder/quality-scan-execution-efficiency.md index ba3e52e..a5b2201 100644 --- a/src/skills/bmad-agent-builder/agents/quality-scan-execution-efficiency.md +++ b/src/skills/bmad-agent-builder/quality-scan-execution-efficiency.md @@ -18,8 +18,8 @@ Pre-pass provides: dependency graph, sequential patterns, loop patterns, subagen Read raw files for judgment calls: - `SKILL.md` — On Activation patterns, operation flow -- `prompts/*.md` — Each prompt for execution patterns -- `resources/*.md` — Resource loading patterns +- `*.md` (prompt files at root) — Each prompt for execution patterns +- `references/*.md` — Resource loading patterns --- @@ -120,6 +120,12 @@ GOOD: Selective loading ## Output Format +Output your findings using the universal schema defined in `references/universal-scan-schema.md`. + +Use EXACTLY these field names: `file`, `line`, `severity`, `category`, `title`, `detail`, `action`. Do not rename, restructure, or add fields to findings. + +Before writing output, verify: Is your array called `findings`? 
Does every item have `title`, `detail`, `action`? Is `assessments` an object, not items in the findings array? + You will receive `{skill-path}` and `{quality-report-dir}` as inputs. Write JSON findings to: `{quality-report-dir}/execution-efficiency-temp.json` @@ -128,34 +134,29 @@ Write JSON findings to: `{quality-report-dir}/execution-efficiency-temp.json` { "scanner": "execution-efficiency", "skill_path": "{path}", - "issues": [ + "findings": [ { - "file": "SKILL.md|prompts/{name}.md", + "file": "SKILL.md|{name}.md", "line": 42, - "severity": "critical|high|medium|low", - "category": "sequential-independent|parent-reads-first|missing-batch|no-output-spec|subagent-chain-violation|memory-loading|resource-loading|missing-delegation", - "issue": "Brief description", - "current_pattern": "What it does now", - "efficient_alternative": "What it should do instead", - "estimated_savings": "Time/token savings estimate" - } - ], - "opportunities": [ - { - "type": "parallelization|batching|delegation|memory-optimization|resource-optimization", - "description": "What could be improved", - "recommendation": "Specific improvement", - "estimated_savings": "Estimated improvement" + "severity": "critical|high|medium|low|medium-opportunity", + "category": "sequential-independent|parent-reads-first|missing-batch|no-output-spec|subagent-chain-violation|memory-loading|resource-loading|missing-delegation|parallelization|batching|delegation|memory-optimization|resource-optimization", + "title": "Brief description", + "detail": "What it does now, and estimated time/token savings", + "action": "What it should do instead" } ], "summary": { - "total_issues": 0, + "total_findings": 0, "by_severity": {"critical": 0, "high": 0, "medium": 0, "low": 0}, "by_category": {} } } ``` +Merge all items into the single `findings[]` array: +- Former `issues[]` items: map `issue` to `title`, merge `current_pattern`+`estimated_savings` into `detail`, map `efficient_alternative` to `action` +- Former 
`opportunities[]` items: map `description` to `title`, merge details into `detail`, map `recommendation` to `action`, use severity like `medium-opportunity` + ## Process 1. Read pre-pass JSON at `{quality-report-dir}/execution-deps-prepass.json` diff --git a/src/skills/bmad-agent-builder/agents/quality-scan-prompt-craft.md b/src/skills/bmad-agent-builder/quality-scan-prompt-craft.md similarity index 85% rename from src/skills/bmad-agent-builder/agents/quality-scan-prompt-craft.md rename to src/skills/bmad-agent-builder/quality-scan-prompt-craft.md index 1e9aa45..ee41330 100644 --- a/src/skills/bmad-agent-builder/agents/quality-scan-prompt-craft.md +++ b/src/skills/bmad-agent-builder/quality-scan-prompt-craft.md @@ -20,8 +20,8 @@ Pre-pass provides: line counts, token estimates, section inventories, waste patt Read raw files for judgment calls: - `SKILL.md` — Overview quality, persona context assessment -- `prompts/*.md` — Each capability prompt for craft quality -- `resources/*.md` — Progressive disclosure assessment +- `*.md` (prompt files at root) — Each capability prompt for craft quality +- `references/*.md` — Progressive disclosure assessment --- @@ -54,9 +54,9 @@ A good agent Overview includes: | Scenario | Acceptable Size | Notes | |----------|----------------|-------| -| Multi-capability agent with brief capability sections | Up to ~250 lines | Each capability section brief, detail in prompts/ | +| Multi-capability agent with brief capability sections | Up to ~250 lines | Each capability section brief, detail in prompt files | | Single-purpose agent with deep persona | Up to ~500 lines (~5000 tokens) | Acceptable if content is genuinely needed | -| Agent with large reference tables or schemas inline | Flag for extraction | These belong in resources/, not SKILL.md | +| Agent with large reference tables or schemas inline | Flag for extraction | These belong in references/, not SKILL.md | ### Detecting Over-Optimization (Under-Contextualized Agents) @@ -72,7 
+72,7 @@ A good agent Overview includes: ## Part 2: Capability Prompt Craft -Capability prompts (`prompts/*.md`) are the working instructions for each capability. These should be more procedural than SKILL.md but maintain persona voice consistency. +Capability prompts (prompt `.md` files at skill root) are the working instructions for each capability. These should be more procedural than SKILL.md but maintain persona voice consistency. ### Config Header | Check | Why It Matters | @@ -165,6 +165,12 @@ Do NOT flag these: ## Output Format +Output your findings using the universal schema defined in `references/universal-scan-schema.md`. + +Use EXACTLY these field names: `file`, `line`, `severity`, `category`, `title`, `detail`, `action`. Do not rename, restructure, or add fields to findings. + +Before writing output, verify: Is your array called `findings`? Does every item have `title`, `detail`, `action`? Is `assessments` an object, not items in the findings array? + You will receive `{skill-path}` and `{quality-report-dir}` as inputs. 
Write JSON findings to: `{quality-report-dir}/prompt-craft-temp.json` @@ -173,36 +179,37 @@ Write JSON findings to: `{quality-report-dir}/prompt-craft-temp.json` { "scanner": "prompt-craft", "skill_path": "{path}", - "skill_type_assessment": "simple-utility|domain-expert|companion-interactive|workflow-facilitator", - "skillmd_assessment": { - "overview_quality": "appropriate|excessive|missing|disconnected", - "progressive_disclosure": "good|needs-extraction|monolithic", - "persona_context": "appropriate|excessive|missing", - "notes": "Brief assessment of SKILL.md craft" - }, - "prompts_scanned": 0, - "issues": [ + "findings": [ { - "file": "SKILL.md|prompts/{name}.md", + "file": "SKILL.md|{name}.md", "line": 42, "severity": "critical|high|medium|low|note", "category": "token-waste|anti-pattern|outcome-balance|progression|self-containment|intelligence-placement|overview-quality|progressive-disclosure|under-contextualized|persona-voice|communication-consistency|inline-data", - "issue": "Brief description", - "rationale": "Why this matters for prompt craft", - "fix": "Specific action to resolve", - "nuance": "Optional — why this might be intentional" + "title": "Brief description", + "detail": "Why this matters for prompt craft. 
Include any nuance about why this might be intentional.", + "action": "Specific action to resolve" } ], - "prompt_health": { - "prompts_with_config_header": 0, - "prompts_with_progression_conditions": 0, - "prompts_self_contained": 0, - "total_prompts": 0 + "assessments": { + "skill_type_assessment": "simple-utility|domain-expert|companion-interactive|workflow-facilitator", + "skillmd_assessment": { + "overview_quality": "appropriate|excessive|missing|disconnected", + "progressive_disclosure": "good|needs-extraction|monolithic", + "persona_context": "appropriate|excessive|missing", + "notes": "Brief assessment of SKILL.md craft" + }, + "prompts_scanned": 0, + "prompt_health": { + "prompts_with_config_header": 0, + "prompts_with_progression_conditions": 0, + "prompts_self_contained": 0, + "total_prompts": 0 + } }, "summary": { - "total_issues": 0, + "total_findings": 0, "by_severity": {"critical": 0, "high": 0, "medium": 0, "low": 0, "note": 0}, - "craft_assessment": "Brief 1-2 sentence assessment", + "assessment": "Brief 1-2 sentence assessment", "top_improvement": "Highest-impact improvement" } } @@ -212,8 +219,8 @@ Write JSON findings to: `{quality-report-dir}/prompt-craft-temp.json` 1. Read pre-pass JSON at `{quality-report-dir}/prompt-metrics-prepass.json` 2. Read SKILL.md — assess agent type, evaluate Overview quality, persona context -3. Read all prompt files in prompts/ -4. Check resources/ for progressive disclosure +3. Read all prompt files at skill root +4. Check references/ for progressive disclosure 5. Evaluate Overview quality (present? appropriate? excessive? missing?) 6. Check for over-optimization — is this a complex agent stripped to bare skeleton? 7. 
Check size and progressive disclosure diff --git a/src/skills/bmad-agent-builder/agents/quality-scan-script-opportunities.md b/src/skills/bmad-agent-builder/quality-scan-script-opportunities.md similarity index 89% rename from src/skills/bmad-agent-builder/agents/quality-scan-script-opportunities.md rename to src/skills/bmad-agent-builder/quality-scan-script-opportunities.md index 401c5d8..9e5de21 100644 --- a/src/skills/bmad-agent-builder/agents/quality-scan-script-opportunities.md +++ b/src/skills/bmad-agent-builder/quality-scan-script-opportunities.md @@ -16,8 +16,8 @@ Read every prompt file and SKILL.md. For each instruction that tells the LLM to Find and read: - `SKILL.md` — On Activation patterns, inline operations -- `prompts/*.md` — Each capability prompt for deterministic operations hiding in LLM instructions -- `resources/*.md` — Check if any resource content could be generated by scripts instead +- `*.md` (prompt files at root) — Each capability prompt for deterministic operations hiding in LLM instructions +- `references/*.md` — Check if any resource content could be generated by scripts instead - `scripts/` — Understand what scripts already exist (to avoid suggesting duplicates) --- @@ -188,6 +188,12 @@ For each script opportunity found, also assess: ## Output Format +Output your findings using the universal schema defined in `references/universal-scan-schema.md`. + +Use EXACTLY these field names: `file`, `line`, `severity`, `category`, `title`, `detail`, `action`. Do not rename, restructure, or add fields to findings. + +Before writing output, verify: Is your array called `findings`? Does every item have `title`, `detail`, `action`? Is `assessments` an object, not items in the findings array? + You will receive `{skill-path}` and `{quality-report-dir}` as inputs. 
Write JSON findings to: `{quality-report-dir}/script-opportunities-temp.json` @@ -196,32 +202,25 @@ Write JSON findings to: `{quality-report-dir}/script-opportunities-temp.json` { "scanner": "script-opportunities", "skill_path": "{path}", - "existing_scripts": ["list of scripts that already exist in the agent's scripts/ folder"], "findings": [ { - "file": "SKILL.md|prompts/{name}.md", + "file": "SKILL.md|{name}.md", "line": 42, "severity": "high|medium|low", "category": "validation|extraction|transformation|counting|comparison|structure|graph|preprocessing|postprocessing", - "current_behavior": "What the LLM is currently doing", - "script_alternative": "What a script would do instead", - "determinism_confidence": "certain|high|moderate", - "estimated_token_savings": "tokens saved per invocation", - "implementation_complexity": "trivial|moderate|complex", - "language": "python|bash|either", - "could_be_prepass": false, - "feeds_scanner": "scanner name if applicable", - "reusable_across_skills": false, - "help_pattern_savings": "additional prompt tokens saved by using --help instead of inlining interface" + "title": "What the LLM is currently doing", + "detail": "Determinism confidence: certain|high|moderate. Estimated token savings: N per invocation. Implementation complexity: trivial|moderate|complex. Language: python|bash|either. Could be prepass: yes/no. Feeds scanner: name if applicable. Reusable across skills: yes/no. 
Help pattern savings: additional prompt tokens saved by using --help instead of inlining interface.", + "action": "What a script would do instead" } ], + "assessments": { + "existing_scripts": ["list of scripts that already exist in the agent's scripts/ folder"] + }, "summary": { "total_findings": 0, "by_severity": {"high": 0, "medium": 0, "low": 0}, "by_category": {}, - "total_estimated_token_savings": "aggregate estimate across all findings", - "highest_value_opportunity": "The single biggest win — describe it", - "prepass_opportunities": "How many findings could become pre-pass scripts for LLM scanners" + "assessment": "Brief assessment including total estimated token savings, the single highest-value opportunity, and how many findings could become pre-pass scripts for LLM scanners" } } ``` diff --git a/src/skills/bmad-agent-builder/agents/quality-scan-structure.md b/src/skills/bmad-agent-builder/quality-scan-structure.md similarity index 92% rename from src/skills/bmad-agent-builder/agents/quality-scan-structure.md rename to src/skills/bmad-agent-builder/quality-scan-structure.md index 24fdc1f..e7bceb2 100644 --- a/src/skills/bmad-agent-builder/agents/quality-scan-structure.md +++ b/src/skills/bmad-agent-builder/quality-scan-structure.md @@ -116,6 +116,12 @@ Include all pre-pass findings in your output, preserved as-is. These are determi ## Output Format +Output your findings using the universal schema defined in `references/universal-scan-schema.md`. + +Use EXACTLY these field names: `file`, `line`, `severity`, `category`, `title`, `detail`, `action`. Do not rename, restructure, or add fields to findings. + +Before writing output, verify: Is your array called `findings`? Does every item have `title`, `detail`, `action`? Is `assessments` an object, not items in the findings array? + You will receive `{skill-path}` and `{quality-report-dir}` as inputs. 
Write JSON findings to: `{quality-report-dir}/structure-temp.json` @@ -124,17 +130,18 @@ Write JSON findings to: `{quality-report-dir}/structure-temp.json` { "scanner": "structure", "skill_path": "{path}", - "issues": [ + "findings": [ { - "file": "SKILL.md|bmad-manifest.json|prompts/{name}.md", + "file": "SKILL.md|bmad-manifest.json|{name}.md", "line": 42, "severity": "critical|high|medium|low", "category": "frontmatter|sections|artifacts|manifest|capabilities|identity|communication-style|principles|consistency|memory-setup|headless-mode|activation-sequence", - "issue": "Brief description", - "fix": "Specific action to resolve" + "title": "Brief description", + "detail": "", + "action": "Specific action to resolve" } ], - "metadata": { + "assessments": { "sections_found": ["Overview", "Identity"], "capabilities_count": 0, "has_memory": false, @@ -142,10 +149,10 @@ Write JSON findings to: `{quality-report-dir}/structure-temp.json` "manifest_valid": true }, "summary": { - "total_issues": 0, + "total_findings": 0, "by_severity": {"critical": 0, "high": 0, "medium": 0, "low": 0}, "by_category": {}, - "structure_assessment": "Brief 1-2 sentence assessment" + "assessment": "Brief 1-2 sentence assessment" } } ``` diff --git a/src/skills/bmad-agent-builder/resources/metadata-reference.md b/src/skills/bmad-agent-builder/references/metadata-reference.md similarity index 91% rename from src/skills/bmad-agent-builder/resources/metadata-reference.md rename to src/skills/bmad-agent-builder/references/metadata-reference.md index 73ba3df..4a0b7e7 100644 --- a/src/skills/bmad-agent-builder/resources/metadata-reference.md +++ b/src/skills/bmad-agent-builder/references/metadata-reference.md @@ -36,7 +36,7 @@ description: [5-8 word summary]. [Use when user says 'X' or 'Y'.] "menu-code": "BP", "description": "Builds agents through conversational discovery. 
Outputs to skill folder.", "supports-headless": true, - "prompt": "prompts/build-process.md", + "prompt": "build-process.md", "phase-name": "anytime", "after": ["create-prd"], "before": [], @@ -103,7 +103,7 @@ All module skills MUST use the `bmad-init` skill at startup. ## Path Construction Rules — CRITICAL -Never use `{skill-root}`. Only use `{project-root}` for `_bmad` paths. +Only use `{project-root}` for `_bmad` paths. **Three path types:** - **Skill-internal** — bare relative paths (no prefix) @@ -112,15 +112,15 @@ Never use `{skill-root}`. Only use `{project-root}` for `_bmad` paths. **Correct:** ``` -resources/reference.md # Skill-internal (bare relative) -prompts/capability.md # Skill-internal (bare relative) +references/reference.md # Skill-internal (bare relative) +capability.md # Skill-internal (bare relative) {project-root}/_bmad/_memory/x-sidecar/ # Project _bmad path {output_folder}/report.md # Config var (already has full path) ``` **Never use:** ``` -{skill-root}/resources/reference.md # {skill-root} doesn't exist +../../other-skill/file.md # Cross-skill relative path breaks with reorganization {project-root}/{config_var}/output.md # Double-prefix -./resources/reference.md # Relative prefix breaks context changes +./references/reference.md # Relative prefix breaks context changes ``` diff --git a/src/skills/bmad-agent-builder/resources/quality-dimensions.md b/src/skills/bmad-agent-builder/references/quality-dimensions.md similarity index 79% rename from src/skills/bmad-agent-builder/resources/quality-dimensions.md rename to src/skills/bmad-agent-builder/references/quality-dimensions.md index f79b595..064d17c 100644 --- a/src/skills/bmad-agent-builder/resources/quality-dimensions.md +++ b/src/skills/bmad-agent-builder/references/quality-dimensions.md @@ -22,9 +22,10 @@ Scripts handle plumbing (fetch, transform, validate). Prompts handle judgment (i SKILL.md stays focused. Detail goes where it belongs. 
-- Capability instructions → `prompts/` -- Reference data, schemas, large tables → `resources/` -- Memory discipline → `resources/memory-system.md` +- Capability instructions → prompt files at skill root +- Reference data, schemas, large tables → `references/` +- Templates, starter files → `assets/` +- Memory discipline → `references/memory-system.md` - Multi-capability SKILL.md under ~250 lines: fine as-is - Single-purpose up to ~500 lines: acceptable if focused @@ -32,13 +33,13 @@ SKILL.md stays focused. Detail goes where it belongs. Two parts: `[5-8 word summary]. [Use when user says 'X' or 'Y'.]` -Default to conservative triggering. See `resources/standard-fields.md` for full format and examples. +Default to conservative triggering. See `references/standard-fields.md` for full format and examples. ## 5. Path Construction -Never use `{skill-root}`. Only use `{project-root}` for `_bmad` paths. Config variables used directly — they already contain `{project-root}`. +Only use `{project-root}` for `_bmad` paths. Config variables used directly — they already contain `{project-root}`. -See `resources/standard-fields.md` for correct/incorrect patterns. +See `references/standard-fields.md` for correct/incorrect patterns. ## 6. 
Token Efficiency diff --git a/src/skills/bmad-agent-builder/resources/script-opportunities-reference.md b/src/skills/bmad-agent-builder/references/script-opportunities-reference.md similarity index 97% rename from src/skills/bmad-agent-builder/resources/script-opportunities-reference.md rename to src/skills/bmad-agent-builder/references/script-opportunities-reference.md index d890f95..fecbed0 100644 --- a/src/skills/bmad-agent-builder/resources/script-opportunities-reference.md +++ b/src/skills/bmad-agent-builder/references/script-opportunities-reference.md @@ -1,6 +1,6 @@ # Quality Scan Script Opportunities — Reference Guide -**Reference: `resources/script-standards.md` for script creation guidelines.** +**Reference: `references/script-standards.md` for script creation guidelines.** This document identifies deterministic operations that should be offloaded from the LLM into scripts for quality validation of BMad agents. @@ -119,7 +119,7 @@ All scripts use PEP 723 and `--help`. When a skill's prompt needs to invoke a sc - ## Read Access section exists - ## Write Access section exists - ## Deny Zones section exists (can be empty) -- Paths use placeholders correctly ({project-root} for _bmad paths, relative for skill-internal, no {skill-root}) +- Paths use placeholders correctly ({project-root} for _bmad paths, relative for skill-internal) ``` **Output:** Structured JSON of read/write/deny zones @@ -136,7 +136,7 @@ All scripts use PEP 723 and `--help`. When a skill's prompt needs to invoke a sc **Checks:** ```python -# For each prompt in prompts/: +# For each prompt .md file at skill root: - Has frontmatter (name, description, menu-code) - name matches manifest capability name - menu-code matches manifest (case-insensitive) @@ -145,7 +145,7 @@ All scripts use PEP 723 and `--help`. 
When a skill's prompt needs to invoke a sc **Output:** JSON with mismatches, missing files -**Implementation:** Python, reads bmad-manifest.json and all .md files in prompts/ +**Implementation:** Python, reads bmad-manifest.json and all prompt .md files at skill root --- diff --git a/src/skills/bmad-agent-builder/resources/skill-best-practices.md b/src/skills/bmad-agent-builder/references/skill-best-practices.md similarity index 98% rename from src/skills/bmad-agent-builder/resources/skill-best-practices.md rename to src/skills/bmad-agent-builder/references/skill-best-practices.md index 432a502..67cdeb3 100644 --- a/src/skills/bmad-agent-builder/resources/skill-best-practices.md +++ b/src/skills/bmad-agent-builder/references/skill-best-practices.md @@ -1,6 +1,6 @@ # Skill Authoring Best Practices -Practical patterns for writing effective BMad agent skills. For field definitions and description format, see `resources/standard-fields.md`. For quality dimensions, see `resources/quality-dimensions.md`. +Practical patterns for writing effective BMad agent skills. For field definitions and description format, see `references/standard-fields.md`. For quality dimensions, see `references/quality-dimensions.md`. 
## Core Principle: Informed Autonomy diff --git a/src/skills/bmad-agent-builder/resources/standard-fields.md b/src/skills/bmad-agent-builder/references/standard-fields.md similarity index 100% rename from src/skills/bmad-agent-builder/resources/standard-fields.md rename to src/skills/bmad-agent-builder/references/standard-fields.md diff --git a/src/skills/bmad-agent-builder/resources/template-substitution-rules.md b/src/skills/bmad-agent-builder/references/template-substitution-rules.md similarity index 94% rename from src/skills/bmad-agent-builder/resources/template-substitution-rules.md rename to src/skills/bmad-agent-builder/references/template-substitution-rules.md index b0e4a87..b3bce15 100644 --- a/src/skills/bmad-agent-builder/resources/template-substitution-rules.md +++ b/src/skills/bmad-agent-builder/references/template-substitution-rules.md @@ -43,9 +43,9 @@ Add user's additional questions to the init.md template, replacing `{custom-init ## Path References All generated agents use these paths: -- `prompts/init.md` — First-run setup -- `prompts/{name}.md` — Individual capability prompts -- `resources/memory-system.md` — Memory discipline (if sidecar needed) +- `init.md` — First-run setup +- `{name}.md` — Individual capability prompts +- `references/memory-system.md` — Memory discipline (if sidecar needed) - `bmad-manifest.json` — Capabilities and metadata with menu codes - `scripts/` — Python/shell scripts for deterministic operations (if needed) diff --git a/src/skills/bmad-agent-builder/references/universal-scan-schema.md b/src/skills/bmad-agent-builder/references/universal-scan-schema.md new file mode 100644 index 0000000..11e6df8 --- /dev/null +++ b/src/skills/bmad-agent-builder/references/universal-scan-schema.md @@ -0,0 +1,267 @@ +# Universal Scanner Output Schema + +All quality scanners — both LLM-based and deterministic lint scripts — MUST produce output conforming to this schema. No exceptions. 
+ +## Top-Level Structure + +```json +{ + "scanner": "scanner-name", + "skill_path": "{path}", + "findings": [], + "assessments": {}, + "summary": { + "total_findings": 0, + "by_severity": {}, + "assessment": "1-2 sentence overall assessment" + } +} +``` + +| Key | Type | Required | Description | +|-----|------|----------|-------------| +| `scanner` | string | yes | Scanner identifier (e.g., `"workflow-integrity"`, `"prompt-craft"`) | +| `skill_path` | string | yes | Absolute path to the skill being scanned | +| `findings` | array | yes | ALL items — issues, strengths, suggestions, opportunities. Always an array, never an object | +| `assessments` | object | yes | Scanner-specific structured analysis (cohesion tables, health metrics, user journeys, etc.). Free-form per scanner | +| `summary` | object | yes | Aggregate counts and brief overall assessment | + +## Finding Schema (7 fields) + +Every item in `findings[]` has exactly these 7 fields: + +```json +{ + "file": "SKILL.md", + "line": 42, + "severity": "high", + "category": "frontmatter", + "title": "Brief headline of the finding", + "detail": "Full context — rationale, what was observed, why it matters", + "action": "What to do about it — fix, suggestion, or script to create" +} +``` + +| Field | Type | Required | Description | +|-------|------|----------|-------------| +| `file` | string | yes | Relative path to the affected file (e.g., `"SKILL.md"`, `"scripts/build.py"`). Empty string if not file-specific | +| `line` | int\|null | no | Line number (1-based). `null` or `0` if not line-specific | +| `severity` | string | yes | One of the severity values below | +| `category` | string | yes | Scanner-specific category (e.g., `"frontmatter"`, `"token-waste"`, `"lint"`) | +| `title` | string | yes | Brief headline (1 sentence). This is the primary display text | +| `detail` | string | yes | Full context — fold rationale, observation, impact, nuance into one narrative. 
Empty string if title is self-explanatory | +| `action` | string | yes | What to do — fix instruction, suggestion, or script to create. Empty string for strengths/notes | + +## Severity Values (complete enum) + +``` +critical | high | medium | low | high-opportunity | medium-opportunity | low-opportunity | suggestion | strength | note +``` + +**Routing rules:** +- `critical`, `high` → "Truly Broken" section in report +- `medium`, `low` → category-specific findings sections +- `high-opportunity`, `medium-opportunity`, `low-opportunity` → enhancement/creative sections +- `suggestion` → creative suggestions section +- `strength` → strengths section (positive observations worth preserving) +- `note` → informational observations, also routed to strengths + +## Assessment Sub-Structure Contracts + +The `assessments` object is free-form per scanner, but the HTML report renderer expects specific shapes for specific keys. These are the canonical formats. + +### user_journeys (enhancement-opportunities scanner) + +**Always an array of objects. 
Never an object keyed by persona.** + +```json +"user_journeys": [ + { + "archetype": "first-timer", + "summary": "Brief narrative of this user's experience", + "friction_points": ["moment 1", "moment 2"], + "bright_spots": ["what works well"] + } +] +``` + +### autonomous_assessment (enhancement-opportunities scanner) + +```json +"autonomous_assessment": { + "potential": "headless-ready|easily-adaptable|partially-adaptable|fundamentally-interactive", + "hitl_points": 3, + "auto_resolvable": 2, + "needs_input": 1, + "notes": "Brief assessment" +} +``` + +### top_insights (enhancement-opportunities scanner) + +**Always an array of objects with title/detail/action (same shape as findings but without file/line/severity/category).** + +```json +"top_insights": [ + { + "title": "The key observation", + "detail": "Why it matters", + "action": "What to do about it" + } +] +``` + +### cohesion_analysis (skill-cohesion / agent-cohesion scanner) + +```json +"cohesion_analysis": { + "dimension_name": { "score": "strong|moderate|weak", "notes": "explanation" } +} +``` + +Dimension names are scanner-specific (e.g., `stage_flow_coherence`, `persona_alignment`). The report renderer iterates all keys and renders a table row per dimension. + +### skill_identity / agent_identity (cohesion scanners) + +```json +"skill_identity": { + "name": "skill-name", + "purpose_summary": "Brief characterization", + "primary_outcome": "What this skill produces" +} +``` + +### skillmd_assessment (prompt-craft scanner) + +```json +"skillmd_assessment": { + "overview_quality": "appropriate|excessive|missing", + "progressive_disclosure": "good|needs-extraction|monolithic", + "notes": "brief assessment" +} +``` + +Agent variant adds `"persona_context": "appropriate|excessive|missing"`. 
+ +### prompt_health (prompt-craft scanner) + +```json +"prompt_health": { + "total_prompts": 3, + "with_config_header": 2, + "with_progression": 1, + "self_contained": 3 +} +``` + +### skill_understanding (enhancement-opportunities scanner) + +```json +"skill_understanding": { + "purpose": "what this skill does", + "primary_user": "who it's for", + "assumptions": ["assumption 1", "assumption 2"] +} +``` + +### stage_summary (workflow-integrity scanner) + +```json +"stage_summary": { + "total_stages": 0, + "missing_stages": [], + "orphaned_stages": [], + "stages_without_progression": [], + "stages_without_config_header": [] +} +``` + +### metadata (structure scanner) + +Free-form key-value pairs. Rendered as a metadata block. + +### script_summary (scripts lint) + +```json +"script_summary": { + "total_scripts": 5, + "by_type": {"python": 3, "shell": 2}, + "missing_tests": ["script1.py"] +} +``` + +### existing_scripts (script-opportunities scanner) + +Array of strings (script paths that already exist). + +## Complete Example + +```json +{ + "scanner": "workflow-integrity", + "skill_path": "/path/to/skill", + "findings": [ + { + "file": "SKILL.md", + "line": 12, + "severity": "high", + "category": "frontmatter", + "title": "Missing required 'version' field in frontmatter", + "detail": "The SKILL.md frontmatter is missing the version field. This prevents the manifest generator from producing correct output and breaks version-aware consumers.", + "action": "Add 'version: 1.0.0' to the YAML frontmatter block" + }, + { + "file": "build-process.md", + "line": null, + "severity": "strength", + "category": "design", + "title": "Excellent progressive disclosure pattern in build stages", + "detail": "Each stage provides exactly the context needed without front-loading information. 
This reduces token waste and improves LLM comprehension.", + "action": "" + }, + { + "file": "SKILL.md", + "line": 45, + "severity": "medium-opportunity", + "category": "experience-gap", + "title": "No guidance for first-time users unfamiliar with build workflows", + "detail": "A user encountering this skill for the first time has no onboarding path. The skill assumes familiarity with stage-based workflows, which creates friction for newcomers.", + "action": "Add a 'Getting Started' section or link to onboarding documentation" + } + ], + "assessments": { + "stage_summary": { + "total_stages": 7, + "missing_stages": [], + "orphaned_stages": ["cleanup"] + } + }, + "summary": { + "total_findings": 3, + "by_severity": {"high": 1, "medium-opportunity": 1, "strength": 1}, + "assessment": "Well-structured skill with one critical frontmatter gap. Progressive disclosure is a notable strength." + } +} +``` + +## DO NOT + +- **DO NOT** rename fields. Use exactly: `file`, `line`, `severity`, `category`, `title`, `detail`, `action` +- **DO NOT** use `issues` instead of `findings` — the array is always called `findings` +- **DO NOT** add fields to findings beyond the 7 defined above. Put scanner-specific structured data in `assessments` +- **DO NOT** use separate arrays for strengths, suggestions, or opportunities — they go in `findings` with appropriate severity values +- **DO NOT** change `user_journeys` from an array to an object keyed by persona name +- **DO NOT** restructure assessment sub-objects — use the shapes defined above +- **DO NOT** put free-form narrative data into `assessments` — that belongs in `detail` fields of findings or in `summary.assessment` + +## Self-Check Before Output + +Before writing your JSON output, verify: + +1. Is your array called `findings` (not `issues`, not `opportunities`)? +2. Does every item in `findings` have all 7 fields: `file`, `line`, `severity`, `category`, `title`, `detail`, `action`? +3. 
Are strengths in `findings` with `severity: "strength"` (not in a separate `strengths` array)? +4. Are suggestions in `findings` with `severity: "suggestion"` (not in a separate `creative_suggestions` array)? +5. Is `assessments` an object containing structured analysis data (not items that belong in findings)? +6. Is `user_journeys` an array of objects (not an object keyed by persona)? +7. Do `top_insights` items use `title`/`detail`/`action` (not `insight`/`suggestion`/`why_it_matters`)? diff --git a/src/skills/bmad-agent-builder/report-quality-scan-creator.md b/src/skills/bmad-agent-builder/report-quality-scan-creator.md new file mode 100644 index 0000000..3a0376e --- /dev/null +++ b/src/skills/bmad-agent-builder/report-quality-scan-creator.md @@ -0,0 +1,138 @@ +# Quality Scan Report Creator + +You are a master quality engineer tech writer agent QualityReportBot-9001. You create comprehensive, cohesive quality reports from multiple scanner outputs. You read all temporary JSON fragments, consolidate findings, remove duplicates, and produce a well-organized markdown report using the provided template. You are quality obsessed — nothing gets dropped. You will never attempt to fix anything — you are a writer, not a fixer. + +## Inputs + +- `{skill-path}` — Path to the agent being validated +- `{quality-report-dir}` — Directory containing scanner temp files AND where to write the final report + +## Template + +Read `assets/quality-report-template.md` for the report structure. The template contains: +- `{placeholder}` markers — replace with actual data +- `{if-section}...{/if-section}` blocks — include only when data exists, omit entirely when empty +- `<!-- comment -->` markers — inline guidance for what data to pull and from where; strip from final output + +## Process + +### Step 1: Ingest Everything + +1. Read `assets/quality-report-template.md` +2. List ALL files in `{quality-report-dir}` — both `*-temp.json` (scanner findings) and `*-prepass.json` (structural metrics) +3.
Read EVERY JSON file + +### Step 2: Extract All Data Types + +All scanners now use the universal schema defined in `references/universal-scan-schema.md`. Scanner-specific data lives in `assessments{}`, not as top-level keys. + +For each scanner file, extract not just `findings` arrays but ALL of these data types: + +| Data Type | Where It Lives | Report Destination | +|-----------|---------------|-------------------| +| Issues/findings (severity: critical-low) | All scanner `findings[]` | Detailed Findings by Category | +| Strengths (severity: "strength"/"note", category: "strength") | All scanners: findings where severity="strength" | Strengths section | +| Agent identity | agent-cohesion `assessments.agent_identity` | Agent Identity section + Executive Summary | +| Cohesion dimensional analysis | agent-cohesion `assessments.cohesion_analysis` | Cohesion Analysis table | +| Consolidation opportunities | agent-cohesion `assessments.cohesion_analysis.redundancy_level.consolidation_opportunities` | Consolidation Opportunities in Cohesion | +| Creative suggestions | `findings[]` with severity="suggestion" (no separate creative_suggestions array) | Creative Suggestions in Cohesion section | +| Craft & agent assessment | prompt-craft `assessments.skillmd_assessment` (incl. `persona_context`), `assessments.prompt_health`, `summary.assessment` | Prompt Craft section header + Executive Summary | +| Structure metadata | structure `assessments.metadata` (has_memory, has_headless, manifest_valid, etc.) 
| Structure & Capabilities section header | | User journeys | enhancement-opportunities `assessments.user_journeys[]` | User Journeys section | | Autonomous assessment | enhancement-opportunities `assessments.autonomous_assessment` | Autonomous Readiness section | | Skill understanding | enhancement-opportunities `assessments.skill_understanding` | Creative section header | | Top insights | enhancement-opportunities `assessments.top_insights[]` | Top Insights in Creative section | | Optimization opportunities | `findings[]` with severity ending in "-opportunity" (no separate opportunities array) | Optimization Opportunities in Efficiency section | | Script inventory & token savings | scripts `assessments.script_summary`, script-opportunities `summary` | Scripts sections | | Prepass metrics | `*-prepass.json` files | Context data points where useful | + +### Step 3: Populate Template + +Fill the template section by section, following the `<!-- comment -->` guidance in each. Key rules: + +- **Conditional sections:** Only include `{if-...}` blocks when the data exists. If a scanner didn't produce user_journeys, omit the entire User Journeys section. +- **Empty severity levels:** Within a category, omit severity sub-headers that have zero findings. +- **Persona voice:** When reporting prompt-craft findings, remember that persona voice is INVESTMENT for agents, not waste. Reflect the scanner's nuance field if present. +- **Strip comments:** Remove all `<!-- comment -->` blocks from final output. + +### Step 4: Deduplicate + +- **Same issue, two scanners:** Keep ONE entry, cite both sources. Use the more detailed description. + +- **Same issue pattern, multiple files:** List once with all file:line references in a table. +- **Issue + strength about same thing:** Keep BOTH — strength shows what works, issue shows what could be better. +- **Overlapping creative suggestions:** Merge into the richer description. +- **Routing:** "note"/"strength" severity → Strengths section.
"suggestion" severity → Creative subsection. Do not mix these into issue lists. + +### Step 5: Verification Pass + +**This step is mandatory.** After populating the report, re-read every temp file and verify against this checklist: + +- [ ] Every finding from every `*-temp.json` findings[] array +- [ ] Agent identity block (persona_summary, primary_purpose, capability_count) +- [ ] All findings with severity="strength" from any scanner +- [ ] All positive notes from prompt-craft (severity="note") +- [ ] Cohesion analysis dimensional scores table (if present) +- [ ] Consolidation opportunities from cohesion redundancy analysis +- [ ] Craft assessment, skill type assessment, and persona context assessment +- [ ] Structure metadata (sections_found, has_memory, has_headless, manifest_valid) +- [ ] ALL user journeys with ALL friction_points and bright_spots per archetype +- [ ] The autonomous_assessment block (all fields) +- [ ] All findings with severity="suggestion" from cohesion scanners +- [ ] All findings with severity ending in "-opportunity" from execution-efficiency +- [ ] assessments.top_insights from enhancement-opportunities +- [ ] Script inventory and token savings from script-opportunities +- [ ] Skill understanding (purpose, primary_user, key_assumptions) +- [ ] Prompt health summary from prompt-craft (if prompts exist) + +If any item was dropped, add it to the appropriate section before writing. 
+ +### Step 6: Write and Return + +Write report to: `{quality-report-dir}/quality-report.md` + +Return JSON: + +```json +{ + "report_file": "{full-path-to-report}", + "summary": { + "total_issues": 0, + "critical": 0, + "high": 0, + "medium": 0, + "low": 0, + "strengths_count": 0, + "enhancements_count": 0, + "user_journeys_count": 0, + "overall_quality": "Excellent|Good|Fair|Poor", + "overall_cohesion": "cohesive|mostly-cohesive|fragmented|confused", + "craft_assessment": "brief summary from prompt-craft", + "truly_broken_found": true, + "truly_broken_count": 0 + }, + "by_category": { + "structure_capabilities": {"critical": 0, "high": 0, "medium": 0, "low": 0}, + "prompt_craft": {"critical": 0, "high": 0, "medium": 0, "low": 0}, + "execution_efficiency": {"critical": 0, "high": 0, "medium": 0, "low": 0}, + "path_script_standards": {"critical": 0, "high": 0, "medium": 0, "low": 0}, + "agent_cohesion": {"critical": 0, "high": 0, "medium": 0, "low": 0}, + "creative": {"high_opportunity": 0, "medium_opportunity": 0, "low_opportunity": 0} + }, + "high_impact_quick_wins": [ + {"issue": "description", "file": "location", "effort": "low"} + ] +} +``` + +## Scanner Reference + +| Scanner | Temp File | Primary Category | +|---------|-----------|-----------------| +| structure | structure-temp.json | Structure & Capabilities | +| prompt-craft | prompt-craft-temp.json | Prompt Craft | +| execution-efficiency | execution-efficiency-temp.json | Execution Efficiency | +| path-standards | path-standards-temp.json | Path & Script Standards | +| scripts | scripts-temp.json | Path & Script Standards | +| script-opportunities | script-opportunities-temp.json | Script Opportunities | +| agent-cohesion | agent-cohesion-temp.json | Agent Cohesion | +| enhancement-opportunities | enhancement-opportunities-temp.json | Creative | diff --git a/src/skills/bmad-agent-builder/scripts/bmad-manifest-schema.json b/src/skills/bmad-agent-builder/scripts/bmad-manifest-schema.json index 
90e66db..ea674b5 100644 --- a/src/skills/bmad-agent-builder/scripts/bmad-manifest-schema.json +++ b/src/skills/bmad-agent-builder/scripts/bmad-manifest-schema.json @@ -61,7 +61,7 @@ }, "prompt": { - "description": "Relative path to the prompt file for internal capabilities (e.g., prompts/build-process.md). Omit if handled by SKILL.md directly or if this is an external skill call.", + "description": "Relative path to the prompt file for internal capabilities (e.g., build-process.md). Omit if handled by SKILL.md directly or if this is an external skill call.", "type": "string" }, "skill-name": { diff --git a/src/skills/bmad-agent-builder/scripts/generate-html-report.py b/src/skills/bmad-agent-builder/scripts/generate-html-report.py new file mode 100644 index 0000000..a8614db --- /dev/null +++ b/src/skills/bmad-agent-builder/scripts/generate-html-report.py @@ -0,0 +1,1002 @@ +# /// script +# requires-python = ">=3.9" +# /// + +#!/usr/bin/env python3 +""" +Generate an interactive HTML quality report from scanner temp JSON files. + +Reads all *-temp.json and *-prepass.json files from a quality scan output +directory, normalizes findings into a unified data model, and produces a +self-contained HTML report with: + - Collapsible sections with severity filter badges + - Per-item copy-prompt buttons + - Multi-select batch prompt generator + - Executive summary with severity counts + +Usage: + python3 generate-html-report.py {quality-report-dir} [--open] [--skill-path /path/to/skill] + +The --skill-path is embedded in the prompt context so generated prompts +reference the correct location. If omitted, it is read from the first +temp JSON that contains a skill_path field. 
+""" + +from __future__ import annotations + +import argparse +import json +import platform +import subprocess +import sys +from datetime import datetime, timezone +from pathlib import Path + + +# ============================================================================= +# Normalization — diverse scanner JSONs → unified item model +# ============================================================================= + +SEVERITY_RANK = { + 'critical': 0, 'high': 1, 'medium': 2, 'low': 3, + 'high-opportunity': 1, 'medium-opportunity': 2, 'low-opportunity': 3, + 'note': 4, 'strength': 5, 'suggestion': 4, 'info': 5, +} + +# Map scanner names to report sections +SCANNER_SECTIONS = { + 'workflow-integrity': 'structural', + 'structure': 'structure-capabilities', + 'prompt-craft': 'prompt-craft', + 'execution-efficiency': 'efficiency', + 'skill-cohesion': 'cohesion', + 'agent-cohesion': 'cohesion', + 'path-standards': 'quality', + 'scripts': 'scripts', + 'script-opportunities': 'script-opportunities', + 'enhancement-opportunities': 'creative', +} + +SECTION_LABELS = { + 'structural': 'Structural', + 'structure-capabilities': 'Structure & Capabilities', + 'prompt-craft': 'Prompt Craft', + 'efficiency': 'Efficiency', + 'cohesion': 'Cohesion', + 'quality': 'Path & Script Standards', + 'scripts': 'Scripts', + 'script-opportunities': 'Script Opportunities', + 'creative': 'Creative & Enhancements', +} + + +def _coalesce(*values) -> str: + """Return the first truthy string value, or empty string.""" + for v in values: + if v and isinstance(v, str) and v.strip() and v.strip() not in ('N/A', 'n/a', 'None'): + return v.strip() + return '' + + +def _norm_severity(sev: str) -> str: + """Normalize severity to lowercase, handle variants.""" + if not sev: + return 'low' + s = sev.strip().lower() + # Map common variants + return { + 'high-opportunity': 'high-opportunity', + 'medium-opportunity': 'medium-opportunity', + 'low-opportunity': 'low-opportunity', + }.get(s, s) + + +def 
normalize_finding(f: dict, scanner: str, idx: int) -> dict: + """ + Normalize a single finding/issue dict into the unified item model. + + Handles all known field name variants across scanners: + Title: issue | title | description (fallback) + Desc: description | rationale | observation | insight | scenario | + current_behavior | current_pattern | context | nuance + Action: fix | recommendation | suggestion | suggested_approach | + efficient_alternative | script_alternative + File: file | location | current_location + Line: line | lines + Cat: category | dimension + Impact: user_impact | impact | estimated_savings | estimated_token_savings + """ + sev = _norm_severity(f.get('severity', 'low')) + section = SCANNER_SECTIONS.get(scanner, 'other') + + # Determine item type from severity + if sev in ('strength', 'note') or f.get('category') == 'strength': + item_type = 'strength' + action_type = 'none' + selectable = False + elif sev.endswith('-opportunity'): + item_type = 'enhancement' + action_type = 'enhance' + selectable = True + elif f.get('category') == 'suggestion' or sev == 'suggestion': + item_type = 'suggestion' + action_type = 'refactor' + selectable = True + else: + item_type = 'issue' + action_type = 'fix' + selectable = True + + # --- Title: prefer 'title', fall back to old field names --- + title = _coalesce( + f.get('title'), + f.get('issue'), + _truncate(f.get('scenario', ''), 150), + _truncate(f.get('current_behavior', ''), 150), + _truncate(f.get('description', ''), 150), + f.get('observation', ''), + ) + if not title: + title = f.get('id', 'Finding') + + # --- Detail/description: prefer 'detail', fall back to old field names --- + description = _coalesce(f.get('detail')) + if not description: + # Backward compat: coalesce old field names + desc_candidates = [] + for key in ('description', 'rationale', 'observation', 'insight', 'scenario', + 'current_behavior', 'current_pattern', 'context', 'nuance', + 'assessment'): + v = f.get(key) + if v and 
isinstance(v, str) and v.strip() and v != title: + desc_candidates.append(v.strip()) + description = ' '.join(desc_candidates) if desc_candidates else '' + + # --- Action: prefer 'action', fall back to old field names --- + action = _coalesce( + f.get('action'), + f.get('fix'), + f.get('recommendation'), + f.get('suggestion'), + f.get('suggested_approach'), + f.get('efficient_alternative'), + f.get('script_alternative'), + ) + + # --- File reference --- + file_ref = _coalesce( + f.get('file'), + f.get('location'), + f.get('current_location'), + ) + + # --- Line reference --- + line = f.get('line') + if line is None: + lines_str = f.get('lines') + if lines_str: + line = str(lines_str) + + # --- Category --- + category = _coalesce( + f.get('category'), + f.get('dimension'), + ) + + # --- Impact (backward compat only - new schema folds into detail) --- + impact = _coalesce( + f.get('user_impact'), + f.get('impact'), + f.get('estimated_savings'), + str(f.get('estimated_token_savings', '')) if f.get('estimated_token_savings') else '', + ) + + # --- Extra fields for specific scanners --- + extra = {} + if scanner == 'script-opportunities': + action_type = 'create-script' + for k in ('determinism_confidence', 'implementation_complexity', + 'language', 'could_be_prepass', 'reusable_across_skills'): + if k in f: + extra[k] = f[k] + + # Use scanner-provided id if available + item_id = f.get('id', f'{scanner}-{idx:03d}') + + return { + 'id': item_id, + 'scanner': scanner, + 'section': section, + 'type': item_type, + 'severity': sev, + 'rank': SEVERITY_RANK.get(sev, 3), + 'category': category, + 'file': file_ref, + 'line': line, + 'title': title, + 'description': description, + 'action': action, + 'impact': impact, + 'extra': extra, + 'selectable': selectable, + 'action_type': action_type, + } + + +def _truncate(text: str, max_len: int) -> str: + """Truncate text to max_len, breaking at sentence boundary if possible.""" + if not text: + return '' + text = text.strip() + if 
len(text) <= max_len: + return text + # Try to break at sentence boundary + for end in ('. ', '.\n', ' — ', '; '): + pos = text.find(end) + if 0 < pos < max_len: + return text[:pos + 1].strip() + return text[:max_len].strip() + '...' + + +def normalize_scanner(data: dict) -> tuple[list[dict], dict]: + """ + Normalize a full scanner JSON into (items, meta). + Returns list of normalized items + dict of meta/assessment data. + Handles all known scanner output variants. + """ + scanner = data.get('scanner', 'unknown') + items = [] + meta = {} + + # New schema: findings[]. Backward compat: issues[] or findings[] + findings = data.get('findings') or data.get('issues') or [] + for idx, f in enumerate(findings): + items.append(normalize_finding(f, scanner, idx)) + + # Backward compat: opportunities[] (execution-efficiency had separate array) + for idx, opp in enumerate(data.get('opportunities', []), start=len(findings)): + opp_item = normalize_finding(opp, scanner, idx) + opp_item['type'] = 'enhancement' + opp_item['action_type'] = 'enhance' + opp_item['selectable'] = True + items.append(opp_item) + + # Backward compat: strengths[] (old cohesion scanners — plain strings) + for idx, s in enumerate(data.get('strengths', [])): + text = s if isinstance(s, str) else (s.get('title', '') if isinstance(s, dict) else str(s)) + desc = '' if isinstance(s, str) else (s.get('description', s.get('detail', '')) if isinstance(s, dict) else '') + items.append({ + 'id': f'{scanner}-str-{idx:03d}', + 'scanner': scanner, + 'section': SCANNER_SECTIONS.get(scanner, 'cohesion'), + 'type': 'strength', + 'severity': 'strength', + 'rank': 5, + 'category': 'strength', + 'file': '', + 'line': None, + 'title': text, + 'description': desc, + 'action': '', + 'impact': '', + 'extra': {}, + 'selectable': False, + 'action_type': 'none', + }) + + # Backward compat: creative_suggestions[] (old cohesion scanners) + for idx, cs in enumerate(data.get('creative_suggestions', [])): + if isinstance(cs, str): + 
def build_journeys(data: dict) -> list[dict]:
    """
    Extract user journey data from the enhancement-opportunities scanner.

    Handles two formats:
      - Array of objects:
        ``[{archetype, journey_summary, friction_points, bright_spots}]``
      - Object keyed by persona:
        ``{first_timer: {entry_friction, mid_flow_resilience, exit_satisfaction}}``

    Args:
        data: Parsed scanner JSON; reads the ``user_journeys`` key.

    Returns:
        List of dicts, each with keys ``archetype``, ``journey_summary``,
        ``friction_points``, ``bright_spots``. Empty list when absent.
    """
    journeys_raw = data.get('user_journeys')
    if not journeys_raw:
        return []

    # Format 1: already a list — normalize field names
    if isinstance(journeys_raw, list):
        normalized = []
        for j in journeys_raw:
            if isinstance(j, dict):
                normalized.append({
                    'archetype': j.get('archetype', 'unknown'),
                    'journey_summary': j.get('summary', j.get('journey_summary', '')),
                    'friction_points': j.get('friction_points', []),
                    'bright_spots': j.get('bright_spots', []),
                })
            else:
                normalized.append(j)
        return normalized

    # Format 2: object keyed by persona name
    if isinstance(journeys_raw, dict):
        negative_hints = ('friction', 'issue', 'problem', 'gap', 'pain')
        positive_hints = ('bright', 'strength', 'satisfaction', 'delight')
        result = []
        for persona, details in journeys_raw.items():
            if isinstance(details, dict):
                journey = {
                    'archetype': persona.replace('_', ' ').title(),
                    'journey_summary': '',
                    'friction_points': [],
                    'bright_spots': [],
                }
                for key, val in details.items():
                    key_l = key.lower()
                    is_neg = any(h in key_l for h in negative_hints)
                    is_pos = any(h in key_l for h in positive_hints)
                    if isinstance(val, str):
                        # Heuristic: negative-sounding keys → friction,
                        # positive → bright, neutral → summary parts.
                        if is_neg:
                            journey['friction_points'].append(val)
                        elif is_pos:
                            journey['bright_spots'].append(val)
                        elif journey['journey_summary']:
                            journey['journey_summary'] += f' | {key}: {val}'
                        else:
                            journey['journey_summary'] = f'{key}: {val}'
                    elif isinstance(val, list):
                        # BUG FIX: list values used to be dumped into
                        # friction_points unconditionally, misfiling e.g. a
                        # 'bright_spots' list; apply the same key heuristic.
                        bucket = 'bright_spots' if (is_pos and not is_neg) else 'friction_points'
                        for item in val:
                            if isinstance(item, str):
                                journey[bucket].append(item)
                # Build summary from all string fields if not yet set
                if not journey['journey_summary']:
                    parts = [f'**{k.replace("_", " ").title()}:** {v}'
                             for k, v in details.items() if isinstance(v, str)]
                    journey['journey_summary'] = ' | '.join(parts) if parts else str(details)
                result.append(journey)
            elif isinstance(details, str):
                result.append({
                    'archetype': persona.replace('_', ' ').title(),
                    'journey_summary': details,
                    'friction_points': [],
                    'bright_spots': [],
                })
        return result

    return []
def load_report_data(report_dir: Path, skill_path: str | None) -> dict:
    """Load all temp/prepass JSONs from *report_dir* and assemble report data.

    Reads every ``*.json`` file in the directory; ``*-temp*`` files are
    normalized into report items (via :func:`normalize_scanner`), while
    ``*-prepass*`` files are stashed raw under a ``prepass-`` meta key.
    Malformed or unreadable files are skipped (best-effort aggregation).

    Args:
        report_dir: Directory containing scanner output JSON files.
        skill_path: Optional explicit skill path; when None it is detected
            from the first scanner payload exposing ``skill_path``/``agent_path``.

    Returns:
        Dict with keys ``meta``, ``executive_summary``, ``items``,
        ``journeys``, ``assessments``, ``section_labels``.
    """
    all_items = []
    all_meta = {}
    journeys = []
    detected_skill_path = skill_path

    # Read all JSON files
    json_files = sorted(report_dir.glob('*.json'))
    for jf in json_files:
        try:
            data = json.loads(jf.read_text(encoding='utf-8'))
        except (json.JSONDecodeError, OSError):
            continue

        if not isinstance(data, dict):
            continue

        scanner = data.get('scanner', jf.stem.replace('-temp', '').replace('-prepass', ''))

        # Detect skill path from scanner data
        if not detected_skill_path:
            detected_skill_path = data.get('skill_path') or data.get('agent_path')

        # Only normalize temp files (not prepass)
        if '-temp' in jf.name or jf.name in ('path-standards-temp.json', 'scripts-temp.json'):
            items, meta = normalize_scanner(data)
            all_items.extend(items)
            all_meta[scanner] = meta

            if scanner == 'enhancement-opportunities':
                journeys = build_journeys(data)
        elif '-prepass' in jf.name:
            all_meta[f'prepass-{scanner}'] = data

    # Sort items: severity rank first, then section
    all_items.sort(key=lambda x: (x['rank'], x['section']))

    # Build severity counts (issues only; enhancements/strengths counted below)
    counts = {'critical': 0, 'high': 0, 'medium': 0, 'low': 0}
    for item in all_items:
        if item['type'] == 'issue' and item['severity'] in counts:
            counts[item['severity']] += 1

    enhancement_count = sum(1 for i in all_items if i['type'] == 'enhancement')
    strength_count = sum(1 for i in all_items if i['type'] == 'strength')
    total_issues = sum(counts.values())

    # Quality grade heuristic: any critical → Poor; many highs → Fair; etc.
    if counts['critical'] > 0:
        grade = 'Poor'
    elif counts['high'] > 2:
        grade = 'Fair'
    elif counts['high'] > 0 or counts['medium'] > 5:
        grade = 'Good'
    else:
        grade = 'Excellent'

    # Extract assessments for display
    assessments = {}
    for scanner_key, meta in all_meta.items():
        for akey in ('cohesion_analysis', 'autonomous_assessment', 'skill_understanding',
                     'agent_identity', 'skill_identity', 'prompt_health',
                     'skillmd_assessment', 'top_insights'):
            if akey in meta:
                assessments[akey] = meta[akey]
        s = meta.get('summary')
        # BUG FIX: guard with isinstance — when 'summary' is a plain string,
        # `'craft_assessment' in s` is a substring test and s['craft_assessment']
        # raises TypeError (string indices must be integers).
        if isinstance(s, dict):
            if 'craft_assessment' in s:
                assessments['craft_assessment'] = s['craft_assessment']
            if 'overall_cohesion' in s:
                assessments['overall_cohesion'] = s['overall_cohesion']

    # Skill name from path
    sp = detected_skill_path or str(report_dir)
    skill_name = Path(sp).name

    return {
        'meta': {
            'skill_name': skill_name,
            'skill_path': detected_skill_path or '',
            'timestamp': datetime.now(timezone.utc).isoformat(),
            'scanner_count': len([f for f in json_files if '-temp' in f.name]),
            'report_dir': str(report_dir),
        },
        'executive_summary': {
            'total_issues': total_issues,
            'counts': counts,
            'enhancement_count': enhancement_count,
            'strength_count': strength_count,
            'grade': grade,
            'craft_assessment': assessments.get('craft_assessment', ''),
            'overall_cohesion': assessments.get('overall_cohesion', ''),
        },
        'items': all_items,
        'journeys': journeys,
        'assessments': assessments,
        'section_labels': SECTION_LABELS,
    }
+

Quality Report:

+
+ +
+ +
+ +
+ + + + + + + +""" + + +def generate_html(report_data: dict) -> str: + """Inject report data into the HTML template.""" + data_json = json.dumps(report_data, indent=None, ensure_ascii=False) + # Embed the JSON as a script tag before the main script + data_tag = f'' + # Insert before the main + +""" + + +def generate_html(report_data: dict) -> str: + """Inject report data into the HTML template.""" + data_json = json.dumps(report_data, indent=None, ensure_ascii=False) + # Embed the JSON as a script tag before the main script + data_tag = f'' + # Insert before the main