Onboarding template overhaul + 6 evaluation-driven iterations#11
Open
davewaring wants to merge 6 commits intomainfrom
Open
Onboarding template overhaul + 6 evaluation-driven iterations#11davewaring wants to merge 6 commits intomainfrom
davewaring wants to merge 6 commits intomainfrom
Conversation
Starter pack templates (17 files):
- Bootstrap prompt: personal advisor identity, template-as-north-star
methodology, landscape-first, confirm-as-you-go, empowerment, owner
profile, no-generic-advice differentiator
- Domain AGENT.md files: self-contained expertise per domain, knowledge
literacy sections, blind spots, interview branches, domain tone
- Spec placeholders: 5-section structure with user stories
("As [who], I want [what], so that [why]" + success criteria)
- Plan placeholders: phased roadmap replacing flat milestones
- BD+1 AGENT.md: aligned with new methodology
Bug fixes:
- Project context injection in system prompt (gateway/server.ts) — AI
now knows which project it's in, stays scoped to that domain
- Chat separation fix (useGatewayChat.ts) — draft conversations start
empty instead of showing previous project's messages
- Model resolution (adapters/index.ts) — per-provider defaults only,
never falls back to global default_model across provider boundaries
- Provider switch sync (gateway/server.ts) — switching providers syncs
default_model to new provider's saved model
Source: BrainDrive-Library onboarding project (D22-D30)
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
All 5 domain spec templates updated. "What's Still Missing" now has two tiers: - "Before the plan is complete" — gaps that could change the direction - "Worth exploring later" — interesting threads, not urgent Verified via relationships Katie B re-run: advisor correctly separated path-changing gaps (Maya risk, other work connections, Sunday triggers) from optional threads (mom calls, college friendships, initiator origins). Each gap now explains why it's at its tier. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
…tency) All 5 domain plan templates updated with three fixes: - Roadmap: "Lead each phase with what life looks like when it's done" - Destination: "Use the owner's own words — don't editorialize or reframe" - What Needs More Work: respect spec gap tiers — only include "Before the plan is complete" items, not "Worth exploring later" Addresses issues found in 15 automated persona runs where: - Destination sections drifted into advisor editorial vs owner's words - Later phases were task-heavy without emotional pull - Plans included items the spec flagged as optional Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
…d fixes Spec templates (all 5 domains): - Added "Use the owner's exact words — quote, don't paraphrase" to What You Want section. Addresses vague personas' phrases getting paraphrased away in generated specs. Domain AGENT.md additions (from 15-run evaluation findings): - Finance: life-transition must-asks (insurance, tax, deadlines), emotional spending triggers probe deeper, retirement account magnitude - Career: burnout/escape branch flags trauma as plan prerequisite, not afterthought - Fitness: post-injury fear flags mental health support as Phase 1 path-changing gap - New-project: ask about support systems during interview for sensitive topics, probe connections between multiple interests Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Bootstrap AGENT.md:
- User stories are the most important output — specific enough that
the owner reads them and thinks "yes, that's exactly what I want"
- Confirm user stories before writing the plan — the plan is built
to serve the stories, so they must be right first
All 5 spec templates:
- "What You Want" restructured: user stories are the core of the spec,
everything else supports them
- Examples rewritten with deeper motivation and personal success
criteria (not just functional goals)
- "Where You Are" and "What's In The Way" explicitly tied back to
the user stories ("the context that shapes each goal above")
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Two persistent issues from v1-v3 evaluations, now fixed: 1. Roadmap phases must only serve user stories from the spec. Items the spec flagged as "Worth exploring later" don't get phases. (Previously, plans were building phases for optional threads.) 2. Destination section quotes the owner's success criteria from their user stories — "quote them back, don't rewrite in your voice." (Previously, destination drifted into advisor editorial.) Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Summary
Complete overhaul of the onboarding starter pack templates, driven by 45 automated persona evaluations (15 runs × 3 rounds). Every change was validated by re-running personas and comparing before/after output.
What changed
Bootstrap AGENT.md:
All 5 domain spec templates (finance, fitness, career, relationships, new-project):
All 5 domain plan templates:
All 4 domain AGENT.md files (domain-specific):
Gateway + BD+1:
Evaluation results
Test plan
🤖 Generated with Claude Code