
Quest: Multi-Agent AI Orchestration with Human Oversight

Stop blaming the model. Fix the process.

Quest is a portable framework that coordinates specialized AI agents (planner, reviewers, builder) in isolated contexts with human approval gates. Two different models (Claude + GPT) review independently, an arbiter filters noise, and you approve before anything gets built.

Copy it into any repo. Tear it apart and study it. It's built for learning, experimentation, and real work.

Part of the Candid Talent Edge initiative by KjellKod

Watch the Quest Demo | Read the Honest Analysis | View the Portfolio Dashboard

Adventurers in our quest

How It Works

You → Planner → Reviewers → Arbiter ──→ Builder → Reviewers → Arbiter → Done
               (Claude)       │    ▲                (Claude)      │         ▲
               (Codex)        │    │                (Codex)       ▼         │
               iterate ───────┘    │                Fixer ────────┘         │
                                   │                                        │
                          GATE: you approve                        GATE: you approve

Where you spend your time: The beginning and the end. You shape the plan, approve it, then validate the built result. Most follow-up quests and v2 ideas come from this post-build validation, not from planning, because you don't fully understand a feature until you see it built. Quest handles the middle.

For lighter tasks, solo mode uses a single reviewer: same pipeline, fewer stages, faster turnaround.

Quick Start

# Download and run the installer
curl -fsSL https://raw.githubusercontent.com/KjellKod/quest/main/scripts/quest_installer.sh -o quest_installer.sh
chmod +x quest_installer.sh
./quest_installer.sh          # preview first with --check

Then start a quest:

claude
/quest "Add a loading skeleton to the user list"

That's it. Quest evaluates complexity, asks clarifying questions if needed, and routes to solo or full workflow. You approve at each gate.

Recommended: Add Codex CLI for dual-model reviews. See the Setup Guide for full instructions including Codex as orchestrator (BETA). To use Quest as a global Codex skill outside a specific repo, see Installing Quest for Codex.

Writing a Good Brief

Quest enforces spec → plan → build. You can start rough; Quest asks clarifying questions to fill the gaps.

| Input level | What you provide | What happens |
| --- | --- | --- |
| Rough idea | "add dark mode" | Quest asks questions (max 10), then plans |
| Idea with context | "add dark mode, persist in localStorage, respect OS preference, toggle in header" | Plans with clear direction |
| Structured spec | Doc with intent, constraints, acceptance criteria | Tight plan on first pass |

Say "just go with it" anytime to skip questions and proceed with assumptions. See the Input Routing Guide for details.
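A structured spec can be as simple as a short file stating intent, constraints, and acceptance criteria. A minimal sketch — the file path, name, and wording below are illustrative, not a format Quest requires:

```shell
# Hypothetical spec file; path and field labels are illustrative only.
mkdir -p docs/specs
cat > docs/specs/dark-mode.md <<'EOF'
Intent: add a dark mode toggle to the header
Constraints: persist the choice in localStorage; respect the OS preference
Acceptance: toggle switches theme without reload; choice survives refresh
EOF
cat docs/specs/dark-mode.md
```

You would then hand the file to Quest the same way the examples below point at docs or tickets.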

What You Can Do

# Scale from simple to complex
/quest "Add a loading spinner to the save button"
/quest "Implement user preferences with localStorage, follow idea document <path> and our RFC ..."
/quest "Build a real-time collaboration system, plan phases first, don't implement yet"

# Resume, redirect, swap models
/quest feature-x_2026-02-04__1430
/quest feature-x_2026-02-04__1430 "re-plan using only claude"
/quest feature-x_2026-02-04__1430 "re-plan using gpt-5.2"

# Point to specs, tickets, or docs
/quest "implement docs/specs/notifications.md"
/quest "implement PROJ-1234"                      # with Jira MCP

# Generate competing plans and pick the best
/quest "migrate to SQLite, zero-downtime, dual-write pattern"
/quest "migrate to SQLite, minimal changes, feature-flag cutover"

Abort anytime, resume later. State persists in .quest/<id>/state.json.
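Because state is a plain JSON file, you can inspect it between sessions. A sketch of what that looks like — the field names here are invented for illustration; the real schema is defined by Quest:

```shell
# Illustrative only: the quest id matches the examples above, but the
# JSON fields are hypothetical, not Quest's documented schema.
mkdir -p .quest/feature-x_2026-02-04__1430
cat > .quest/feature-x_2026-02-04__1430/state.json <<'EOF'
{
  "quest_id": "feature-x_2026-02-04__1430",
  "stage": "plan_review",
  "mode": "full"
}
EOF
cat .quest/feature-x_2026-02-04__1430/state.json
```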

For advanced patterns (phased execution, plan comparison, model mixing), see the Quest Presentation.

The Agents

| Role | Default model | What it does |
| --- | --- | --- |
| Planner | Claude | Explores the codebase and writes the implementation plan |
| Reviewer A | Claude | Reviews plans and code from one perspective |
| Reviewer B | GPT-5.4 | Reviews independently, different model, different blind spots |
| Arbiter | Claude | Synthesizes reviews, filters nitpicks, decides approve or iterate |
| Builder | GPT-5.4 | Implements the approved plan, runs tests, produces PR description |
| Fixer | GPT-5.4 | Surgical fixes from review feedback without rebuilding |

These defaults work with Claude (Sonnet or Opus) or GPT-5.x (5.2 or later) as the orchestrator, and across runtimes: Claude Code, Codex CLI, or Cursor IDE.

Every role is swappable. Update models in .ai/allowlist.json to reassign roles, or just ask the orchestrator mid-quest to swap models. Want GPT as your planner and Claude as reviewer? Kimi as arbiter? Try it. With the installer setup you can mix and match any models you prefer. See the OpenCode Field Notes for tested configurations across 30+ models.
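A sketch of the kind of reassignment this means — the actual .ai/allowlist.json schema is Quest's own and may differ; the key names and model ids below are illustrative assumptions:

```shell
# Hypothetical allowlist: keys and model ids are illustrative, not
# Quest's documented schema. This reassigns planning to GPT and
# review/arbitration to other models.
mkdir -p .ai
cat > .ai/allowlist.json <<'EOF'
{
  "planner": "gpt-5.4",
  "reviewer_a": "claude-opus",
  "reviewer_b": "claude-sonnet",
  "arbiter": "kimi",
  "builder": "gpt-5.4",
  "fixer": "gpt-5.4"
}
EOF
cat .ai/allowlist.json
```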

Solo mode skips Reviewer B and the Arbiter. Same pipeline, just faster.

Key Features

  • Artifact-driven: agents communicate through written artifacts, not conversation. No chat history, no accumulated drift, no hallucinated context. Each agent reads evidence and produces evidence.
  • Clean context: each agent starts fresh with only the artifacts it needs.
  • Dual-model review: different models catch different blind spots.
  • Human gates: you approve before anything gets built.
  • Smart routing: evaluates complexity and risk, routes to solo or full workflow.
  • Smart intake: asks structured questions when your input needs more detail.
  • Full audit trail: every artifact saved in .quest/.
  • Multi-runtime: runs from Claude Code (/quest), Codex ($quest), or OpenCode.

Philosophy

Autonomy is earned through constraints, not granted by capability. Context contamination is a system failure, not a user habit. Speed without rigor only accelerates failure.

Engineering principles baked into every agent:

  • KISS — Prefer simple solutions over clever ones
  • DRY — Extract common patterns, but not prematurely
  • YAGNI — Don't add features until they're needed
  • SRP — Each change, function, or module should do one thing

These aren't guidelines — they're the first thing every agent reads. AGENTS.md shapes how agents think, plan, review, and build. The process enforces the philosophy; the philosophy produces the quality.

Quest is built on a conviction: scaling AI output without scaling engineering discipline is a dead end. We don't trust single outputs, human or machine. We trust repeatable processes backed by evidence. The system makes correct behavior easy and incorrect behavior hard.

We're not replacing human judgment. We're amplifying it.

Read the full philosophy.

Install Options

Per repo (recommended): Use the installer above.

Workspace umbrella (multi-repo): Install once in a parent directory, all repos underneath inherit Quest's config. See the Setup Guide.

Manual copy: Grab .ai/, .skills/, .agents/, .claude/, .cursor/, .codex/, .opencode/, AGENTS.md, and DOCUMENTATION_STRUCTURE.md. See the Setup Guide.
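The manual copy is just a recursive copy of those paths from a local quest checkout into your repo. A sketch, assuming QUEST_SRC points at your clone (the stand-in tree below only exists so the sketch runs end to end):

```shell
# Sketch only: point QUEST_SRC at a real quest clone. The stand-in
# source tree created below is just so this demo is self-contained.
QUEST_SRC="${QUEST_SRC:-/tmp/quest-src}"
ITEMS=".ai .skills .agents .claude .cursor .codex .opencode AGENTS.md DOCUMENTATION_STRUCTURE.md"

# Create a stand-in source tree (skip this against a real clone).
for item in $ITEMS; do
  case "$item" in
    *.md) mkdir -p "$QUEST_SRC" && touch "$QUEST_SRC/$item" ;;
    *)    mkdir -p "$QUEST_SRC/$item" ;;
  esac
done

# The actual manual copy into the current repo.
for item in $ITEMS; do
  cp -r "$QUEST_SRC/$item" .
done
```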

Documentation

License

Public Domain (Unlicense). No warranty. See LICENSE.
