sean-agentic-orchestrator
Claude Code skill for orchestrating complex engineering work with atomic task decomposition, verification gates, and cost-aware model routing
🤖 Agentic Engineering Orchestrator
A Claude Code skill that guarantees your engineering work is done — not just written.
Atomic task decomposition · 9-point verification gates · Advisor pattern · Code reuse enforcement · Anti-drift checks · Cost-aware model routing · Self-healing repair loops · Context engineering
🧠 What It Does
When you give Claude Code a large engineering goal, this skill activates and:
- 🔍 Decomposes the goal into atomic, testable tasks with task-type hints (bug fix, feature, refactor, perf, security)
- ⚡ Auto-detects your stack — language, framework, test runner, build system, linter
- 💰 Routes each task to the cheapest model tier capable of handling it (saves tokens)
- 🧠 Advisor pattern — cheap executor consults Opus at critical decision points for higher accuracy at lower cost
- 🔁 Executes with smart parallel/sequential decisions — explicit rules prevent wasted multi-agent token burn
- ♻️ Reuse-first — Reader scans for existing components before Builder writes; QA rejects duplicates
- ✅ Verifies every task through a 9-point Done Gate (build, tests, lint, regression, security, QA…)
- 🛠️ Self-heals — failures trigger repair loops with convergence detection and smart termination
- 🧭 Anti-drift checks — Leader verifies goal alignment every 3 tasks, catches scope creep before it wastes tokens
- 📊 Periodic self-eval — fresh QA subagent evaluates cumulative quality every 4-5 tasks (DRY, architecture, plan alignment)
- 🧹 Context engineering — just-in-time retrieval, structured notes, artifact-based communication, compaction strategy
No task is marked DONE until build passes, tests pass, and the leader agent explicitly approves.
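The periodic checks in the list above (goal alignment every 3 tasks, a fresh QA evaluation every 4-5 tasks) can be pictured as a simple counter. The sketch below is illustrative only and is not taken from the skill's source: the function name `checks_due` and its parameters are hypothetical, and the self-eval interval is fixed at 4 as an assumption within the stated 4-5 range.

```python
from __future__ import annotations


def checks_due(completed_tasks: int, drift_every: int = 3, self_eval_every: int = 4) -> list[str]:
    """Return the periodic checks that fall due once `completed_tasks` tasks are DONE.

    Hypothetical helper: the intervals mirror the README text, not the skill's code.
    """
    due: list[str] = []
    if completed_tasks > 0 and completed_tasks % drift_every == 0:
        due.append("anti_drift")  # Leader re-checks goal alignment
    if completed_tasks > 0 and completed_tasks % self_eval_every == 0:
        due.append("self_eval")  # fresh QA subagent reviews cumulative quality
    return due


if __name__ == "__main__":
    for n in range(1, 13):
        print(n, checks_due(n))
```

With these defaults, both checks coincide after the twelfth task, which is where a long run would pause for the heaviest review.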
🚀 Quick Install
npx sean-agentic-orchestrator
Then open Claude Code and type /agentic-orchestrator. That’s it.
📦 All Install Options
Option A — npx (Recommended)
npx sean-agentic-orchestrator
or
npx sean-agentic-orchestrator@latest
Automatically copies skill files to ~/.claude/skills/agentic-orchestrator.
Option B — Global Install
npm install -g sean-agentic-orchestrator
Option C — Git Clone
git clone https://github.com/seancheick/Sean-Agentic-Orchestrator.git /tmp/ao
mkdir -p ~/.claude/skills
cp -r /tmp/ao ~/.claude/skills/agentic-orchestrator
rm -rf /tmp/ao
Option D — Manual
- Download this repo as a ZIP
- Extract and copy the folder to ~/.claude/skills/agentic-orchestrator
✅ Verify Installation
/agentic-orchestrator
Claude will activate the orchestrator and begin the intake phase.
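If the command is not recognized, it can help to confirm that the files landed where the installer is said to copy them. The following is a minimal sketch, assuming the layout described in this README (a `SKILL.md` file and a `references/` folder under `~/.claude/skills/agentic-orchestrator`); the helper name `missing_skill_files` is hypothetical and not part of the package.

```python
from pathlib import Path

# Install location stated in this README; adjust if your setup differs.
DEFAULT_SKILL_DIR = Path.home() / ".claude" / "skills" / "agentic-orchestrator"

# Top-level entries shown in the Skill Structure section.
EXPECTED_ENTRIES = ["SKILL.md", "references"]


def missing_skill_files(skill_dir=DEFAULT_SKILL_DIR):
    """Return the expected entries that are absent from `skill_dir`."""
    skill_dir = Path(skill_dir)
    return [name for name in EXPECTED_ENTRIES if not (skill_dir / name).exists()]


if __name__ == "__main__":
    missing = missing_skill_files()
    print("install looks complete" if not missing else f"missing: {missing}")
```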
🛠️ Supported Stacks
Works with any tech stack. Includes built-in commands, pitfalls, and conventions for:
| Stack | Detection |
|---|---|
| ⚛️ Next.js / React / TypeScript | package.json, tsconfig.json, next.config.* |
| 🐦 Flutter / Dart | pubspec.yaml, .dart files |
| 🐍 Python / Django / FastAPI | pyproject.toml, manage.py, requirements.txt |
| 🦀 Rust | Cargo.toml |
| 🐹 Go | go.mod |
| 🍎 Swift / iOS | Package.swift, .xcodeproj |
| ☕ Spring Boot / Java / Kotlin | pom.xml, build.gradle |
| 🐘 Supabase / PostgreSQL | supabase/ directory, .sql files |
| 📦 Monorepos | turbo.json, nx.json, pnpm-workspace.yaml |
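The detection column above amounts to looking for marker files in the project root. The sketch below shows one way that lookup could work; it is an illustration under simplifying assumptions, not the skill's actual logic. It only checks the top level of the directory, treats any single marker as a match (so a bare `package.json` matches the first row), and skips the `.dart` and `.sql` file scans. The names `STACK_MARKERS` and `detect_stacks` are hypothetical.

```python
from pathlib import Path

# Marker patterns copied from the table above (top-level only, a simplification).
STACK_MARKERS = {
    "Next.js / React / TypeScript": ["package.json", "tsconfig.json", "next.config.*"],
    "Flutter / Dart": ["pubspec.yaml"],
    "Python / Django / FastAPI": ["pyproject.toml", "manage.py", "requirements.txt"],
    "Rust": ["Cargo.toml"],
    "Go": ["go.mod"],
    "Swift / iOS": ["Package.swift", "*.xcodeproj"],
    "Spring Boot / Java / Kotlin": ["pom.xml", "build.gradle"],
    "Supabase / PostgreSQL": ["supabase"],
    "Monorepos": ["turbo.json", "nx.json", "pnpm-workspace.yaml"],
}


def detect_stacks(root):
    """Return every stack whose marker patterns match something in `root`."""
    root = Path(root)
    found = []
    for stack, patterns in STACK_MARKERS.items():
        # Path.glob yields matching paths; any() is True if at least one exists.
        if any(any(root.glob(pattern)) for pattern in patterns):
            found.append(stack)
    return found


if __name__ == "__main__":
    print(detect_stacks("."))
```

A project can match several rows at once, for example a Rust service inside a Turborepo, which is why the function returns a list rather than a single stack.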
💡 Usage
Invoke Directly
/agentic-orchestrator
Add user authentication with JWT tokens, refresh token rotation,
and role-based access control to my Express API.
Natural Trigger Phrases
The skill also activates on:
- “Break this down and execute…”
- “Orchestrate the implementation of…”
- “Plan and execute this feature…”
- Any multi-step engineering goal
What You’ll See
Planning phase — structured task list with model tiers, risk levels, dependencies:
T1: Set up JWT signing/verification utilities [Mid] [Low Risk]
T2: Create auth middleware [Mid] [High Risk]
T3: Add refresh token rotation endpoint [Mid] [High Risk]
T4: Implement role-based access control [Mid] [Medium Risk]
T5: Add auth tests [Mid] [Low Risk]
Execution phase — verified output after each task:
TASK EXECUTION
Task ID: T1 | Status: QA_REVIEW
Files: src/utils/jwt.ts — signing/verification helpers
Build: PASS
Tests: PASS (12 passed, 0 failed)
Lint: PASS
Done Gate — explicit leader sign-off before the next task begins.
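The planning-phase lines shown above follow a regular shape: task ID, title, model tier, risk level. If you want to post-process a plan, for example to list the high-risk tasks before approving it, a small parser is enough. This is a sketch that assumes the exact line format of the sample above; the real output may vary, and `PlannedTask` and `parse_plan` are hypothetical names.

```python
import re
from dataclasses import dataclass

# Matches lines such as: "T2: Create auth middleware [Mid] [High Risk]"
PLAN_LINE = re.compile(r"^(T\d+): (.+?) \[(\w+)\] \[(\w+) Risk\]$")


@dataclass
class PlannedTask:
    task_id: str
    title: str
    tier: str
    risk: str


def parse_plan(text):
    """Parse plan lines in the sample format; lines that do not match are skipped."""
    tasks = []
    for line in text.splitlines():
        match = PLAN_LINE.match(line.strip())
        if match:
            tasks.append(PlannedTask(*match.groups()))
    return tasks


SAMPLE_PLAN = """\
T1: Set up JWT signing/verification utilities [Mid] [Low Risk]
T2: Create auth middleware [Mid] [High Risk]
T3: Add refresh token rotation endpoint [Mid] [High Risk]
T4: Implement role-based access control [Mid] [Medium Risk]
T5: Add auth tests [Mid] [Low Risk]
"""

if __name__ == "__main__":
    high_risk = [t.task_id for t in parse_plan(SAMPLE_PLAN) if t.risk == "High"]
    print(high_risk)
```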
🤖 Agent System
9 specialized agents coordinate to execute your work:
| Agent | Role |
|---|---|
| 👑 Leader | Owns the plan, controls progress, runs done gates, anti-drift checks |
| 📋 Planner | Decomposes goals into atomic tasks |
| 📖 Reader | Cheap context gathering and targeted file scanning |
| 🔨 Builder | Implements changes with strict scope control |
| 🐛 Debug | Investigates failures, isolates root causes |
| 🧪 Test | Validates behavior, identifies missing coverage |
| 🔍 QA/QC | Reviews quality, rejects incomplete work |
| 🔒 Security | OWASP scanning, secrets detection, auth validation |
| ⚡ Perf | Profiling, bottleneck detection, optimization validation |
✅ The 9-Point Done Gate
Every task must pass ALL gates before being marked DONE:
| # | Gate | What It Checks |
|---|---|---|
| 1 | 📐 Scope | All requested work is implemented |
| 2 | 🎯 Acceptance | Success criteria are satisfied |
| 3 | 🏗️ Build | Zero errors, zero new warnings |
| 4 | 🧪 Tests | Pass — new tests where gaps existed |
| 5 | 🧹 Lint | Passes (if configured) |
| 6 | 🔄 Regression | No breakage in adjacent code |
| 7 | 🔒 Security | No new vulnerabilities (boundary tasks) |
| 8 | 💎 QA | Maintainable, follows conventions |
| 9 | 👑 Leader | Explicit approval |
Any gate fails → task status = FIX_REQUIRED → automatic repair loop.
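The all-or-nothing rule above can be expressed in a few lines. The sketch below is a hypothetical model of the gate, not the skill's implementation: gate names are lowercased versions of the table rows, `None` is an assumed convention for "not applicable" (lint not configured, security on a non-boundary task), and a gate with no recorded result counts as a failure.

```python
from __future__ import annotations

# Gate order follows the table above.
GATES = ["scope", "acceptance", "build", "tests", "lint",
         "regression", "security", "qa", "leader"]


def evaluate_done_gate(results: dict[str, bool | None]) -> tuple[str, list[str]]:
    """Return (status, failed_gates) for one task.

    True  -> gate passed
    False -> gate failed
    None  -> gate not applicable (assumed convention)
    A gate missing from `results` is treated as failed.
    """
    failed = [gate for gate in GATES if results.get(gate, False) is False]
    status = "DONE" if not failed else "FIX_REQUIRED"
    return status, failed


if __name__ == "__main__":
    results = {gate: True for gate in GATES}
    results["tests"] = False
    print(evaluate_done_gate(results))
```

Treating a missing result as a failure keeps the sketch consistent with the rule that the Leader's approval must be explicit.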
💰 Model Routing
The orchestrator routes each task to the cheapest capable model tier — no wasted tokens:
| Tier | Model | Used For |
|---|---|---|
| 🟢 Cheap | haiku | Reading, scanning, extraction, summaries |
| 🟡 Mid | sonnet | Implementation, testing, QA, bug fixes |
| 🔴 High | opus | Architecture, ambiguous root-cause, critical reviews |
| 🧠 Advisor | opus (consulted) | Plan review, high-risk decisions, phase sign-off |
New in v2.3: The Advisor Pattern — instead of running entire tasks on the most expensive model, a Mid-tier executor consults a High-tier advisor only at critical decision points. Result: +2.7% accuracy, -11.9% cost.
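The routing table and the Advisor Pattern can be read as two separate lookups: one picks the executor model for the task, the other decides whether Opus is consulted at a given decision point. The sketch below illustrates that reading. The task-kind and decision-point labels are hypothetical identifiers derived from the table wording, and the full escalation policy in `references/model-routing.md` is not modeled here.

```python
TIER_MODEL = {"cheap": "haiku", "mid": "sonnet", "high": "opus"}

# Task kinds taken from the "Used For" column; the identifiers are illustrative.
TASK_TIER = {
    "reading": "cheap", "scanning": "cheap", "extraction": "cheap", "summary": "cheap",
    "implementation": "mid", "testing": "mid", "qa": "mid", "bug_fix": "mid",
    "architecture": "high", "ambiguous_root_cause": "high", "critical_review": "high",
}

# Points at which the advisor is consulted, per the Advisor row.
ADVISOR_POINTS = {"plan_review", "high_risk_decision", "phase_sign_off"}


def route(task_kind, decision_point=None):
    """Return the executor model and, if applicable, the advisor model."""
    if task_kind not in TASK_TIER:
        raise ValueError(f"unknown task kind: {task_kind}")
    executor = TIER_MODEL[TASK_TIER[task_kind]]
    advisor = "opus" if decision_point in ADVISOR_POINTS else None
    return {"executor": executor, "advisor": advisor}


if __name__ == "__main__":
    print(route("implementation"))
    print(route("implementation", decision_point="plan_review"))
```

Keeping the advisor lookup separate from the tier lookup is what lets a Mid-tier executor stay on the cheaper model for the bulk of a task.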
🆕 What’s New
| Version | Feature | Impact |
|---|---|---|
| v2.4.0 | Code Reuse Enforcement — Reader scans for existing components; Builder reuses; QA rejects duplicates | Eliminates bloat and duplicate code |
| v2.4.0 | Clean & Simple Code (Principle #9) — 3+ uses to abstract, ~40 line max, no speculative code | Stops over-engineering |
| v2.4.0 | Periodic Self-Eval — fresh QA subagent evaluates every 4-5 tasks (DRY, architecture, alignment) | Catches quality issues mid-project |
| v2.4.0 | Harness Config — reasoning effort HIGH, hooks for enforcement, context discipline | More accurate responses |
| v2.4.0 | Opus 4.7 alignment — model IDs updated, adaptive-thinking xhigh guidance, self-verification awareness | Accurate routing on current model lineup |
| v2.3.1 | Anti-Drift Checks — Leader verifies goal alignment every 3 tasks | Catches scope creep before it burns tokens |
| v2.3.1 | Parallel Invocation Rules — explicit decision table for when to parallelize vs stay sequential | Prevents wasted multi-agent overhead (~15x token cost) |
| v2.3.1 | Task-Type Hints — starting pipeline suggestions for bug fix, feature, refactor, perf, security | Faster planning, but planner adapts based on actual task |
| v2.3.0 | Advisor Pattern — Opus consulted at plan review, critical decisions, and phase sign-off | +2.7% accuracy, -11.9% cost |
| v2.3.0 | Context Engineering — just-in-time retrieval, structured notes, artifact-based agent communication | Prevents context bloat, extends effective session length |
| v2.3.0 | Smart Repair Termination — convergence detection, quality thresholds, self-consistency trap avoidance | Stops wasted repair cycles, saves tokens |
All enhancements validated against Anthropic’s published guidance on building effective agents.
📁 Skill Structure
agentic-orchestrator/
├── SKILL.md # Core orchestrator — principles, loop, done gate
└── references/
    ├── agent-roles.md # 9 specialized agent definitions
    ├── advisor-pattern.md # Advisor consultation strategy & integration
    ├── context-engineering.md # Token efficiency & memory management
    ├── harness-config.md # Accuracy optimization settings & hooks
    ├── model-routing.md # Full escalation/de-escalation policy
    ├── output-templates.md # 8 structured output templates
    ├── stack-patterns.md # 9 tech stack command & pitfall guides
    └── failure-recovery.md # Recovery patterns & escalation ladder
⚙️ Configuration
Zero configuration needed. The skill auto-reads your project’s existing config:
- CLAUDE.md: project instructions
- package.json / Cargo.toml / go.mod / pyproject.toml: scripts & deps
- tsconfig.json: TypeScript settings
- .eslintrc / biome.json / rustfmt.toml: linter config
- turbo.json / nx.json: monorepo config
🤝 Contributing
Contributions welcome!
- Fork this repository
- Edit references/ for stack patterns, agent roles, or recovery patterns
- Edit SKILL.md for core orchestration logic
- Test locally: copy to ~/.claude/skills/agentic-orchestrator and run /agentic-orchestrator
- Submit a PR with a description of what changed and why
Adding a New Stack
Edit references/stack-patterns.md:
- Add detection (what config files indicate this stack)
- Add build/test/lint commands
- Add common pitfalls
- Add conventions to check
📄 License
MIT License — see LICENSE for details.
Built by Sean Cheick, Founder & CEO · B&Br Technology
⭐ If this skill saves you time, star the repo — it helps others find it.