You pay for code, not coordination
The state machine is pure deterministic logic — no LLM in the routing layer. Every token you spend goes to actual code work, not deciding what to do next.
“Red Queen is to Claude Code what Jenkins is to bash.”
A deterministic state machine shuttles tickets through spec → plan review → code → review → test → human review. You configure the phases, skills, and gates in YAML. The orchestrator itself spends zero AI tokens — it just shuttles work between workers and stops at the gates you set. Deterministic, like a build server.
You already have a coding agent. What you don't have is a deterministic pipeline to shuttle tickets through it — spec, plan review, code, review, test, human gates — while you're not looking. Red Queen is that pipeline. Configurable in YAML, auditable, and built on a state machine that spends zero tokens on routing.
The state machine is pure deterministic logic — no LLM in the routing layer. Every token you spend goes to actual code work, not deciding what to do next.
Each phase — spec writer, coder, reviewer, tester, comment-handler — runs a purpose-built prompt in isolation. Focused beats kitchen-sink.
Review checkpoints are first-class workflow configuration, not an afterthought. Add them, remove them, reorder them in redqueen.yaml.
Bidirectional sync with Jira and GitHub Issues. Work flows from your tracker through the pipeline and back. Linear on the roadmap.
Failed phases retry up to three times, then escalate to a human gate. No infinite loops. No runaway token bills.
Optional webhook server for instant response, with a polling reconciler that works out of the box. No external queue required.
Skills are markdown prompt templates. Drop SKILL.md into .redqueen/skills/<name>/ and your version wins over the built-in.
Every integration implements the IssueTracker or SourceControl interface. New trackers = new adapter, never a core change.
Background orchestrator keeps tickets moving through automated phases and parks work at human gates for when you're back at the keyboard.
You file a ticket. Claude writes a spec. An automated plan review scores it for blockers and ambiguity. You approve at the human gate. Claude writes the code and opens a PR. Another Claude reviews it. Another tests it. You review the final PR and merge. Jump in at any gate, or rip the gates out entirely.
Phases are dynamic — defined in redqueen.yaml, not baked into the code.
Add a security-review gate. Remove the human spec-review and let
plan-review auto-promote clean specs with skipSpecReviewIfReady.
The graph is yours to shape.
Red Queen sits above your coding agent — it doesn't replace it, host it, or manage its skills. If you've seen one of these, Red Queen is a different category of thing.
Red Queen spawns claude CLI subprocesses and drives them through phases. It's the dispatcher, not the agent runtime.
Our “skills” are phase prompt templates for the pipeline. They follow the agentskills.io format so they're portable, but Red Queen isn't a registry or loader for generic user skills.
No persistent PRDs, no cross-session planning layer. Red Queen executes a ticket through a pipeline — complementary to spec frameworks, not a replacement.
Red Queen runs in the background and moves tickets through phases while you're not looking. Use Claude Code or Aider for pair-programming at the keyboard.
From zero to your first AI-driven ticket in about 15 minutes.
claude --version. Red Queen dispatches Claude Code workers, so the CLI must be logged in.npm install -g redqueenredqueen init -y.env:
GITHUB_PAT=ghp_xxxxxxxxxxxxxxxxxxxxredqueen start
Open http://127.0.0.1:4400 — the live dashboard.rq:phase:spec-writing label. Watch the phases move.Full docs, adapter guides, and configuration reference live in the GitHub README.
Red Queen isn't a replacement for the tools below — it's the manager that delegates to them. It dispatches Claude Code today; the adapter pattern means swapping in other coding CLIs is a config change, not a rewrite.
| Red Queen | Claude Code (solo) | Aider | Cline / Roo | Devin / OpenHands | |
|---|---|---|---|---|---|
| Form factor | Background orchestrator | Interactive CLI | Interactive pair | IDE agent | Autonomous agent |
| Orchestration | Deterministic state machine | You are the orchestrator | You are the orchestrator | IDE-driven loop | LLM-driven loop |
| Human gates | First-class, configurable | Ad-hoc | Ad-hoc | Ad-hoc | Minimal |
| Token cost for routing | Zero | N/A (you route) | N/A (you route) | Per decision | Per decision |
| Works while you sleep | Yes | No | No | No | Yes |
| Debuggable flow | Read the state machine | N/A | N/A | Hope the LLM explains | Hope the LLM explains |
The questions you'd ask before adopting.
A deterministic orchestrator for AI coding agents. It's a state machine that dispatches isolated AI workers through a complete software development lifecycle: spec writing, coding, review, testing, and human review. The orchestrator spends zero AI tokens on routing — all decisions are made by pure deterministic logic, not an LLM.
Red Queen is to Claude Code what Jenkins is to bash — it sits above the coding agent and shuttles tickets through phases on a deterministic state machine. Devin, Cline, and Aider are coding agents you interact with directly. Red Queen is the pipeline that dispatches them in the background, with configurable human gates between phases. No LLM in the routing layer, so routing costs zero tokens and is fully debuggable.
No. Red Queen spawns claude CLI subprocesses and drives them through a phase graph — it's a
dispatcher, not an agent runtime or a generic skill registry. Our “skills” are phase prompt
templates for the pipeline; they follow the agentskills.io
frontmatter format so they're portable to any compatible client, but Red Queen itself does not load or
manage user-installed skills.
OpenSpec is a spec / PRD framework — persistent planning artifacts across sessions. Red Queen is an execution pipeline — one ticket in, PR out. They're complementary, not competitive. You can absolutely feed OpenSpec-authored tickets into a Red Queen pipeline.
Today, yes. Red Queen dispatches Claude Code workers. The adapter pattern makes adding support for other coding CLIs straightforward, and that's on the roadmap.
The orchestration itself is free — zero tokens spent on routing. Tokens are what your Claude subscription or API already charges for the actual code work.
Yes. Red Queen runs as a local Node.js 24+ process on your laptop or any server you own. No hosted service is required. Your code is only sent to the AI CLI you configure; Red Queen itself talks to GitHub or Jira over HTTPS.
Yes — project.buildCommand and testCommand scope per-run, and per-module commands
are configurable.
Phases retry up to three times and then escalate to a human gate. No infinite loops, no runaway bills.
Yes — edit redqueen.yaml. Gates are configuration, not hardcoded.
Yes. Drop a SKILL.md at .redqueen/skills/<name>/ and it wins over the built-in.
See the skills contract in the GitHub repository.
GitHub Issues, Jira, and GitHub (source control) ship today. Linear is on the roadmap.
It's a v0.1 preview. Use it, file bugs, expect rough edges — the API surface is stable enough to build on.
Yes — MIT-licensed, developed in the open at github.com/odyth/red-queen.
Open source. Self-hosted. Deterministic. Star the repo, kick the tires, file bugs.