Skip to content
/

claudeAutoPilot

claudeautopilot · travissutphin/openSource-claudeAutoPilot · ★ 0 · last commit 2026-02-22

PRD-driven autonomous development framework that manages a 4-column Kanban workflow, runs quality gates, and surfaces only classified decisions (deploy/architecture/scope/security) to the human.

Best when90% of workflow decisions can be automated; only production deploys, architecture choices, scope changes, and security issues require human input.
Skip ifStarting without a PRD, Running SetupProject without archiving the old kanban
vs seeds
bmad-method(multi-persona orchestration), it's single-agent with a 4-stage kanban. Claude Code only — no cross-tool portability.
Primitive shape 12 total
Commands 12
00

Summary

claudeAutoPilot — Summary

claudeAutoPilot is a PRD-driven autonomous development framework for Claude Code that transforms a Product Requirements Document into a managed Kanban workflow with automated quality gates (security scan, code review, test), environment setup, and deployment lifecycle management.

Problem it solves: Claude Code is a powerful assistant but requires the developer to manually manage the development lifecycle; claudeAutoPilot imposes a 4-column kanban (Backlog → In Progress → QA → Live) driven by PRD tasks, automating the transitions between stages and surfacing only the decisions that truly require human input.

Distinctive trait: The kanban board is a generated HTML file (docs/kanban/kanban_dev.html) — no external project management tool required — and the framework's decision loop routes only specific decision types (production deploy, architecture change, scope change, security issue) to the human, handling everything else autonomously.

Target audience: Solo developers and small teams using Claude Code who want to turn a PRD into a shipping application with minimal workflow overhead; the framework installs as a .claude/commands/ + .autopilot/ folder pair dropped into any project.

Production-readiness: v3.1.0 commands; MIT license; maintained by Travis Sutphin with 0 stars and 1 contributor; last pushed February 2026. Small project, early-stage.

Differs from seeds: Closest to agent-os in its Markdown-scaffold + bash-script architecture (no MCP, no skills beyond commands), but claudeAutoPilot adds an automated kanban board generation, PRD-driven task extraction with a Node.js validator, a 4-stage DevOps workflow with explicit QA gates, and daily digest/end-of-day commands. Unlike BMAD-METHOD (multi-persona orchestration), claudeAutoPilot is single-agent with human-mediated decision gates. Unlike taskmaster-ai (task decomposition via MCP), claudeAutoPilot uses bash scripts and simple JSON config files with no external server.

01

Overview

claudeAutoPilot — Origin and Philosophy

Origin

Created by Travis Sutphin (github.com/travissutphin) as an open-source project under MIT license. Repository: travissutphin/openSource-claudeAutoPilot. The author also maintains rapidPRD.app for AI-assisted PRD creation, suggesting the framework is part of a larger PRD-to-production toolchain.

Philosophy (from README)

"Turn a PRD into a production app with AI handling the process. You make decisions. AI does everything else."

On the decision model:

"Claude handles 90% of workflow automatically. You're only asked when:

  • Production deploy: 'Task #15 ready for production. Approve?'
  • Architecture change: 'Should we use Redis or in-memory caching?'
  • Scope change: 'This feature needs 2 more tasks. Proceed?'
  • Security issue: 'Medium vulnerability found. Fix now or defer?'"

On the workflow:

"Break down the PRD into tasks, Create and manage a kanban board, Move tasks through a 4-column DevOps workflow, Run tests, security scans, and code reviews automatically, Deploy to staging, verify, and request production approval, Only notify you when decisions are needed."

Design principles

  1. PRD-first — the framework's entry point is always a PRD document; no PRD → no workflow
  2. Kanban as artifact — project state lives in an HTML kanban board committed to the repo
  3. Decision taxonomy — decisions are classified (production, architecture, scope, security) and only the relevant decision types surface to the human
  4. Daily cadence[StartDay] / [EndDay] commands structure the workday around task execution
02

Architecture

claudeAutoPilot — Architecture

Distribution

Install

git clone https://github.com/travissutphin/openSource-claudeAutoPilot.git my-project
cd my-project
# Place PRD at docs/prd/PRD.md
# Open Claude Code: claude
# Run: [SetupProject]

Required runtime

  • Claude Code
  • Node.js 18+ (for automation scripts)
  • Git

Target AI tools

Claude Code exclusively.

Directory structure

my-project/
├── .autopilot/
│   ├── automation/          # Node.js scripts (10 scripts)
│   │   ├── ai-code-review.js
│   │   ├── evaluate-progression.js
│   │   ├── file-generator.js
│   │   ├── generate-secrets.js
│   │   ├── iterative-refinement.js
│   │   ├── kanban-updater.js
│   │   ├── pattern-analyzer.js
│   │   ├── populate-env.js
│   │   ├── prd-validator.js
│   │   └── setup-database.js
│   ├── config/              # Configuration JSON files
│   │   ├── approval-levels.json
│   │   ├── decision-taxonomy.json
│   │   ├── environments.json
│   │   ├── placeholders.json
│   │   ├── quality-gates.json
│   │   ├── refinement-rules.json
│   │   └── workflow-states.json
│   ├── docs/                # Framework documentation
│   │   ├── SETUP-NEW-PROJECT.md
│   │   └── TASK-LIFECYCLE.md
│   ├── examples/            # Sample PRDs
│   │   └── sample-prd-saas.md
│   └── templates/           # Templates for generated files
├── .claude/
│   └── commands/            # 12 slash commands
├── docs/
│   ├── prd/
│   │   └── PRD.md           # YOUR PRD GOES HERE
│   └── kanban/
│       └── kanban_dev.html  # Generated Kanban board
├── CLAUDE.md                # Generated AI instructions
└── .env.example             # Generated environment template

Workflow states (4-column kanban)

Column Color Next states
Backlog #ddd6fe in_progress
In Progress #bfdbfe qa, backlog
QA #fed7aa live, in_progress
Live #bbf7d0 (done)
03

Components

claudeAutoPilot — Components

Slash Commands (12)

Command When to use What happens
[SetupProject] New project Read + validate PRD, extract tasks, generate config, CLAUDE.md, kanban
[SetupEnvironment] After setup Provision database, generate secrets, populate .env
[StartDay] Start of session Greet user, check services, status report, suggest next task
[TaskStart] Begin a task Create branch, move card to In Progress
[TaskQA] Code complete Run security scan, code review, move to QA
[TaskComplete] QA passed Move to Live, update kanban
[DeployLive] Ready for prod Deploy to production (requires human approval)
[Monitor] Runs automatically Check environments, auto-progress tasks
[Digest] Daily or manual Generate daily summary
[EndDay] End of session Wrap up, set tomorrow's priorities
[TaskRefine] When task unclear Clarify task requirements via iterative refinement
[TaskReview] For code review Trigger AI code review via automation script
[TaskStage] Before deploy Stage deployment verification

Node.js Automation Scripts (10)

Script Purpose
prd-validator.js Validate PRD quality + extract setup data as JSON
kanban-updater.js Update kanban HTML board state
ai-code-review.js Run automated AI code review
evaluate-progression.js Evaluate task progression/completion
file-generator.js Generate CLAUDE.md, .env.example from templates
generate-secrets.js Generate secure secrets for .env
iterative-refinement.js Iterative task refinement loop
pattern-analyzer.js Analyze code patterns for consistency
populate-env.js Populate .env from template
setup-database.js Database provisioning

Config files (7)

File Purpose
workflow-states.json Kanban columns, transitions, automation triggers
decision-taxonomy.json Decision type classification (production/architecture/scope/security)
quality-gates.json QA gate definitions
approval-levels.json Who approves what
environments.json Environment definitions (dev/staging/prod)
placeholders.json Project variables (owner name, stack, etc.)
refinement-rules.json Rules for task refinement loop

Generated artifacts

  • CLAUDE.md — AI instructions generated from PRD and project config
  • docs/kanban/kanban_dev.html — Interactive HTML kanban board
  • .env.example — Environment template from PRD tech requirements
  • .autopilot/docs/decisions/pending.md — Pending decisions queue
05

Prompts

claudeAutoPilot — Prompt Files (Verbatim Excerpts)

Excerpt 1: SetupProject command (.claude/commands/setupproject.md)

Technique: PRD-validated setup with single smart form and safe file generation

# [SetupProject] - New Project Setup

**Version**: 3.1.0
**Command**: `[SetupProject]` or `/setupproject`
**Trigger**: Run when starting a new project or when [StartDay] detects unconfigured project
**Purpose**: Read PRD to extract project info, validate quality, confirm with user,
generate config files safely
**Executor**: [Codey] (TPM) with [PRODUCT_OWNER]

---

## AUTO-EXECUTION INSTRUCTIONS

**This is a PRD-DRIVEN workflow with validation. Look for PRD first, validate quality,
extract info, confirm in ONE interaction, preview changes, then generate files with backup.**

**Key Improvements in v3.1:**
1. **Single Smart Form** - All questions in one interaction (not 4+ rounds)
2. **PRD Validation** - Quality check before task extraction
3. **Safe File Generation** - Preview changes, automatic backups, preserves customizations

## STEP 0: Check for PRD
**Executor**: [Codey]

### Search for PRD:
```bash
PRD_LOCATIONS=("docs/prd/PRD.md" "docs/prd/*.md" "docs/PRD.md" "PRD.md" "*.prd.md")
for loc in "${PRD_LOCATIONS[@]}"; do
    if compgen -G "$loc" > /dev/null 2>&1; then
        PRD_FILE=$(ls $loc 2>/dev/null | head -1)
        break
    fi
done

If PRD Found:

Use the PRD validator to analyze and extract data:

node .autopilot/automation/prd-validator.js "$PRD_FILE" --setup-data > /tmp/prd-data.json
node .autopilot/automation/prd-validator.js "$PRD_FILE" --score

Then proceed to STEP 1B (Smart Setup Form with PRD data pre-filled).


## Excerpt 2: StartDay command (`.claude/commands/startday.md`)

Technique: **Daily context initialization with service health checks and proactive task suggestion**

```markdown
# [StartDay] - Proactive Session Startup

**Version**: 2.0.0
**Command**: `[StartDay]` or `/startday`
**Trigger**: AUTO-RUNS on every conversation start
**Purpose**: Initialize session, check status, ask about missing setup, start server
**Executor**: [Codey] (TPM) as proactive assistant

---

## AUTO-EXECUTION INSTRUCTIONS

**This workflow runs AUTOMATICALLY when a conversation starts.**
Be proactive. Don't just report - ASK questions and TAKE actions.

## STEP 1: Greet User & Load Context

### Actions:
1. Read `.autopilot/config/placeholders.json`
2. Get product_owner name, project name, tech stack
3. Greet user personally

### Output:

Good [morning/afternoon], [product_owner]!

Project: [project_name] Tech: [primary_language] + [css_framework]


## STEP 2: Check Required Services

### Purpose:
Check if required local services are running. If not, ASK user to start them.

[PostgreSQL, MySQL/MariaDB, MongoDB, Redis health checks with appropriate messaging]

Prompting techniques observed

  1. Named executor pattern — commands assign a named executor persona ("Codey", "Flow") to each step, creating role distinction without requiring separate agent instances
  2. Auto-execution annotation — explicit "AUTO-RUNS on every conversation start" triggers the command as a session initializer
  3. Version header**Version**: 3.1.0 and changelog notes in command header prevent running outdated instructions
  4. PRD-first gate — Step 0 always checks for a PRD before any other setup; prevents framework from running without required input
09

Uniqueness

claudeAutoPilot — Uniqueness and Positioning

Differs from seeds

claudeAutoPilot is closest to agent-os in its Markdown-scaffold + bash-script architecture (no MCP server, no skills beyond commands), but adds a PRD-driven setup phase, an HTML kanban board, a decision taxonomy classification system, and a daily cadence structure (StartDay/EndDay). Unlike taskmaster-ai which uses an MCP server and JSON task files, claudeAutoPilot uses Node.js automation scripts and a generated HTML kanban board. Unlike BMAD-METHOD which coordinates multiple named personas across the full development lifecycle, claudeAutoPilot is single-agent with a simpler 4-stage kanban. The decision-only notification model (routing only 4 decision types to the human) is closer to a "lights-out operations" philosophy than any seed framework explicitly implements.

Positioning

claudeAutoPilot targets developers who want to ship a PRD to production without managing a separate project management tool. The generated HTML kanban board is the project management surface, and Claude Code handles the workflow execution. The framework is explicitly "Claude Code only" — no cross-tool portability.

Observable failure modes

  1. PRD quality bottleneck — if the PRD is thin or vague, task extraction fails and the entire framework has nothing to work from; prd-validator.js adds a quality gate but can't fix a bad PRD
  2. HTML kanban limitation — a static HTML file doesn't support real-time multi-session collaboration; only one Claude session should update the board at a time
  3. Small project maturity — 0 stars, 1 contributor, last push Feb 2026; the framework is early-stage and may have rough edges
  4. No cross-tool portability — entirely Claude Code-specific; can't be used with Gemini CLI, Cursor, etc.
  5. Decision routing reliability — the "only 4 decision types surface to human" promise depends on Claude correctly classifying all decisions; misclassification could lead to autonomous decisions that should have involved the human

Explicit antipatterns

  • Starting without a PRD
  • Running [SetupProject] multiple times without archiving the old kanban
  • Manually editing the kanban HTML (use the automation scripts instead)
04

Workflow

claudeAutoPilot — Workflow

Phases

Phase Command Artifact
Project Setup [SetupProject] CLAUDE.md, kanban board, config files, tasks extracted from PRD
Environment Setup [SetupEnvironment] .env populated, database provisioned
Daily Start [StartDay] Status report, suggested next task
Task Development [TaskStart] → code → [TaskQA] Branch created, code written, security scan, code review
Task Completion [TaskComplete] Card moved to Live
Deploy [DeployLive] Staging verified → production deploy (human approval required)
Daily End [EndDay] Session wrap-up, tomorrow's priorities

4-column Kanban workflow

Backlog → In Progress → QA → Live
   │            │          │      │
   │            │          │      └── Production deployed
   │            │          └── Security scan + code review
   │            └── Active development (branch created)
   └── All tasks from PRD

Approval gates

Gate Type When
PRD quality validation file-review Before [SetupProject] completes
Project config confirmation freetext-clarify Setup form: single smart interaction
Production deploy yes-no [DeployLive] requires human approval
Architecture changes choice-list Autonomous discovery → surfaces to human
Scope changes yes-no Task expansion requires approval
Security issue handling choice-list Medium+ vulnerabilities surface to human

Decision-only notification model

Only these decision types surface to the human:

  1. Production deploy approval
  2. Architecture decisions (tech choices)
  3. Scope changes (task expansion)
  4. Security issues (medium+ severity)

All other workflow transitions are automated.

Daily digest

.autopilot/docs/decisions/pending.md — daily digest shows pending decisions with context and recommended actions.

06

Memory Context

claudeAutoPilot — Memory and Context

State storage

All state is file-based. No database or external service.

State Location
Project config .autopilot/config/placeholders.json (owner name, tech stack, etc.)
Workflow state .autopilot/config/workflow-states.json
Kanban board docs/kanban/kanban_dev.html
Decisions pending .autopilot/docs/decisions/pending.md
Task status Kanban board HTML (task cards with state column)
Environment config .env / .env.example
AI instructions CLAUDE.md (regenerated from PRD)
PRD docs/prd/PRD.md

Persistence

  • Project-scoped — all config files and kanban board are committed to the repo
  • Daily cadence[StartDay] reads placeholders.json to restore context each session

Context injection

CLAUDE.md is generated by [SetupProject] from the PRD. It serves as the primary persistent context for Claude Code between sessions — customized to the specific project's tech stack, coding conventions, and task backlog.

The [StartDay] command auto-runs on conversation start and reinitializes context by reading placeholders.json.

Compaction handling

None built-in. The generated CLAUDE.md provides sufficient project context for a fresh session to pick up from the kanban board state.

PRD as source of truth

The PRD (docs/prd/PRD.md) is the authoritative source for:

  • Task breakdown (extracted by prd-validator.js)
  • Technology stack
  • Environment requirements
  • Quality standards

Changes to scope require PRD updates followed by re-running [SetupProject].

07

Orchestration

claudeAutoPilot — Orchestration

Multi-agent

No — single Claude Code session. The commands define named personas ("Codey", "Flow") for different roles within a single conversation, but there is no multi-agent infrastructure.

Orchestration pattern

Sequential — Backlog → In Progress → QA → Live. Tasks move through stages one at a time with automated transitions except for human-required decision gates.

Execution mode

Continuous-ralph[StartDay] auto-runs on conversation start, [Monitor] runs periodically to check environments and auto-progress tasks. The framework is designed for autonomous continuous execution with interruptions only for classified decisions.

Subagent definition format

None — no subagents.

Isolation mechanism

git-branch[TaskStart] creates a feature branch for each task.

Multi-model

No — single Claude Code session with user's configured model.

Consensus mechanism

None.

Prompt chaining

Yes — PRD → task extraction (prd-validator.js) → CLAUDE.md generation → task execution. The PRD's content flows forward into all subsequent commands via the generated CLAUDE.md.

Git automation

Yes (partial):

  • [TaskStart] creates a branch
  • [TaskQA] runs automated checks
  • [TaskComplete] updates kanban

Commits and PR creation are not fully automated — the commands instruct Claude to commit and push but require Claude Code's standard git operations.

Auto-validators (quality gates)

The [TaskQA] command triggers:

  • Security scan (via ai-code-review.js)
  • Code review (automated)
  • Test execution

All before moving a card to QA column.

08

Ui Cli Surface

claudeAutoPilot — UI and CLI Surface

CLI binary

None — no dedicated CLI. The framework is invoked entirely through Claude Code slash commands ([SetupProject], [StartDay], etc.) within a Claude Code session.

Local UI

HTML Kanban boarddocs/kanban/kanban_dev.html is a generated interactive HTML file:

  • 4 columns (Backlog, In Progress, QA, Live) with color coding
  • Task cards with state transitions
  • Opened in a browser from the local filesystem
  • Updated by kanban-updater.js automation script
  • Not a server-based dashboard — a static HTML file

This is distinct from WORCA's real-time WebSocket dashboard or Aigon's Next.js server.

IDE integration

Claude Code only — commands are in .claude/commands/ format.

Observability

  • Kanban board — visual task state in HTML
  • Daily digest[Digest] generates a daily summary
  • Pending decisions.autopilot/docs/decisions/pending.md lists decisions awaiting human input
  • PRD validator scoreprd-validator.js --score rates PRD quality before setup

Decision routing surface

The framework classifies all decisions into 4 types (production deploy, architecture, scope, security) and routes them to .autopilot/docs/decisions/pending.md. Daily digest surfaces these to the developer.

No server required

Unlike WORCA (Python + Node.js servers), Aigon (localhost:4100), or DocBrain (Docker compose), claudeAutoPilot requires only Claude Code and Node.js with no running server or database.

Related frameworks

same archetype · same primary tool · same memory type

BMAD-METHOD ★ 48k

Provides a full agile delivery lifecycle with named expert-persona AI collaborators that elicit the human's best thinking rather…

Agent OS ★ 4.6k

Extracts implicit codebase conventions into token-efficient markdown standards files and injects them selectively into AI agent…

Claude Conductor ★ 367

Gives Claude Code a persistent, cross-linked, auto-analyzed documentation system so it retains codebase context across sessions.

Spec-Driver (Greenfield Spec-Driven Development) ★ 25

Prevents spec rot in AI-assisted development by making implementation changes flow back into evergreen, authoritative specs via…

Anthropic Knowledge Work Plugins ★ 16k

Role-specialized plugin bundles with live MCP connectors that turn Claude into a domain expert for enterprise knowledge workers.

Codex Integration for Claude Code (skill-codex) ★ 1.3k

Single Claude Code skill that handles Codex CLI invocation correctly (stdin blocking, thinking token suppression, session resume)…