claudeAutoPilot

claudeautopilot · travissutphin/openSource-claudeAutoPilot · ★ 0 · last commit 2026-02-22

PRD-driven autonomous development framework that manages a 4-column Kanban workflow, runs quality gates, and surfaces only classified decisions (deploy/architecture/scope/security) to the human.

Best when90% of workflow decisions can be automated; only production deploys, architecture choices, scope changes, and security issues require human input.

Skip ifStarting without a PRD, Running SetupProject without archiving the old kanban

vs seeds

bmad-method(multi-persona orchestration), it's single-agent with a 4-stage kanban. Claude Code only — no cross-tool portability.

Primitive shape 12 total

Commands 12

Summary

claudeAutoPilot — Summary

claudeAutoPilot is a PRD-driven autonomous development framework for Claude Code that transforms a Product Requirements Document into a managed Kanban workflow with automated quality gates (security scan, code review, test), environment setup, and deployment lifecycle management.

Problem it solves: Claude Code is a powerful assistant but requires the developer to manually manage the development lifecycle; claudeAutoPilot imposes a 4-column kanban (Backlog → In Progress → QA → Live) driven by PRD tasks, automating the transitions between stages and surfacing only the decisions that truly require human input.

Distinctive trait: The kanban board is a generated HTML file (docs/kanban/kanban_dev.html) — no external project management tool required — and the framework's decision loop routes only specific decision types (production deploy, architecture change, scope change, security issue) to the human, handling everything else autonomously.

Target audience: Solo developers and small teams using Claude Code who want to turn a PRD into a shipping application with minimal workflow overhead; the framework installs as a .claude/commands/ + .autopilot/ folder pair dropped into any project.

Production-readiness: v3.1.0 commands; MIT license; maintained by Travis Sutphin with 0 stars and 1 contributor; last pushed February 2026. Small project, early-stage.

Differs from seeds: Closest to agent-os in its Markdown-scaffold + bash-script architecture (no MCP, no skills beyond commands), but claudeAutoPilot adds an automated kanban board generation, PRD-driven task extraction with a Node.js validator, a 4-stage DevOps workflow with explicit QA gates, and daily digest/end-of-day commands. Unlike BMAD-METHOD (multi-persona orchestration), claudeAutoPilot is single-agent with human-mediated decision gates. Unlike taskmaster-ai (task decomposition via MCP), claudeAutoPilot uses bash scripts and simple JSON config files with no external server.

Overview

claudeAutoPilot — Origin and Philosophy

Origin

Created by Travis Sutphin (github.com/travissutphin) as an open-source project under MIT license. Repository: travissutphin/openSource-claudeAutoPilot. The author also maintains rapidPRD.app for AI-assisted PRD creation, suggesting the framework is part of a larger PRD-to-production toolchain.

Philosophy (from README)

"Turn a PRD into a production app with AI handling the process. You make decisions. AI does everything else."

On the decision model:

"Claude handles 90% of workflow automatically. You're only asked when:

Production deploy: 'Task #15 ready for production. Approve?'

Architecture change: 'Should we use Redis or in-memory caching?'

Scope change: 'This feature needs 2 more tasks. Proceed?'

Security issue: 'Medium vulnerability found. Fix now or defer?'"

On the workflow:

"Break down the PRD into tasks, Create and manage a kanban board, Move tasks through a 4-column DevOps workflow, Run tests, security scans, and code reviews automatically, Deploy to staging, verify, and request production approval, Only notify you when decisions are needed."

Design principles

PRD-first — the framework's entry point is always a PRD document; no PRD → no workflow
Kanban as artifact — project state lives in an HTML kanban board committed to the repo
Decision taxonomy — decisions are classified (production, architecture, scope, security) and only the relevant decision types surface to the human
Daily cadence — [StartDay] / [EndDay] commands structure the workday around task execution

Architecture

claudeAutoPilot — Architecture

Distribution

Type: standalone-repo (clone-and-use)
CLI binary: none
Source: https://github.com/travissutphin/openSource-claudeAutoPilot
Version analyzed: v3.1.0 (command version)
License: MIT

Install

git clone https://github.com/travissutphin/openSource-claudeAutoPilot.git my-project
cd my-project
# Place PRD at docs/prd/PRD.md
# Open Claude Code: claude
# Run: [SetupProject]

Required runtime

Claude Code
Node.js 18+ (for automation scripts)
Git

Target AI tools

Claude Code exclusively.

Directory structure

my-project/
├── .autopilot/
│   ├── automation/          # Node.js scripts (10 scripts)
│   │   ├── ai-code-review.js
│   │   ├── evaluate-progression.js
│   │   ├── file-generator.js
│   │   ├── generate-secrets.js
│   │   ├── iterative-refinement.js
│   │   ├── kanban-updater.js
│   │   ├── pattern-analyzer.js
│   │   ├── populate-env.js
│   │   ├── prd-validator.js
│   │   └── setup-database.js
│   ├── config/              # Configuration JSON files
│   │   ├── approval-levels.json
│   │   ├── decision-taxonomy.json
│   │   ├── environments.json
│   │   ├── placeholders.json
│   │   ├── quality-gates.json
│   │   ├── refinement-rules.json
│   │   └── workflow-states.json
│   ├── docs/                # Framework documentation
│   │   ├── SETUP-NEW-PROJECT.md
│   │   └── TASK-LIFECYCLE.md
│   ├── examples/            # Sample PRDs
│   │   └── sample-prd-saas.md
│   └── templates/           # Templates for generated files
├── .claude/
│   └── commands/            # 12 slash commands
├── docs/
│   ├── prd/
│   │   └── PRD.md           # YOUR PRD GOES HERE
│   └── kanban/
│       └── kanban_dev.html  # Generated Kanban board
├── CLAUDE.md                # Generated AI instructions
└── .env.example             # Generated environment template

Workflow states (4-column kanban)

Column	Color	Next states
Backlog	#ddd6fe	in_progress
In Progress	#bfdbfe	qa, backlog
QA	#fed7aa	live, in_progress
Live	#bbf7d0	(done)

Components

claudeAutoPilot — Components

Slash Commands (12)

Command	When to use	What happens
`[SetupProject]`	New project	Read + validate PRD, extract tasks, generate config, CLAUDE.md, kanban
`[SetupEnvironment]`	After setup	Provision database, generate secrets, populate .env
`[StartDay]`	Start of session	Greet user, check services, status report, suggest next task
`[TaskStart]`	Begin a task	Create branch, move card to In Progress
`[TaskQA]`	Code complete	Run security scan, code review, move to QA
`[TaskComplete]`	QA passed	Move to Live, update kanban
`[DeployLive]`	Ready for prod	Deploy to production (requires human approval)
`[Monitor]`	Runs automatically	Check environments, auto-progress tasks
`[Digest]`	Daily or manual	Generate daily summary
`[EndDay]`	End of session	Wrap up, set tomorrow's priorities
`[TaskRefine]`	When task unclear	Clarify task requirements via iterative refinement
`[TaskReview]`	For code review	Trigger AI code review via automation script
`[TaskStage]`	Before deploy	Stage deployment verification

Node.js Automation Scripts (10)

Script	Purpose
`prd-validator.js`	Validate PRD quality + extract setup data as JSON
`kanban-updater.js`	Update kanban HTML board state
`ai-code-review.js`	Run automated AI code review
`evaluate-progression.js`	Evaluate task progression/completion
`file-generator.js`	Generate CLAUDE.md, .env.example from templates
`generate-secrets.js`	Generate secure secrets for .env
`iterative-refinement.js`	Iterative task refinement loop
`pattern-analyzer.js`	Analyze code patterns for consistency
`populate-env.js`	Populate .env from template
`setup-database.js`	Database provisioning

Config files (7)

File	Purpose
`workflow-states.json`	Kanban columns, transitions, automation triggers
`decision-taxonomy.json`	Decision type classification (production/architecture/scope/security)
`quality-gates.json`	QA gate definitions
`approval-levels.json`	Who approves what
`environments.json`	Environment definitions (dev/staging/prod)
`placeholders.json`	Project variables (owner name, stack, etc.)
`refinement-rules.json`	Rules for task refinement loop

Generated artifacts

CLAUDE.md — AI instructions generated from PRD and project config
docs/kanban/kanban_dev.html — Interactive HTML kanban board
.env.example — Environment template from PRD tech requirements
.autopilot/docs/decisions/pending.md — Pending decisions queue

Prompts

claudeAutoPilot — Prompt Files (Verbatim Excerpts)

Excerpt 1: SetupProject command (`.claude/commands/setupproject.md`)

Technique: PRD-validated setup with single smart form and safe file generation

# [SetupProject] - New Project Setup

**Version**: 3.1.0
**Command**: `[SetupProject]` or `/setupproject`
**Trigger**: Run when starting a new project or when [StartDay] detects unconfigured project
**Purpose**: Read PRD to extract project info, validate quality, confirm with user,
generate config files safely
**Executor**: [Codey] (TPM) with [PRODUCT_OWNER]

---

## AUTO-EXECUTION INSTRUCTIONS

**This is a PRD-DRIVEN workflow with validation. Look for PRD first, validate quality,
extract info, confirm in ONE interaction, preview changes, then generate files with backup.**

**Key Improvements in v3.1:**
1. **Single Smart Form** - All questions in one interaction (not 4+ rounds)
2. **PRD Validation** - Quality check before task extraction
3. **Safe File Generation** - Preview changes, automatic backups, preserves customizations

## STEP 0: Check for PRD
**Executor**: [Codey]

### Search for PRD:
```bash
PRD_LOCATIONS=("docs/prd/PRD.md" "docs/prd/*.md" "docs/PRD.md" "PRD.md" "*.prd.md")
for loc in "${PRD_LOCATIONS[@]}"; do
    if compgen -G "$loc" > /dev/null 2>&1; then
        PRD_FILE=$(ls $loc 2>/dev/null | head -1)
        break
    fi
done

If PRD Found:

Use the PRD validator to analyze and extract data:

node .autopilot/automation/prd-validator.js "$PRD_FILE" --setup-data > /tmp/prd-data.json
node .autopilot/automation/prd-validator.js "$PRD_FILE" --score

Then proceed to STEP 1B (Smart Setup Form with PRD data pre-filled).


## Excerpt 2: StartDay command (`.claude/commands/startday.md`)

Technique: **Daily context initialization with service health checks and proactive task suggestion**

```markdown
# [StartDay] - Proactive Session Startup

**Version**: 2.0.0
**Command**: `[StartDay]` or `/startday`
**Trigger**: AUTO-RUNS on every conversation start
**Purpose**: Initialize session, check status, ask about missing setup, start server
**Executor**: [Codey] (TPM) as proactive assistant

---

## AUTO-EXECUTION INSTRUCTIONS

**This workflow runs AUTOMATICALLY when a conversation starts.**
Be proactive. Don't just report - ASK questions and TAKE actions.

## STEP 1: Greet User & Load Context

### Actions:
1. Read `.autopilot/config/placeholders.json`
2. Get product_owner name, project name, tech stack
3. Greet user personally

### Output:

Good [morning/afternoon], [product_owner]!

Project: [project_name] Tech: [primary_language] + [css_framework]


## STEP 2: Check Required Services

### Purpose:
Check if required local services are running. If not, ASK user to start them.

[PostgreSQL, MySQL/MariaDB, MongoDB, Redis health checks with appropriate messaging]

Prompting techniques observed

Named executor pattern — commands assign a named executor persona ("Codey", "Flow") to each step, creating role distinction without requiring separate agent instances
Auto-execution annotation — explicit "AUTO-RUNS on every conversation start" triggers the command as a session initializer
Version header — **Version**: 3.1.0 and changelog notes in command header prevent running outdated instructions
PRD-first gate — Step 0 always checks for a PRD before any other setup; prevents framework from running without required input

Uniqueness

claudeAutoPilot — Uniqueness and Positioning

Differs from seeds

claudeAutoPilot is closest to agent-os in its Markdown-scaffold + bash-script architecture (no MCP server, no skills beyond commands), but adds a PRD-driven setup phase, an HTML kanban board, a decision taxonomy classification system, and a daily cadence structure (StartDay/EndDay). Unlike taskmaster-ai which uses an MCP server and JSON task files, claudeAutoPilot uses Node.js automation scripts and a generated HTML kanban board. Unlike BMAD-METHOD which coordinates multiple named personas across the full development lifecycle, claudeAutoPilot is single-agent with a simpler 4-stage kanban. The decision-only notification model (routing only 4 decision types to the human) is closer to a "lights-out operations" philosophy than any seed framework explicitly implements.

Positioning

claudeAutoPilot targets developers who want to ship a PRD to production without managing a separate project management tool. The generated HTML kanban board is the project management surface, and Claude Code handles the workflow execution. The framework is explicitly "Claude Code only" — no cross-tool portability.

Observable failure modes

PRD quality bottleneck — if the PRD is thin or vague, task extraction fails and the entire framework has nothing to work from; prd-validator.js adds a quality gate but can't fix a bad PRD
HTML kanban limitation — a static HTML file doesn't support real-time multi-session collaboration; only one Claude session should update the board at a time
Small project maturity — 0 stars, 1 contributor, last push Feb 2026; the framework is early-stage and may have rough edges
No cross-tool portability — entirely Claude Code-specific; can't be used with Gemini CLI, Cursor, etc.
Decision routing reliability — the "only 4 decision types surface to human" promise depends on Claude correctly classifying all decisions; misclassification could lead to autonomous decisions that should have involved the human

Explicit antipatterns

Starting without a PRD
Running [SetupProject] multiple times without archiving the old kanban
Manually editing the kanban HTML (use the automation scripts instead)

Workflow

claudeAutoPilot — Workflow

Phases

Phase	Command	Artifact
Project Setup	`[SetupProject]`	CLAUDE.md, kanban board, config files, tasks extracted from PRD
Environment Setup	`[SetupEnvironment]`	.env populated, database provisioned
Daily Start	`[StartDay]`	Status report, suggested next task
Task Development	`[TaskStart]` → code → `[TaskQA]`	Branch created, code written, security scan, code review
Task Completion	`[TaskComplete]`	Card moved to Live
Deploy	`[DeployLive]`	Staging verified → production deploy (human approval required)
Daily End	`[EndDay]`	Session wrap-up, tomorrow's priorities

4-column Kanban workflow

Backlog → In Progress → QA → Live
   │            │          │      │
   │            │          │      └── Production deployed
   │            │          └── Security scan + code review
   │            └── Active development (branch created)
   └── All tasks from PRD

Approval gates

Gate	Type	When
PRD quality validation	file-review	Before `[SetupProject]` completes
Project config confirmation	freetext-clarify	Setup form: single smart interaction
Production deploy	yes-no	`[DeployLive]` requires human approval
Architecture changes	choice-list	Autonomous discovery → surfaces to human
Scope changes	yes-no	Task expansion requires approval
Security issue handling	choice-list	Medium+ vulnerabilities surface to human

Decision-only notification model

Only these decision types surface to the human:

Production deploy approval
Architecture decisions (tech choices)
Scope changes (task expansion)
Security issues (medium+ severity)

All other workflow transitions are automated.

Daily digest

.autopilot/docs/decisions/pending.md — daily digest shows pending decisions with context and recommended actions.

Memory Context

claudeAutoPilot — Memory and Context

State storage

All state is file-based. No database or external service.

State	Location
Project config	`.autopilot/config/placeholders.json` (owner name, tech stack, etc.)
Workflow state	`.autopilot/config/workflow-states.json`
Kanban board	`docs/kanban/kanban_dev.html`
Decisions pending	`.autopilot/docs/decisions/pending.md`
Task status	Kanban board HTML (task cards with state column)
Environment config	`.env` / `.env.example`
AI instructions	`CLAUDE.md` (regenerated from PRD)
PRD	`docs/prd/PRD.md`

Persistence

Project-scoped — all config files and kanban board are committed to the repo
Daily cadence — [StartDay] reads placeholders.json to restore context each session

Context injection

CLAUDE.md is generated by [SetupProject] from the PRD. It serves as the primary persistent context for Claude Code between sessions — customized to the specific project's tech stack, coding conventions, and task backlog.

The [StartDay] command auto-runs on conversation start and reinitializes context by reading placeholders.json.

Compaction handling

None built-in. The generated CLAUDE.md provides sufficient project context for a fresh session to pick up from the kanban board state.

PRD as source of truth

The PRD (docs/prd/PRD.md) is the authoritative source for:

Task breakdown (extracted by prd-validator.js)
Technology stack
Environment requirements
Quality standards

Changes to scope require PRD updates followed by re-running [SetupProject].

Orchestration

claudeAutoPilot — Orchestration

Multi-agent

No — single Claude Code session. The commands define named personas ("Codey", "Flow") for different roles within a single conversation, but there is no multi-agent infrastructure.

Orchestration pattern

Sequential — Backlog → In Progress → QA → Live. Tasks move through stages one at a time with automated transitions except for human-required decision gates.

Execution mode

Continuous-ralph — [StartDay] auto-runs on conversation start, [Monitor] runs periodically to check environments and auto-progress tasks. The framework is designed for autonomous continuous execution with interruptions only for classified decisions.

Subagent definition format

None — no subagents.

Isolation mechanism

git-branch — [TaskStart] creates a feature branch for each task.

Multi-model

No — single Claude Code session with user's configured model.

Consensus mechanism

None.

Prompt chaining

Yes — PRD → task extraction (prd-validator.js) → CLAUDE.md generation → task execution. The PRD's content flows forward into all subsequent commands via the generated CLAUDE.md.

Git automation

Yes (partial):

[TaskStart] creates a branch
[TaskQA] runs automated checks
[TaskComplete] updates kanban

Commits and PR creation are not fully automated — the commands instruct Claude to commit and push but require Claude Code's standard git operations.

Auto-validators (quality gates)

The [TaskQA] command triggers:

Security scan (via ai-code-review.js)
Code review (automated)
Test execution

All before moving a card to QA column.

Ui Cli Surface

claudeAutoPilot — UI and CLI Surface

CLI binary

None — no dedicated CLI. The framework is invoked entirely through Claude Code slash commands ([SetupProject], [StartDay], etc.) within a Claude Code session.

Local UI

HTML Kanban board — docs/kanban/kanban_dev.html is a generated interactive HTML file:

4 columns (Backlog, In Progress, QA, Live) with color coding
Task cards with state transitions
Opened in a browser from the local filesystem
Updated by kanban-updater.js automation script
Not a server-based dashboard — a static HTML file

This is distinct from WORCA's real-time WebSocket dashboard or Aigon's Next.js server.

IDE integration

Claude Code only — commands are in .claude/commands/ format.

Observability

Kanban board — visual task state in HTML
Daily digest — [Digest] generates a daily summary
Pending decisions — .autopilot/docs/decisions/pending.md lists decisions awaiting human input
PRD validator score — prd-validator.js --score rates PRD quality before setup

Decision routing surface

The framework classifies all decisions into 4 types (production deploy, architecture, scope, security) and routes them to .autopilot/docs/decisions/pending.md. Daily digest surfaces these to the developer.

No server required

Unlike WORCA (Python + Node.js servers), Aigon (localhost:4100), or DocBrain (Docker compose), claudeAutoPilot requires only Claude Code and Node.js with no running server or database.

Related frameworks

same archetype · same primary tool · same memory type

BMAD-METHOD ★ 48k

A4 Markdown scaffold

Provides a full agile delivery lifecycle with named expert-persona AI collaborators that elicit the human's best thinking rather…

Agent OS ★ 4.6k

A4 Markdown scaffold

Extracts implicit codebase conventions into token-efficient markdown standards files and injects them selectively into AI agent…

Claude Conductor ★ 367

A4 Markdown scaffold

Gives Claude Code a persistent, cross-linked, auto-analyzed documentation system so it retains codebase context across sessions.

Spec-Driver (Greenfield Spec-Driven Development) ★ 25

A4 Markdown scaffold

Prevents spec rot in AI-assisted development by making implementation changes flow back into evergreen, authoritative specs via…

Anthropic Knowledge Work Plugins ★ 16k

A4 Markdown scaffold

Role-specialized plugin bundles with live MCP connectors that turn Claude into a domain expert for enterprise knowledge workers.

Codex Integration for Claude Code (skill-codex) ★ 1.3k

A4 Markdown scaffold

Single Claude Code skill that handles Codex CLI invocation correctly (stdin blocking, thinking token suppression, session resume)…

Distribution

Type: standalone-repo
License: MIT
Install: clone-and-configure
Version: v3.1.0 (command version)

Surfaces

CLI binary: No
CLI subcmds: 0
Local UI: other
Tech stack: Static HTML (generated file — docs/kanban/kanban_dev.html)

Components

Commands: 12
Skills: 0
Subagents: 0
Hooks: 0
MCP servers: 0
MCP tools: 0
Scripts: 10
Templates: 3

Workflow

Phases: 8
Approval gates: 4
Spec format: markdown
Spec storage: flat-files
Delta or full: whole-file

Orchestration

Multi-agent: No
Pattern: sequential
Max concurrent: 1
Isolation: git-branch
Consensus: none
Prompt chaining: Yes

Multi-model

Multi-model: No
BYOK: Yes
Modal: text

Execution

Mode: continuous-ralph
Crash recovery: Yes
Compaction: No
Session handoff: Yes
Streaming: No

Memory

Type: file-based
Persistence: project
Search: none
State files: 5 files

Quality

TDD: Optional
TDD mechanism: dedicated-skill
Validators: 2
Self-review: inline-self

Git / Observability

Auto commit: Yes
Auto PR: No
Auto merge: No
Worktree/feat: No
Audit log: Yes
Audit format: structured-md
Replay: No

Tools

Primary: claude-code
Targets: 1
Portability: single-tool

Signals

Stars: 0
Last commit: 2026-02-22
Contributors: 1
Maintainer: active
Quality score: 4.9/10

Summary

claudeAutoPilot — Summary

Overview

claudeAutoPilot — Origin and Philosophy

Origin

Philosophy (from README)

Design principles

Architecture

claudeAutoPilot — Architecture

Distribution

Install

Required runtime

Target AI tools

Directory structure

Workflow states (4-column kanban)

Components

claudeAutoPilot — Components

Slash Commands (12)

Node.js Automation Scripts (10)

Config files (7)

Generated artifacts

Prompts

claudeAutoPilot — Prompt Files (Verbatim Excerpts)

Excerpt 1: SetupProject command (.claude/commands/setupproject.md)

If PRD Found:

Prompting techniques observed

Uniqueness

claudeAutoPilot — Uniqueness and Positioning

Differs from seeds

Positioning

Observable failure modes

Explicit antipatterns

Workflow

claudeAutoPilot — Workflow

Phases

4-column Kanban workflow

Approval gates

Decision-only notification model

Daily digest

Memory Context

claudeAutoPilot — Memory and Context

State storage

Persistence

Context injection

Compaction handling

PRD as source of truth

Orchestration

claudeAutoPilot — Orchestration

Multi-agent

Orchestration pattern

Execution mode

Subagent definition format

Isolation mechanism

Multi-model

Consensus mechanism

Prompt chaining

Git automation

Auto-validators (quality gates)

Ui Cli Surface

claudeAutoPilot — UI and CLI Surface

CLI binary

Local UI

IDE integration

Observability

Decision routing surface

No server required

Related frameworks

Excerpt 1: SetupProject command (`.claude/commands/setupproject.md`)