SpecCrew

speccrew · charlesmu99/speccrew · ★ 3 · last commit 2026-04-22

Apply ISA-95 industrial methodology to AI-assisted software delivery — 8 specialized agents orchestrate PRD through deployment with a 3-tier knowledge base and per-requirement iteration isolation.

Best whenEach software requirement deserves its own isolated project scope (iterations/iXXX/) and every phase transition deserves a human gate — speed comes from know…

Skip ifMonolithic AI sessions that span PRD through testing, Shared context across unrelated requirements

vs seeds

spec-kit(outputs specs for human review), SpecCrew executes them. Unlike openspec (API-focused), SpecCrew treats API contracts a…

Primitive shape 79 total

Skills 71 Subagents 8

Summary

speccrew — Summary

Slug: speccrew GitHub: https://github.com/charlesmu99/speccrew Stars: 3 License: Apache-2.0 Version: v0.7.76 Language: JavaScript

One-Liner

npm-installable multi-agent spec factory — 8 specialized agents orchestrate PRD → feature design → API contract → system design → implementation → deployment → test with ISA-95 methodology and a 3-tier knowledge base.

Archetype

Skills-based multi-agent orchestrator with CLI dispatcher. Closest to trw-mcp in the current batch (multi-agent pipeline with human-confirmation gates) but ships as an npm package with no MCP layer and uses ISA-95 industrial modeling as its intellectual framework instead of value-oriented hooks.

Key Facts

Install: npm install -g speccrew — speccrew CLI binary from bin/cli.js
8 agents: team-leader, product-manager, feature-designer, system-designer, system-developer, task-worker, test-manager, system-deployer
71 skills prefixed speccrew- covering planning, analysis, coding, testing, and knowledge management
ISA-95 6-stage enterprise integration modeling as architectural methodology
3-tier knowledge base: L1 technology docs (knowledge/techs/), L2 business domain docs (knowledge/bizs/), L3 iteration artifacts (iterations/iXXX/)
Iterations pattern: each requirement becomes an isolated project under iterations/iXXX-[name]/
4-stage knowledge pipeline: scan → parallel analysis → parallel summarize → aggregate
Gates: all phase transitions require human confirmation
Multi-language README: 10 languages (Chinese primary + EN, JA, RU, ES, DE, FR, PT-BR, AR, HI)
Sub-agent auto-dispatch for context management; agents are dynamically created per tech stack

Overview

speccrew — Overview

What It Is

SpecCrew is an npm-installable multi-agent AI development orchestration system. Teams install it globally (npm install -g speccrew), initialize a project, then run the speccrew CLI to drive a complete software delivery pipeline from PRD authoring through deployment and testing.

The central thesis: treat software specification and delivery as an industrial process modeled on ISA-95 (the enterprise integration standard), where each "requirement" is an independent project iteration with its own isolated artifact directory.

Problem It Solves

Traditional AI-assisted development has agents that span too many concerns in a single context window, producing inconsistent output quality across phases. SpecCrew solves this by assigning hard-bounded specialists (product manager knows PRD, system designer knows API contracts, etc.) and enforcing phase gates with human confirmation before each handoff.

Target User

Development teams wanting to go from idea → deployed software with AI doing the heavy lifting at each phase
Teams with polyglot tech stacks (the dynamic agent creation per tech stack supports this)
International teams: 10 README languages with Chinese as the primary, suggesting a strong Chinese developer community target

Distribution

npm package: speccrew (globally installed)
Repository: https://github.com/charlesmu99/speccrew
Version: v0.7.76 (frequent minor releases)
License: Apache-2.0

Positioning vs. Spec-Driven Seeds

Unlike spec-kit (which outputs spec documents for human review), SpecCrew executes the specs through agents automatically
Unlike openspec (OpenAPI-focused), SpecCrew generates API contracts as one phase of a broader pipeline
Unlike spec-driver (which focuses on test-driven spec enforcement), SpecCrew treats testing as the last phase after deployment
Closest analog in the seed set is a combination of taskmaster-ai (task breakdown) + bmad-method (role-based agent team) but with stronger industrial methodology grounding

Architecture

speccrew — Architecture

High-Level Structure

speccrew/
├── bin/cli.js                    # CLI entry point
├── .speccrew/
│   ├── agents/                   # 8 agent definitions
│   │   ├── speccrew-team-leader.md
│   │   ├── speccrew-product-manager.md
│   │   ├── speccrew-feature-designer.md
│   │   ├── speccrew-system-designer.md
│   │   ├── speccrew-system-developer.md
│   │   ├── speccrew-task-worker.md
│   │   ├── speccrew-test-manager.md
│   │   └── speccrew-system-deployer.md
│   └── skills/                   # 71 skills (speccrew-* prefix)
├── knowledge/
│   ├── techs/                    # L1: Technology documentation
│   └── bizs/                    # L2: Business domain documentation
└── iterations/
    └── iXXX-[name]/             # L3: Per-requirement isolated artifacts

Knowledge Tier Architecture

3-tier knowledge base:

Tier	Location	Content	Scope
L1	`knowledge/techs/`	Technology docs, framework references	Global / reused
L2	`knowledge/bizs/`	Business domain, glossary, context	Global / reused
L3	`iterations/iXXX/`	Iteration-specific artifacts, PRD, designs	Per-requirement

Knowledge Pipeline

4-stage pipeline for assembling context before each agent phase:

Scan — discover relevant L1/L2/L3 files
Parallel analysis — agents analyze files concurrently
Parallel summarize — distill key facts per file
Aggregate — combine into a unified context document

ISA-95 Methodology

SpecCrew uses ISA-95 (enterprise integration standard) as its modeling framework:

6-stage system modeling decomposition
Clear separation between functional specifications and physical implementations
Equipment hierarchy modeling applied to software component hierarchy

Agent Dispatch Model

Team Leader coordinates phase transitions and dispatches to specialists
Sub-agent auto-dispatch for context management: when context grows too large, the team leader spawns a sub-agent to handle the overflow
Agents are dynamically created per tech stack — the system-designer agent generates stack-specific variants
All phase transitions require human confirmation before proceeding

Execution Environment

Runs inside Claude Code (.speccrew/agents/ and .speccrew/skills/ loaded by CC)
No MCP server — agents communicate through filesystem artifacts
iterations/iXXX-[name]/ acts as the state store per requirement

Components

speccrew — Components

CLI Binary

Command: speccrew Entry point: bin/cli.js Distribution: npm global install

Agents (8 total)

Agent	Role	Primary Skills
`speccrew-team-leader`	Orchestrates phases, dispatches agents, manages gates	speccrew-knowledge-*
`speccrew-product-manager`	PRD authoring, requirement decomposition	speccrew-pm-*
`speccrew-feature-designer`	Feature design, UX/UI specs	speccrew-fd-*
`speccrew-system-designer`	API contracts, system architecture	speccrew-sd-*
`speccrew-system-developer`	Code implementation	speccrew-dev-*
`speccrew-task-worker`	Granular task execution (sub-tasks from developer)	speccrew-dev-*
`speccrew-test-manager`	Test planning and execution	speccrew-test-*
`speccrew-system-deployer`	Deployment configuration and execution	(deploy skills)

Skills (71 total)

Skill naming convention: speccrew-{domain}-{action}

Domain prefixes:

speccrew-pm-* — product management (requirement analysis, PRD structuring)
speccrew-sd-* — system design (API contracts, data models)
speccrew-dev-* — development (implementation, code review)
speccrew-test-* — testing (test planning, test case generation)
speccrew-knowledge-* — knowledge pipeline (scan, analyze, summarize, aggregate)
Additional domain-specific skills for feature design and deployment

Knowledge Base

L1 — Technology docs (knowledge/techs/):

Framework references, library documentation
Tech stack decision records

L2 — Business domain (knowledge/bizs/):

Domain glossary
Business rules
Stakeholder context

L3 — Iteration artifacts (iterations/iXXX-[name]/):

PRD.md (product requirements)
feature-design.md
api-contract.yaml
system-design.md
implementation artifacts
deployment config
test results

Configuration Files

speccrew.config.js / speccrew.json — project configuration
.speccrew/agents/ — agent markdown definitions
.speccrew/skills/ — skill markdown definitions

Output Artifacts Per Iteration

Each iterations/iXXX-[name]/ directory contains the complete lineage from PRD through test results for one requirement.

Prompts

speccrew — Prompts & Instructions

Agent Definition Format

Agent definitions are markdown files in .speccrew/agents/. Each agent file declares:

Role and responsibility scope
Permitted skills (the skill IDs the agent may invoke)
Output format requirements (the artifact structure the agent must produce)
Handoff protocol (what to confirm before marking phase complete)

Team Leader Dispatch Pattern

The speccrew-team-leader agent orchestrates the pipeline. Its definition includes:

Phase sequence and gate conditions
Agent dispatch rules per phase
Human confirmation request phrasing ("Please confirm to proceed to [next phase]")
Sub-agent auto-dispatch rules for context overflow management

Skill Format

Skills are markdown files in .speccrew/skills/. Each skill defines:

Input requirements (what context must be present)
Step-by-step instructions for completing the skill
Output format specification
Quality criteria the output must meet

ISA-95 Prompt Vocabulary

SpecCrew embeds ISA-95 terminology into agent prompts:

"Functional specifications" vs "physical implementations" as distinct layers
Equipment hierarchy (applied to software: system → subsystem → component → module)
Level boundaries: business planning, operations management, operations control, physical processes

Knowledge Aggregation Prompt Pattern

The knowledge pipeline produces a structured context document fed to each phase agent:

## Technology Context
[aggregated from L1 knowledge/techs/]

## Business Context  
[aggregated from L2 knowledge/bizs/]

## Iteration Context
[aggregated from L3 iterations/iXXX/]

Human Gate Prompt

Before each phase transition, the team leader presents:

Summary of what was completed in the current phase
Key decisions made and artifacts produced
What the next phase will do
Explicit confirmation request: "Type 'yes' to proceed to [Phase Name]"

Multi-Language Support

README and potentially agent prompts are available in 10 languages. The Chinese-primary structure suggests the agent prompt files may contain bilingual annotations.

Uniqueness

speccrew — Uniqueness

Differs from Seeds

SpecCrew is closest to a combination of bmad-method (role-based agent team with a product manager, architect, developer) and taskmaster-ai (task breakdown and tracking), but distinguished by:

ISA-95 industrial methodology — no other framework in the seeds or batch uses an enterprise integration standard as its intellectual backbone. This brings rigorous tier separation, level boundaries, and functional/physical distinction to AI-assisted development.
3-tier knowledge base with 4-stage pipeline — the scan→parallel analysis→parallel summarize→aggregate pattern for context assembly is more sophisticated than any seed's context management. Unlike agent-os (flat markdown rules), SpecCrew's knowledge architecture separates technology context (L1), business context (L2), and iteration context (L3) with deliberate staleness management.
71 skills with strict domain prefixing — speccrew-pm-*, speccrew-sd-*, speccrew-dev-*, speccrew-test-*, speccrew-knowledge-* — more skills than any seed, organized by agent role rather than by action type.
"Each requirement as independent project" pattern (iterations/iXXX/) — unlike taskmaster-ai's flat task list or bmad's single-project focus, SpecCrew treats every requirement as a completely isolated artifact tree. This enables parallel requirement development and clean audit trails.
10-language README (Chinese-primary) — only multilingual framework in the seed set or this batch, indicating intentional non-English-first design.

Compared to spec-kit (outputs spec docs for human review): SpecCrew executes the specs through agents. Compared to openspec (API-focused): SpecCrew treats API contracts as one phase of a 7-phase pipeline. Compared to spec-driver (test-driven spec enforcement): SpecCrew tests last, not first.

Positioning

Target user: Teams wanting full-pipeline automation from idea to tested deployment, with human gates at each phase
Key differentiator: ISA-95 methodology + 3-tier knowledge architecture + per-requirement iteration isolation
Honest gap: 3 GitHub stars and v0.7.76 suggest active development but pre-adoption; the industrial methodology may have a steep learning curve for typical web dev teams

Observable Failure Modes

7 human gates — the "all transitions require confirmation" design maximizes safety but maximizes interruption; teams that want hands-off automation will be frustrated
No test-first — testing is phase 7, not phase 1; bugs introduced in implementation aren't caught until the end
Context overflow reliance — sub-agent auto-dispatch for context overflow is a runtime hedge, not a structured solution; behavior may be inconsistent
ISA-95 mismatch — industrial modeling concepts (equipment hierarchy, level boundaries) may confuse teams unfamiliar with the standard
CLI underdocumented — bin/cli.js exists but subcommand documentation is sparse

Workflow

speccrew — Workflow

Phase Pipeline

PRD → Feature Design → API Contract → System Design → Dev → Deployment → Test
       [GATE]          [GATE]          [GATE]          [GATE]  [GATE]       [GATE]

Every phase transition requires explicit human confirmation before the next agent is dispatched.

Phase Details

1. PRD (Product Manager)

Input: user's high-level requirement / idea
Agent: speccrew-product-manager
Output: iterations/iXXX-[name]/PRD.md
Activities: requirement analysis, user story decomposition, acceptance criteria definition
Gate: human reviews PRD, confirms completeness

2. Feature Design (Feature Designer)

Input: PRD.md
Agent: speccrew-feature-designer
Output: iterations/iXXX-[name]/feature-design.md
Activities: UX/UI specification, component inventory, interaction flows
Gate: human confirms design aligns with PRD intent

3. API Contract (System Designer)

Input: PRD.md + feature-design.md
Agent: speccrew-system-designer
Output: iterations/iXXX-[name]/api-contract.yaml
Activities: API endpoint definition, data model specification, integration contracts
Gate: human confirms API contract is complete

4. System Design (System Designer)

Input: all previous artifacts
Agent: speccrew-system-designer
Output: iterations/iXXX-[name]/system-design.md
Activities: component architecture, database schema, deployment topology
Gate: human confirms system design before coding begins

5. Development (System Developer + Task Workers)

Input: all previous artifacts
Agents: speccrew-system-developer dispatches speccrew-task-worker for sub-tasks
Output: implementation code, iterations/iXXX-[name]/implementation/
Activities: code generation, unit test stubs, code review
Gate: human reviews and approves implementation

6. Deployment (System Deployer)

Input: implementation + system design
Agent: speccrew-system-deployer
Output: deployment configuration, CI/CD manifests
Gate: human confirms deployment config before test

7. Testing (Test Manager)

Input: all artifacts + deployed system
Agent: speccrew-test-manager
Output: test plan, test results in iterations/iXXX-[name]/test-results/
Activities: test planning, test case execution, defect reporting

Iteration Model

Each requirement is treated as an independent project
Iteration IDs: iXXX (e.g., i001, i002)
Full artifact lineage preserved per iteration
No shared state between iterations except L1/L2 knowledge

Knowledge Pipeline Integration

Before each phase, the knowledge pipeline runs:

Scan L1/L2/L3 for relevant context
Parallel analysis of discovered files
Parallel summarization
Aggregation into context document fed to the phase agent

Memory Context

speccrew — Memory & Context

Memory Model

SpecCrew uses the filesystem as its primary memory system. All state is written to iterations/iXXX-[name]/ — no in-process memory, no database, no external persistence layer beyond the project directory.

Context Architecture

3-Tier Knowledge Base

Tier	Path	Persistence	Scope
L1	`knowledge/techs/`	Permanent (team-maintained)	Cross-iteration
L2	`knowledge/bizs/`	Permanent (team-maintained)	Cross-iteration
L3	`iterations/iXXX/`	Per-iteration	Single requirement

Knowledge Pipeline (4 stages)

Context is assembled fresh before each phase:

Scan — find relevant files across all 3 tiers
Parallel analysis — multiple agents analyze files concurrently
Parallel summarize — distill each file to key facts
Aggregate — combine into unified context document

This is SpecCrew's primary mechanism for managing context window limits — rather than dumping all files into context, it summarizes and aggregates.

Iteration Artifacts as State

Each iterations/iXXX-[name]/ directory is the complete state for one requirement:

PRD.md — source of truth for requirement
feature-design.md — UX/UI decisions
api-contract.yaml — interface contracts
system-design.md — architecture decisions
implementation/ — code artifacts
deployment config
test-results/ — test outcomes

These files serve as both the AI's working memory and the human team's audit trail.

Sub-Agent Context Management

When any phase agent's context grows too large, the team leader auto-dispatches a sub-agent to handle the overflow. This is SpecCrew's runtime context compaction mechanism — active delegation rather than summarization.

Cross-Iteration Reuse

L1/L2 knowledge accumulates across iterations. A technology decision made in iteration i001 is captured in knowledge/techs/ and available to all subsequent iterations. This creates a project-level memory that grows over time.

No Automated Memory

No session state, no vector store, no learning capture. The system relies entirely on structured file artifacts written by agents and read through the knowledge pipeline.

Orchestration

speccrew — Orchestration

Multi-Agent: Yes

SpecCrew is a multi-agent system with 8 specialized agents operating in a sequential pipeline with parallel knowledge aggregation steps.

Orchestration Pattern: Hierarchical + Sequential

Team Leader is the orchestrator — it coordinates phase transitions and dispatches specialist agents
Phases are sequential — each phase must complete and be human-confirmed before the next starts
Within phases, parallelism is used for the knowledge pipeline (analysis + summarization steps run in parallel)
Sub-agent auto-dispatch handles context overflow at runtime

Phase Gate Mechanism

Every phase transition is a human confirmation gate:

Team Leader → completes phase → presents summary → requests confirmation → dispatches next agent

No automated phase progression — human is always in the loop between phases.

Agent Lifecycle

Agents are dynamically created per tech stack: the system-designer and system-developer agents have stack-specific variants generated for the project's technology choices. This means the effective agent count exceeds 8 for complex polyglot projects.

Execution Mode: Multi-shot (iterative per phase)

Each phase involves multiple LLM calls — the agent may ask clarifying questions, refine its output, and iterate before marking the phase complete.

Max Concurrent Agents

Within the knowledge pipeline: up to N parallel analysis + summarization agents (bounded by file count). During phase execution: 1 primary agent + N sub-agents for context overflow.

Context Management Strategy

Proactive: knowledge pipeline summarizes before feeding to agents
Reactive: sub-agent auto-dispatch when context grows during execution
Isolation: each iteration (iXXX/) is a clean scope — no cross-iteration context bleed

Consensus Mechanism

None in the traditional sense — the Team Leader is authoritative. Human confirmation is the only external input between phases.

Crash Recovery

No automated crash recovery. If a phase fails mid-execution, the iteration directory contains whatever artifacts were written before failure. Teams can restart from the last confirmed phase.

Target Runtime

Claude Code (.speccrew/agents/ and .speccrew/skills/ are Claude Code agent/skill directories).

Ui Cli Surface

speccrew — UI / CLI Surface

CLI Binary

Binary: speccrew Entry point: bin/cli.js Install: npm install -g speccrew

Subcommands are not documented in detail in the repository, but the CLI drives:

Project initialization
Iteration creation (new iXXX-[name]/ directory)
Phase progression commands
Knowledge base management

Local UI

None. All interaction is through the CLI and the Claude Code agent interface.

Primary Interface Pattern

The main user experience is:

Run speccrew CLI to initialize or manage iterations
Claude Code picks up .speccrew/agents/ and .speccrew/skills/
Team Leader agent guides the user through phases in the CC interface
Human provides confirmations at each gate via the CC chat interface

Multi-Language Interface

README is available in 10 languages:

Chinese (primary/default)
English
Japanese
Russian
Spanish
German
French
Portuguese (BR)
Arabic
Hindi

This is the most multilingual framework in the batch — suggesting intentional global market targeting with Chinese developer community as primary audience.

No Web Dashboard

No local web server, no browser UI, no port binding. Unlike vibe-architect (Next.js, port 3000) or vibedoc (Gradio, port 7860) in this batch, SpecCrew is purely terminal + CC.

Related frameworks

same archetype · same primary tool · same memory type

BMAD-METHOD ★ 48k

A4 Markdown scaffold

Provides a full agile delivery lifecycle with named expert-persona AI collaborators that elicit the human's best thinking rather…

Agent OS ★ 4.6k

A4 Markdown scaffold

Extracts implicit codebase conventions into token-efficient markdown standards files and injects them selectively into AI agent…

Claude Conductor ★ 367

A4 Markdown scaffold

Gives Claude Code a persistent, cross-linked, auto-analyzed documentation system so it retains codebase context across sessions.

Spec-Driver (Greenfield Spec-Driven Development) ★ 25

A4 Markdown scaffold

Prevents spec rot in AI-assisted development by making implementation changes flow back into evergreen, authoritative specs via…

Anthropic Knowledge Work Plugins ★ 16k

A4 Markdown scaffold

Role-specialized plugin bundles with live MCP connectors that turn Claude into a domain expert for enterprise knowledge workers.

Codex Integration for Claude Code (skill-codex) ★ 1.3k

A4 Markdown scaffold

Single Claude Code skill that handles Codex CLI invocation correctly (stdin blocking, thinking token suppression, session resume)…

Distribution

Type: npm-package
License: Apache-2.0
Install: npm-global
Version: v0.7.76

Surfaces

CLI binary: speccrew
CLI subcmds: 5
Local UI: No
Tech stack: none

Components

Commands: 0
Skills: 71
Subagents: 8
Hooks: 0
MCP servers: 0
MCP tools: 0
Scripts: 1
Templates: 7

Workflow

Phases: 7
Approval gates: 7
Spec format: markdown
Spec storage: directory-per-iteration
Delta or full: whole-file

Orchestration

Multi-agent: Yes
Pattern: hierarchical-sequential
Max concurrent: 8
Isolation: directory-per-iteration
Consensus: none
Prompt chaining: Yes

Multi-model

Multi-model: No
BYOK: Yes
Modal: text

Execution

Mode: multi-shot
Crash recovery: No
Compaction: Yes
Session handoff: Yes
Streaming: No

Memory

Type: filesystem
Persistence: per-iteration-directory
Search: knowledge-pipeline-scan
State files: 3 files

Quality

TDD: No
TDD mechanism: none
Self-review: human-required

Git / Observability

Auto commit: No
Auto PR: No
Auto merge: No
Worktree/feat: No
Audit log: Yes
Audit format: iteration-directory-artifacts
Replay: Yes

Tools

Primary: claude-code
Targets: 1
Portability: low

Signals

Stars: 3
Last commit: 2026-04-22
Contributors: 1
Maintainer: active
Quality score: 5.3/10