Skip to content
/

SpecCrew

speccrew · charlesmu99/speccrew · ★ 3 · last commit 2026-04-22

Apply ISA-95 industrial methodology to AI-assisted software delivery — 8 specialized agents orchestrate PRD through deployment with a 3-tier knowledge base and per-requirement iteration isolation.

Best whenEach software requirement deserves its own isolated project scope (iterations/iXXX/) and every phase transition deserves a human gate — speed comes from know…
Skip ifMonolithic AI sessions that span PRD through testing, Shared context across unrelated requirements
vs seeds
spec-kit(outputs specs for human review), SpecCrew executes them. Unlike openspec (API-focused), SpecCrew treats API contracts a…
Primitive shape 79 total
Skills 71 Subagents 8
00

Summary

speccrew — Summary

Slug: speccrew GitHub: https://github.com/charlesmu99/speccrew Stars: 3 License: Apache-2.0 Version: v0.7.76 Language: JavaScript

One-Liner

npm-installable multi-agent spec factory — 8 specialized agents orchestrate PRD → feature design → API contract → system design → implementation → deployment → test with ISA-95 methodology and a 3-tier knowledge base.

Archetype

Skills-based multi-agent orchestrator with CLI dispatcher. Closest to trw-mcp in the current batch (multi-agent pipeline with human-confirmation gates) but ships as an npm package with no MCP layer and uses ISA-95 industrial modeling as its intellectual framework instead of value-oriented hooks.

Key Facts

  • Install: npm install -g speccrewspeccrew CLI binary from bin/cli.js
  • 8 agents: team-leader, product-manager, feature-designer, system-designer, system-developer, task-worker, test-manager, system-deployer
  • 71 skills prefixed speccrew- covering planning, analysis, coding, testing, and knowledge management
  • ISA-95 6-stage enterprise integration modeling as architectural methodology
  • 3-tier knowledge base: L1 technology docs (knowledge/techs/), L2 business domain docs (knowledge/bizs/), L3 iteration artifacts (iterations/iXXX/)
  • Iterations pattern: each requirement becomes an isolated project under iterations/iXXX-[name]/
  • 4-stage knowledge pipeline: scan → parallel analysis → parallel summarize → aggregate
  • Gates: all phase transitions require human confirmation
  • Multi-language README: 10 languages (Chinese primary + EN, JA, RU, ES, DE, FR, PT-BR, AR, HI)
  • Sub-agent auto-dispatch for context management; agents are dynamically created per tech stack
01

Overview

speccrew — Overview

What It Is

SpecCrew is an npm-installable multi-agent AI development orchestration system. Teams install it globally (npm install -g speccrew), initialize a project, then run the speccrew CLI to drive a complete software delivery pipeline from PRD authoring through deployment and testing.

The central thesis: treat software specification and delivery as an industrial process modeled on ISA-95 (the enterprise integration standard), where each "requirement" is an independent project iteration with its own isolated artifact directory.

Problem It Solves

Traditional AI-assisted development has agents that span too many concerns in a single context window, producing inconsistent output quality across phases. SpecCrew solves this by assigning hard-bounded specialists (product manager knows PRD, system designer knows API contracts, etc.) and enforcing phase gates with human confirmation before each handoff.

Target User

  • Development teams wanting to go from idea → deployed software with AI doing the heavy lifting at each phase
  • Teams with polyglot tech stacks (the dynamic agent creation per tech stack supports this)
  • International teams: 10 README languages with Chinese as the primary, suggesting a strong Chinese developer community target

Distribution

Positioning vs. Spec-Driven Seeds

  • Unlike spec-kit (which outputs spec documents for human review), SpecCrew executes the specs through agents automatically
  • Unlike openspec (OpenAPI-focused), SpecCrew generates API contracts as one phase of a broader pipeline
  • Unlike spec-driver (which focuses on test-driven spec enforcement), SpecCrew treats testing as the last phase after deployment
  • Closest analog in the seed set is a combination of taskmaster-ai (task breakdown) + bmad-method (role-based agent team) but with stronger industrial methodology grounding
02

Architecture

speccrew — Architecture

High-Level Structure

speccrew/
├── bin/cli.js                    # CLI entry point
├── .speccrew/
│   ├── agents/                   # 8 agent definitions
│   │   ├── speccrew-team-leader.md
│   │   ├── speccrew-product-manager.md
│   │   ├── speccrew-feature-designer.md
│   │   ├── speccrew-system-designer.md
│   │   ├── speccrew-system-developer.md
│   │   ├── speccrew-task-worker.md
│   │   ├── speccrew-test-manager.md
│   │   └── speccrew-system-deployer.md
│   └── skills/                   # 71 skills (speccrew-* prefix)
├── knowledge/
│   ├── techs/                    # L1: Technology documentation
│   └── bizs/                    # L2: Business domain documentation
└── iterations/
    └── iXXX-[name]/             # L3: Per-requirement isolated artifacts

Knowledge Tier Architecture

3-tier knowledge base:

Tier Location Content Scope
L1 knowledge/techs/ Technology docs, framework references Global / reused
L2 knowledge/bizs/ Business domain, glossary, context Global / reused
L3 iterations/iXXX/ Iteration-specific artifacts, PRD, designs Per-requirement

Knowledge Pipeline

4-stage pipeline for assembling context before each agent phase:

  1. Scan — discover relevant L1/L2/L3 files
  2. Parallel analysis — agents analyze files concurrently
  3. Parallel summarize — distill key facts per file
  4. Aggregate — combine into a unified context document

ISA-95 Methodology

SpecCrew uses ISA-95 (enterprise integration standard) as its modeling framework:

  • 6-stage system modeling decomposition
  • Clear separation between functional specifications and physical implementations
  • Equipment hierarchy modeling applied to software component hierarchy

Agent Dispatch Model

  • Team Leader coordinates phase transitions and dispatches to specialists
  • Sub-agent auto-dispatch for context management: when context grows too large, the team leader spawns a sub-agent to handle the overflow
  • Agents are dynamically created per tech stack — the system-designer agent generates stack-specific variants
  • All phase transitions require human confirmation before proceeding

Execution Environment

  • Runs inside Claude Code (.speccrew/agents/ and .speccrew/skills/ loaded by CC)
  • No MCP server — agents communicate through filesystem artifacts
  • iterations/iXXX-[name]/ acts as the state store per requirement
03

Components

speccrew — Components

CLI Binary

Command: speccrew Entry point: bin/cli.js Distribution: npm global install

Agents (8 total)

Agent Role Primary Skills
speccrew-team-leader Orchestrates phases, dispatches agents, manages gates speccrew-knowledge-*
speccrew-product-manager PRD authoring, requirement decomposition speccrew-pm-*
speccrew-feature-designer Feature design, UX/UI specs speccrew-fd-*
speccrew-system-designer API contracts, system architecture speccrew-sd-*
speccrew-system-developer Code implementation speccrew-dev-*
speccrew-task-worker Granular task execution (sub-tasks from developer) speccrew-dev-*
speccrew-test-manager Test planning and execution speccrew-test-*
speccrew-system-deployer Deployment configuration and execution (deploy skills)

Skills (71 total)

Skill naming convention: speccrew-{domain}-{action}

Domain prefixes:

  • speccrew-pm-* — product management (requirement analysis, PRD structuring)
  • speccrew-sd-* — system design (API contracts, data models)
  • speccrew-dev-* — development (implementation, code review)
  • speccrew-test-* — testing (test planning, test case generation)
  • speccrew-knowledge-* — knowledge pipeline (scan, analyze, summarize, aggregate)
  • Additional domain-specific skills for feature design and deployment

Knowledge Base

L1 — Technology docs (knowledge/techs/):

  • Framework references, library documentation
  • Tech stack decision records

L2 — Business domain (knowledge/bizs/):

  • Domain glossary
  • Business rules
  • Stakeholder context

L3 — Iteration artifacts (iterations/iXXX-[name]/):

  • PRD.md (product requirements)
  • feature-design.md
  • api-contract.yaml
  • system-design.md
  • implementation artifacts
  • deployment config
  • test results

Configuration Files

  • speccrew.config.js / speccrew.json — project configuration
  • .speccrew/agents/ — agent markdown definitions
  • .speccrew/skills/ — skill markdown definitions

Output Artifacts Per Iteration

Each iterations/iXXX-[name]/ directory contains the complete lineage from PRD through test results for one requirement.

05

Prompts

speccrew — Prompts & Instructions

Agent Definition Format

Agent definitions are markdown files in .speccrew/agents/. Each agent file declares:

  • Role and responsibility scope
  • Permitted skills (the skill IDs the agent may invoke)
  • Output format requirements (the artifact structure the agent must produce)
  • Handoff protocol (what to confirm before marking phase complete)

Team Leader Dispatch Pattern

The speccrew-team-leader agent orchestrates the pipeline. Its definition includes:

  • Phase sequence and gate conditions
  • Agent dispatch rules per phase
  • Human confirmation request phrasing ("Please confirm to proceed to [next phase]")
  • Sub-agent auto-dispatch rules for context overflow management

Skill Format

Skills are markdown files in .speccrew/skills/. Each skill defines:

  • Input requirements (what context must be present)
  • Step-by-step instructions for completing the skill
  • Output format specification
  • Quality criteria the output must meet

ISA-95 Prompt Vocabulary

SpecCrew embeds ISA-95 terminology into agent prompts:

  • "Functional specifications" vs "physical implementations" as distinct layers
  • Equipment hierarchy (applied to software: system → subsystem → component → module)
  • Level boundaries: business planning, operations management, operations control, physical processes

Knowledge Aggregation Prompt Pattern

The knowledge pipeline produces a structured context document fed to each phase agent:

## Technology Context
[aggregated from L1 knowledge/techs/]

## Business Context  
[aggregated from L2 knowledge/bizs/]

## Iteration Context
[aggregated from L3 iterations/iXXX/]

Human Gate Prompt

Before each phase transition, the team leader presents:

  • Summary of what was completed in the current phase
  • Key decisions made and artifacts produced
  • What the next phase will do
  • Explicit confirmation request: "Type 'yes' to proceed to [Phase Name]"

Multi-Language Support

README and potentially agent prompts are available in 10 languages. The Chinese-primary structure suggests the agent prompt files may contain bilingual annotations.

09

Uniqueness

speccrew — Uniqueness

Differs from Seeds

SpecCrew is closest to a combination of bmad-method (role-based agent team with a product manager, architect, developer) and taskmaster-ai (task breakdown and tracking), but distinguished by:

  1. ISA-95 industrial methodology — no other framework in the seeds or batch uses an enterprise integration standard as its intellectual backbone. This brings rigorous tier separation, level boundaries, and functional/physical distinction to AI-assisted development.

  2. 3-tier knowledge base with 4-stage pipeline — the scan→parallel analysis→parallel summarize→aggregate pattern for context assembly is more sophisticated than any seed's context management. Unlike agent-os (flat markdown rules), SpecCrew's knowledge architecture separates technology context (L1), business context (L2), and iteration context (L3) with deliberate staleness management.

  3. 71 skills with strict domain prefixingspeccrew-pm-*, speccrew-sd-*, speccrew-dev-*, speccrew-test-*, speccrew-knowledge-* — more skills than any seed, organized by agent role rather than by action type.

  4. "Each requirement as independent project" pattern (iterations/iXXX/) — unlike taskmaster-ai's flat task list or bmad's single-project focus, SpecCrew treats every requirement as a completely isolated artifact tree. This enables parallel requirement development and clean audit trails.

  5. 10-language README (Chinese-primary) — only multilingual framework in the seed set or this batch, indicating intentional non-English-first design.

Compared to spec-kit (outputs spec docs for human review): SpecCrew executes the specs through agents. Compared to openspec (API-focused): SpecCrew treats API contracts as one phase of a 7-phase pipeline. Compared to spec-driver (test-driven spec enforcement): SpecCrew tests last, not first.

Positioning

  • Target user: Teams wanting full-pipeline automation from idea to tested deployment, with human gates at each phase
  • Key differentiator: ISA-95 methodology + 3-tier knowledge architecture + per-requirement iteration isolation
  • Honest gap: 3 GitHub stars and v0.7.76 suggest active development but pre-adoption; the industrial methodology may have a steep learning curve for typical web dev teams

Observable Failure Modes

  1. 7 human gates — the "all transitions require confirmation" design maximizes safety but maximizes interruption; teams that want hands-off automation will be frustrated
  2. No test-first — testing is phase 7, not phase 1; bugs introduced in implementation aren't caught until the end
  3. Context overflow reliance — sub-agent auto-dispatch for context overflow is a runtime hedge, not a structured solution; behavior may be inconsistent
  4. ISA-95 mismatch — industrial modeling concepts (equipment hierarchy, level boundaries) may confuse teams unfamiliar with the standard
  5. CLI underdocumentedbin/cli.js exists but subcommand documentation is sparse
04

Workflow

speccrew — Workflow

Phase Pipeline

PRD → Feature Design → API Contract → System Design → Dev → Deployment → Test
       [GATE]          [GATE]          [GATE]          [GATE]  [GATE]       [GATE]

Every phase transition requires explicit human confirmation before the next agent is dispatched.

Phase Details

1. PRD (Product Manager)

  • Input: user's high-level requirement / idea
  • Agent: speccrew-product-manager
  • Output: iterations/iXXX-[name]/PRD.md
  • Activities: requirement analysis, user story decomposition, acceptance criteria definition
  • Gate: human reviews PRD, confirms completeness

2. Feature Design (Feature Designer)

  • Input: PRD.md
  • Agent: speccrew-feature-designer
  • Output: iterations/iXXX-[name]/feature-design.md
  • Activities: UX/UI specification, component inventory, interaction flows
  • Gate: human confirms design aligns with PRD intent

3. API Contract (System Designer)

  • Input: PRD.md + feature-design.md
  • Agent: speccrew-system-designer
  • Output: iterations/iXXX-[name]/api-contract.yaml
  • Activities: API endpoint definition, data model specification, integration contracts
  • Gate: human confirms API contract is complete

4. System Design (System Designer)

  • Input: all previous artifacts
  • Agent: speccrew-system-designer
  • Output: iterations/iXXX-[name]/system-design.md
  • Activities: component architecture, database schema, deployment topology
  • Gate: human confirms system design before coding begins

5. Development (System Developer + Task Workers)

  • Input: all previous artifacts
  • Agents: speccrew-system-developer dispatches speccrew-task-worker for sub-tasks
  • Output: implementation code, iterations/iXXX-[name]/implementation/
  • Activities: code generation, unit test stubs, code review
  • Gate: human reviews and approves implementation

6. Deployment (System Deployer)

  • Input: implementation + system design
  • Agent: speccrew-system-deployer
  • Output: deployment configuration, CI/CD manifests
  • Gate: human confirms deployment config before test

7. Testing (Test Manager)

  • Input: all artifacts + deployed system
  • Agent: speccrew-test-manager
  • Output: test plan, test results in iterations/iXXX-[name]/test-results/
  • Activities: test planning, test case execution, defect reporting

Iteration Model

  • Each requirement is treated as an independent project
  • Iteration IDs: iXXX (e.g., i001, i002)
  • Full artifact lineage preserved per iteration
  • No shared state between iterations except L1/L2 knowledge

Knowledge Pipeline Integration

Before each phase, the knowledge pipeline runs:

  1. Scan L1/L2/L3 for relevant context
  2. Parallel analysis of discovered files
  3. Parallel summarization
  4. Aggregation into context document fed to the phase agent
06

Memory Context

speccrew — Memory & Context

Memory Model

SpecCrew uses the filesystem as its primary memory system. All state is written to iterations/iXXX-[name]/ — no in-process memory, no database, no external persistence layer beyond the project directory.

Context Architecture

3-Tier Knowledge Base

Tier Path Persistence Scope
L1 knowledge/techs/ Permanent (team-maintained) Cross-iteration
L2 knowledge/bizs/ Permanent (team-maintained) Cross-iteration
L3 iterations/iXXX/ Per-iteration Single requirement

Knowledge Pipeline (4 stages)

Context is assembled fresh before each phase:

  1. Scan — find relevant files across all 3 tiers
  2. Parallel analysis — multiple agents analyze files concurrently
  3. Parallel summarize — distill each file to key facts
  4. Aggregate — combine into unified context document

This is SpecCrew's primary mechanism for managing context window limits — rather than dumping all files into context, it summarizes and aggregates.

Iteration Artifacts as State

Each iterations/iXXX-[name]/ directory is the complete state for one requirement:

  • PRD.md — source of truth for requirement
  • feature-design.md — UX/UI decisions
  • api-contract.yaml — interface contracts
  • system-design.md — architecture decisions
  • implementation/ — code artifacts
  • deployment config
  • test-results/ — test outcomes

These files serve as both the AI's working memory and the human team's audit trail.

Sub-Agent Context Management

When any phase agent's context grows too large, the team leader auto-dispatches a sub-agent to handle the overflow. This is SpecCrew's runtime context compaction mechanism — active delegation rather than summarization.

Cross-Iteration Reuse

L1/L2 knowledge accumulates across iterations. A technology decision made in iteration i001 is captured in knowledge/techs/ and available to all subsequent iterations. This creates a project-level memory that grows over time.

No Automated Memory

No session state, no vector store, no learning capture. The system relies entirely on structured file artifacts written by agents and read through the knowledge pipeline.

07

Orchestration

speccrew — Orchestration

Multi-Agent: Yes

SpecCrew is a multi-agent system with 8 specialized agents operating in a sequential pipeline with parallel knowledge aggregation steps.

Orchestration Pattern: Hierarchical + Sequential

  • Team Leader is the orchestrator — it coordinates phase transitions and dispatches specialist agents
  • Phases are sequential — each phase must complete and be human-confirmed before the next starts
  • Within phases, parallelism is used for the knowledge pipeline (analysis + summarization steps run in parallel)
  • Sub-agent auto-dispatch handles context overflow at runtime

Phase Gate Mechanism

Every phase transition is a human confirmation gate:

Team Leader → completes phase → presents summary → requests confirmation → dispatches next agent

No automated phase progression — human is always in the loop between phases.

Agent Lifecycle

Agents are dynamically created per tech stack: the system-designer and system-developer agents have stack-specific variants generated for the project's technology choices. This means the effective agent count exceeds 8 for complex polyglot projects.

Execution Mode: Multi-shot (iterative per phase)

Each phase involves multiple LLM calls — the agent may ask clarifying questions, refine its output, and iterate before marking the phase complete.

Max Concurrent Agents

Within the knowledge pipeline: up to N parallel analysis + summarization agents (bounded by file count). During phase execution: 1 primary agent + N sub-agents for context overflow.

Context Management Strategy

  • Proactive: knowledge pipeline summarizes before feeding to agents
  • Reactive: sub-agent auto-dispatch when context grows during execution
  • Isolation: each iteration (iXXX/) is a clean scope — no cross-iteration context bleed

Consensus Mechanism

None in the traditional sense — the Team Leader is authoritative. Human confirmation is the only external input between phases.

Crash Recovery

No automated crash recovery. If a phase fails mid-execution, the iteration directory contains whatever artifacts were written before failure. Teams can restart from the last confirmed phase.

Target Runtime

Claude Code (.speccrew/agents/ and .speccrew/skills/ are Claude Code agent/skill directories).

08

Ui Cli Surface

speccrew — UI / CLI Surface

CLI Binary

Binary: speccrew Entry point: bin/cli.js Install: npm install -g speccrew

Subcommands are not documented in detail in the repository, but the CLI drives:

  • Project initialization
  • Iteration creation (new iXXX-[name]/ directory)
  • Phase progression commands
  • Knowledge base management

Local UI

None. All interaction is through the CLI and the Claude Code agent interface.

Primary Interface Pattern

The main user experience is:

  1. Run speccrew CLI to initialize or manage iterations
  2. Claude Code picks up .speccrew/agents/ and .speccrew/skills/
  3. Team Leader agent guides the user through phases in the CC interface
  4. Human provides confirmations at each gate via the CC chat interface

Multi-Language Interface

README is available in 10 languages:

  • Chinese (primary/default)
  • English
  • Japanese
  • Russian
  • Spanish
  • German
  • French
  • Portuguese (BR)
  • Arabic
  • Hindi

This is the most multilingual framework in the batch — suggesting intentional global market targeting with Chinese developer community as primary audience.

No Web Dashboard

No local web server, no browser UI, no port binding. Unlike vibe-architect (Next.js, port 3000) or vibedoc (Gradio, port 7860) in this batch, SpecCrew is purely terminal + CC.

Related frameworks

same archetype · same primary tool · same memory type

BMAD-METHOD ★ 48k

Provides a full agile delivery lifecycle with named expert-persona AI collaborators that elicit the human's best thinking rather…

Agent OS ★ 4.6k

Extracts implicit codebase conventions into token-efficient markdown standards files and injects them selectively into AI agent…

Claude Conductor ★ 367

Gives Claude Code a persistent, cross-linked, auto-analyzed documentation system so it retains codebase context across sessions.

Spec-Driver (Greenfield Spec-Driven Development) ★ 25

Prevents spec rot in AI-assisted development by making implementation changes flow back into evergreen, authoritative specs via…

Anthropic Knowledge Work Plugins ★ 16k

Role-specialized plugin bundles with live MCP connectors that turn Claude into a domain expert for enterprise knowledge workers.

Codex Integration for Claude Code (skill-codex) ★ 1.3k

Single Claude Code skill that handles Codex CLI invocation correctly (stdin blocking, thinking token suppression, session resume)…