Aegis

aegis · BuildSomethingAI/aegis-framework · ★ 60 · last commit 2025-05-01

Gives AI assistants a shared project state machine so sessions don't lose context and tasks have verifiable state transitions.

Best whenAI-interpreted slash commands with mandatory checklists — the agent must check every box before reporting completion.

Skip ifUsing cp instead of mv for task state transitions, Running /aegis commands as shell commands

vs seeds

superpowersAegis has no hooks, no autonomous activation, no TDD enforcement, and no git automation.

Primitive shape 8 total

Commands 8

Summary

Aegis — Summary

Aegis is an AI-powered project management framework delivered as a .context/ folder drop-in that gives any AI assistant a shared understanding of project state, tasks, decisions, and sessions through structured markdown files and a /aegis command vocabulary.

Problem it solves: AI coding assistants lose context between sessions, lack a consistent way to track task state transitions, and have no shared memory of architectural decisions; Aegis imposes a .context/ directory convention with YAML-driven operation patterns (plan.yaml, task.yaml, etc.) so the agent always knows the project's current position.

Distinctive trait: All /aegis commands are interpreted by the AI, not executed in a terminal — they are protocol-level instructions that tell the AI how to update structured markdown files, enforcing a strict checklist before claiming any command complete.

Target audience: Individual developers and small teams using any AI assistant who want a consistent project state machine (planned → active → hold → completed) with human-readable audit trails stored in .context/sessions/ and .context/decisions/.

Production-readiness: Alpha-quality (60 GitHub stars), maintained by BuildSomethingAI, MIT licensed, last pushed May 2025.

Differs from seeds: Closest to agent-os (Archetype 4 — markdown-scaffold, zero-binary primitives); Aegis adds YAML operation specs in .context/ai/operations/ that define each command's validation, execution, and output steps more rigorously than agent-os's standards/ folder, but both rely entirely on the AI reading files rather than a CLI enforcing behavior.

Overview

Aegis — Overview

Origin

BuildSomethingAI, a startup building AI-assisted development tooling, released Aegis as an open-source framework (MIT). It ships as a single GitHub repo whose payload is entirely a .context/ folder and a COMMANDS.md reference. There is no separate SDK, CLI package, or IDE extension.

Philosophy

Aegis is built on a collaborative philosophy where humans and AI work together with a shared understanding of principles and practices. The README states documentation is dual-purpose: it guides Aegis's behavior while serving as a human reference. The framework's core bet is that transparent operation — documenting not just what to do, but why — yields more reliable AI behavior than opaque instructions.

Manifesto-style quotes (verbatim from AI_INSTRUCTIONS.md)

"CRITICAL: These rules must be followed for all command operations"

"/aegis commands are NEVER executed directly in the terminal — These commands are interpreted by the AI assistant, not the shell"

"If a step fails, retry with more specific instructions. If a file update fails, check permissions and retry. If validation fails, fix the issues before proceeding. Never skip steps in the command implementation."

"Common Failure Points: Incomplete command implementation, Missing decision records, Missing self-improvement updates, Outdated timestamps, Inconsistent state files"

Design Choices

YAML operation files in .context/ai/operations/ describe each command's steps declaratively — the AI reads these as operational contracts
Tasks move between filesystem folders (planned/, active/, hold/, completed/) using mv, never cp — the file's location IS its state
Self-improvement records are first-class artifacts; the agent is expected to learn and write reflections into the memory system
All commands have mandatory checklists with [ ] boxes the agent must mentally complete before claiming success

Architecture

Aegis — Architecture

Distribution

Standalone GitHub repo. No npm package, no pip package, no CLI binary. Install = clone/copy the .context/ folder into your project.

Install

# Clone or download, then copy .context/ into your project root
cp -r aegis-framework/.context /path/to/your-project/

Directory Tree

.context/
├── AI_INSTRUCTIONS.md           # Master instruction set for the AI
├── README.md                    # Human quick reference
├── ai/
│   └── operations/              # YAML command specs
│       ├── context.yaml
│       ├── help.yaml
│       ├── plan.yaml
│       ├── reconcile.yaml
│       ├── save.yaml
│       ├── session_triggers.yaml
│       ├── start.yaml
│       ├── status.yaml
│       ├── task.yaml
│       ├── task_creation.yaml
│       ├── task_transition.yaml
│       └── validation.yaml
├── memory/
│   ├── core/                    # Framework-level memory
│   ├── project/                 # Project context & decisions
│   └── session/                 # Active session state
├── templates/
│   ├── tasks/TEMPLATE.md
│   ├── sessions/TEMPLATE.md
│   └── decisions/TEMPLATE.md
├── tasks/
│   ├── planned/
│   ├── active/
│   ├── hold/
│   └── completed/
├── sessions/                    # Session logs
├── decisions/                   # Architectural decisions
└── plan/
    └── planning_document.md

Also ships COMMANDS.md and COMMANDS.json at root as human quick-reference cards.

Required Runtime

None. Pure markdown + YAML files. Any AI assistant that can read files works.

Target AI Tools

Any (Claude Code, Cursor, ChatGPT, Copilot). Cross-tool portability is high by design — the entire framework is passive markdown.

Config Files

.context/AI_INSTRUCTIONS.md — primary agent instructions
.context/ai/operations/*.yaml — per-command operation specs
COMMANDS.md — human+AI reference

Components

Aegis — Components

Commands (AI-interpreted slash commands — 8 total)

All commands use the /aegis prefix and are interpreted by the AI, not executed in a shell.

Command	Purpose
`/aegis plan`	Create/update planning document and generate tasks in `tasks/planned/`
`/aegis start`	Begin dev session — load context, set focus, validate task states
`/aegis save`	Preserve session progress — write session log, decisions, update tasks
`/aegis status`	Show current project state — active tasks, recent changes, focus
`/aegis task`	Manage tasks — create, update, transition between states
`/aegis context`	Quick context refresh — active tasks, recent decisions
`/aegis reconcile`	Check and repair framework consistency — validate all references
`/aegis help`	Show command help

Operation Specs (YAML files in `.context/ai/operations/`)

File	Purpose
`plan.yaml`	Defines validation, execution steps, output for `/aegis plan`
`start.yaml`	Defines session initialization sequence
`save.yaml`	Defines session save checklist and file update order
`status.yaml`	Defines status report generation
`task.yaml`	Defines task lifecycle management
`task_creation.yaml`	Defines new task creation from template
`task_transition.yaml`	Defines task state machine transitions
`context.yaml`	Defines context report generation
`reconcile.yaml`	Defines consistency check workflow
`session_triggers.yaml`	Defines automatic session events
`validation.yaml`	Shared validation rules referenced by all operations
`help.yaml`	Help command handler

Templates

Template	Purpose
`templates/tasks/TEMPLATE.md`	Task file with front matter (status, priority, dependencies, test_status)
`templates/sessions/TEMPLATE.md`	Session log with start/end timestamps, focus, decisions
`templates/decisions/TEMPLATE.md`	Architectural decision record

Memory System

Layer	Path	Purpose
Core	`.context/memory/core/`	Framework operational memory
Project	`.context/memory/project/`	Project context, project.json
Session	`.context/memory/session/`	Active session state

Scripts / Hooks

None — no programmatic hooks or scripts. All behavior is AI-interpreted.

Prompts

Aegis — Prompts

Excerpt 1: AI_INSTRUCTIONS.md — Command Execution Rules (verbatim)

File: .context/AI_INSTRUCTIONS.md Technique: Categorical prohibition + action list. Prevents the AI from confusing framework commands with shell execution.

## Command Execution Rules
> CRITICAL: These rules must be followed for all command operations

1. **Command Interpretation**
   - `/aegis` commands are NEVER executed directly in the terminal
   - These commands are interpreted by the AI assistant, not the shell
   - The AI should gather information from project files and present it to the user
   - NEVER use `run_terminal_cmd` to execute any `/aegis` command

2. **Information Gathering**
   - Use file reading and search tools to gather information
   - Process and organize information according to command requirements
   - Present information in a structured, readable format
   - Do not attempt to execute framework commands as shell commands

3. **Command vs. Shell Distinction**
   - `/aegis status` → AI interprets and presents project status
   - `/aegis plan` → AI helps with planning, doesn't run a shell command
   - `/aegis task` → AI manages task information, doesn't execute in terminal
   - All other `/aegis` commands follow the same pattern

Excerpt 2: AI_INSTRUCTIONS.md — /aegis save Checklist (verbatim)

File: .context/AI_INSTRUCTIONS.md Technique: Mandatory checkbox pattern. Forces sequential verification before completion claim.

#### `/aegis save` Checklist
- [ ] Update session document (status, progress, timestamps)
- [ ] Create decision record for significant decisions
- [ ] Update self-improvement record with new insights
- [ ] Update task progress and status
- [ ] Update project.json with current task lists
- [ ] Update timestamps to current time
- [ ] Verify all files have been properly updated
- [ ] Confirm completion only when ALL steps are checked

Excerpt 3: plan.yaml Operation Spec (verbatim)

File: .context/ai/operations/plan.yaml Technique: Declarative YAML step specification — the AI reads this as a contract for what each command must do, separating validation from execution from output.

steps:
  1_validate:
    - check_framework:
        rules:
          - framework_structure
          - required_files
          - permissions
    - verify_permissions:
        rules:
          - state_valid
          - can_plan
    
  2_prepare:
    - load_context:
        from: memory/project
        required: true
        validation:
          - exists
          - readable
          - valid_format
    - load_existing:
        file: planning_document.md
        required: false
        
  3_execute:
    - create_or_update:
        file: planning_document.md
        template: planning
        content:
          - project_overview
          - implementation_phases
          - task_breakdown
          - dependencies
        validation:
          - format_valid
          - content_complete

Uniqueness

Aegis — Uniqueness

Differs from Seeds

Aegis is most similar to agent-os (Archetype 4 — markdown scaffold). Both drop structured markdown files into a project and rely on the AI to read them; neither ships a CLI, MCP server, or hooks. Aegis's distinguishing delta is its YAML operation specification layer in .context/ai/operations/: each command is defined as a declarative YAML contract with validation steps, execution steps, and output requirements — something agent-os lacks entirely. Aegis also treats self-improvement records as first-class artifacts (the AI is instructed to write lessons-learned after every save), whereas agent-os is purely passive context. Compared to BMAD-METHOD, Aegis has no persona model and no skills system; it is simpler and more PM-focused than BMAD's persona-driven approach. Compared to superpowers, Aegis has no autonomous activation (no hooks), no TDD enforcement, and no git workflow automation — it is purely a project state tracking layer.

Positioning

Aegis positions itself as a "collaborative framework" — a shared protocol between human and AI for maintaining project state. It is not a coding agent harness, it is a PM/state-management overlay that any AI can use without being Claude Code-specific.

Distinctive Opinion

The YAML operation specs in .context/ai/operations/ are Aegis's most opinionated feature: by separating command definition (the YAML) from command instructions (AI_INSTRUCTIONS.md), Aegis creates a dual-layer specification that lets you modify command behavior without rewriting the master instruction file.

Observable Failure Modes

Token overhead: Loading all .context/ files at session start can consume significant context; mitigated by /aegis context (quick refresh) vs /aegis start (full load)
AI discipline dependency: The entire framework relies on the AI following the checklists faithfully; a non-compliant AI or truncated context window will silently skip steps
No enforcement: Since there are no hooks or tests, there is no way to verify the AI actually completed all checklist items — it self-reports
Timestamp staleness: The instructions warn repeatedly about "always use current year" — a symptom that AI models sometimes write stale timestamps

Workflow

Aegis — Workflow

Phases

Phase	Trigger	Key Actions	Artifacts
Planning	`/aegis plan`	AI creates planning doc, generates tasks in `planned/`	`plan/planning_document.md`, `tasks/planned/TASK-XXX.md`
Session Start	`/aegis start`	Load context, create session doc, set focus	`sessions/SESSION-YYYY-MM-DD.md`
Active Work	`/aegis task`	Transition task to `active/`, track progress	`tasks/active/TASK-XXX.md`
Save	`/aegis save`	Write session log, record decisions, update project.json	`sessions/`, `decisions/` updated
Status Check	`/aegis status`	Read all state files, generate report	Report (in conversation)
Reconcile	`/aegis reconcile`	Validate all references and consistency	Updated `.context/` files
Task Complete	`/aegis task` (complete)	Verify test_status, move task to `completed/`	`tasks/completed/TASK-XXX.md`

Task State Machine

planned/ → active/    (starting work)
active/  → completed/ (work done, test_status verified)
active/  → hold/      (blocked)
hold/    → active/    (unblocked)

State transitions use mv, never cp. The file's location IS its state.

Approval Gates

Test Status Gate: Before completing a task, AI must verify test_status front matter; if not "Passing" or "Not Applicable", must prompt user for confirmation
Command Checklist: Every command has a mandatory [ ] checklist in AI_INSTRUCTIONS.md — all boxes must be checked before the AI can report completion
Reconcile Gate: /aegis reconcile validates all cross-references; failures block the operation

Artifact Map

Phase	Artifact(s)
Planning	`plan/planning_document.md`, `tasks/planned/*.md`
Session Start	`sessions/SESSION-*.md`, `memory/session/` updated
Active Work	`tasks/active/*.md` with updated front matter
Save	`sessions/SESSION-.md`, `decisions/DECISION-.md`
Completion	`tasks/completed/*.md`, `memory/project/project.json`

Memory Context

Aegis — Memory & Context

State Storage

All state is file-based markdown/YAML within .context/. No external database, no MCP server.

Memory Layers

Layer	Path	Content	Persistence
Project Memory	`.context/memory/project/`	`project.json` — current task lists, overview; project context markdown	Project-scoped (git tracked)
Session Memory	`.context/memory/session/`	Active session state, current focus	Session-scoped (wiped/updated per session)
Core Memory	`.context/memory/core/`	Framework operational memory, self-improvement records	Project-scoped
Decisions	`.context/decisions/`	Architectural decision records (ADRs)	Project-scoped, permanent
Sessions	`.context/sessions/`	Session logs with timestamps, progress, insights	Project-scoped, append-only

Context Loading

The AI is instructed (in /aegis start checklist) to:

Load project context from memory/project/
Load active session from memory/session/
Load recent decisions from decisions/
Check task states across tasks/active/ and tasks/planned/

Cross-Session Handoff

Yes — sessions are persisted to .context/sessions/ and decisions to .context/decisions/. The /aegis start command loads previous session context. /aegis reconcile can verify and repair cross-session state consistency.

Context Compaction

No explicit compaction mechanism. The framework relies on the AI to read only what it needs from the file tree; no automatic summarization.

Self-Improvement

A dedicated "self-improvement record" is a first-class artifact — the AI is instructed to update it with new insights during every /aegis save. This is distinct from most frameworks: Aegis explicitly asks the agent to reflect on what it learned and write it back.

Orchestration

Aegis — Orchestration

Multi-Agent

No. Aegis is a single-agent framework. There is no mechanism to spawn sub-agents or run agents in parallel.

Orchestration Pattern

Sequential. All operations are sequential single-agent interactions following command checklists.

Execution Mode

Interactive-loop. The human issues an /aegis command, the AI executes the operation checklist, and the conversation continues.

Isolation Mechanism

None. All edits are in-place in the project's .context/ directory.

Multi-Model

No. The framework is model-agnostic and has no routing between multiple models.

Consensus Mechanism

None.

Prompt Chaining

Minimal. The /aegis start command loads context which seeds subsequent command execution, but there is no formal chaining where one command's output becomes another's prompt.

Crash Recovery

Partial. The /aegis reconcile command checks and repairs consistency. If the AI partially completes a command, running reconcile can identify and fix broken state.

Context Compaction

Not handled. No PreCompact hooks or compaction-aware mechanisms.

Ui Cli Surface

Aegis — UI / CLI Surface

CLI Binary

None. Aegis ships no executable. There is no aegis binary, no npm package, no pip package.

Local UI

None. No dashboard, no TUI, no web interface.

IDE Integration

None dedicated. Aegis works with any AI assistant that can read files — Claude Code, Cursor, ChatGPT, GitHub Copilot, etc.

Commands Surface

8 AI-interpreted slash commands via the /aegis prefix. These are instructions to the AI, not shell commands.

Observability

Audit trail: .context/sessions/ provides a log of every session with progress and decisions
Decision log: .context/decisions/ stores all architectural decisions
Task history: Task markdown files include timestamps, state transitions, and progress notes
Self-improvement: The framework explicitly asks the AI to record insights and lessons

Documentation Reference

External docs at https://buildsomething.ai/aegis/ (not shipped in repo)
COMMANDS.md — quick reference card in repo root
COMMANDS.json — machine-readable command reference
.context/README.md — human-readable structure overview

Related frameworks

same archetype · same primary tool · same memory type

BMAD-METHOD ★ 48k

A4 Markdown scaffold

Provides a full agile delivery lifecycle with named expert-persona AI collaborators that elicit the human's best thinking rather…

Agent OS ★ 4.6k

A4 Markdown scaffold

Extracts implicit codebase conventions into token-efficient markdown standards files and injects them selectively into AI agent…

Claude Conductor ★ 367

A4 Markdown scaffold

Gives Claude Code a persistent, cross-linked, auto-analyzed documentation system so it retains codebase context across sessions.

Spec-Driver (Greenfield Spec-Driven Development) ★ 25

A4 Markdown scaffold

Prevents spec rot in AI-assisted development by making implementation changes flow back into evergreen, authoritative specs via…

Anthropic Knowledge Work Plugins ★ 16k

A4 Markdown scaffold

Role-specialized plugin bundles with live MCP connectors that turn Claude into a domain expert for enterprise knowledge workers.

Codex Integration for Claude Code (skill-codex) ★ 1.3k

A4 Markdown scaffold

Single Claude Code skill that handles Codex CLI invocation correctly (stdin blocking, thinking token suppression, session resume)…

Distribution

Type: standalone-repo
License: MIT
Install: clone-and-configure

Surfaces

CLI binary: No
CLI subcmds: 0
Local UI: No

Components

Commands: 8
Skills: 0
Subagents: 0
Hooks: 0
MCP servers: 0
MCP tools: 0
Scripts: 0
Templates: 3

Workflow

Phases: 7
Approval gates: 2
Spec format: markdown
Spec storage: per-feature-folder
Delta or full: whole-file

Orchestration

Multi-agent: No
Pattern: sequential
Max concurrent: 1
Isolation: none
Consensus: none
Prompt chaining: No

Multi-model

Multi-model: No
BYOK: Yes
Modal: text

Execution

Mode: interactive-loop
Crash recovery: Yes
Compaction: No
Session handoff: Yes
Streaming: No

Memory

Type: file-based
Persistence: project
Search: none
State files: 5 files

Quality

TDD: Optional
TDD mechanism: none
Self-review: none

Git / Observability

Auto commit: No
Auto PR: No
Auto merge: No
Worktree/feat: No
Audit log: Yes
Audit format: structured-md
Replay: No

Tools

Primary: claude-code
Targets: 5
Portability: high

Signals

Stars: 60
Last commit: 2025-05-01
Contributors: 2
Maintainer: active
Quality score: 3.5/10

Summary

Aegis — Summary

Overview

Aegis — Overview

Origin

Philosophy

Manifesto-style quotes (verbatim from AI_INSTRUCTIONS.md)

Design Choices

Architecture

Aegis — Architecture

Distribution

Install

Directory Tree

Required Runtime

Target AI Tools

Config Files

Components

Aegis — Components

Commands (AI-interpreted slash commands — 8 total)

Operation Specs (YAML files in .context/ai/operations/)

Templates

Memory System

Scripts / Hooks

Prompts

Aegis — Prompts

Excerpt 1: AI_INSTRUCTIONS.md — Command Execution Rules (verbatim)

Excerpt 2: AI_INSTRUCTIONS.md — /aegis save Checklist (verbatim)

Excerpt 3: plan.yaml Operation Spec (verbatim)

Uniqueness

Aegis — Uniqueness

Differs from Seeds

Positioning

Distinctive Opinion

Observable Failure Modes

Workflow

Aegis — Workflow

Phases

Task State Machine

Approval Gates

Artifact Map

Memory Context

Aegis — Memory & Context

State Storage

Memory Layers

Context Loading

Cross-Session Handoff

Context Compaction

Self-Improvement

Orchestration

Aegis — Orchestration

Multi-Agent

Orchestration Pattern

Execution Mode

Isolation Mechanism

Multi-Model

Consensus Mechanism

Prompt Chaining

Crash Recovery

Context Compaction

Ui Cli Surface

Aegis — UI / CLI Surface

CLI Binary

Local UI

IDE Integration

Commands Surface

Observability

Documentation Reference

Related frameworks

Operation Specs (YAML files in `.context/ai/operations/`)