Skip to content
/

Aegis

aegis · BuildSomethingAI/aegis-framework · ★ 60 · last commit 2025-05-01

Gives AI assistants a shared project state machine so sessions don't lose context and tasks have verifiable state transitions.

Best whenAI-interpreted slash commands with mandatory checklists — the agent must check every box before reporting completion.
Skip ifUsing cp instead of mv for task state transitions, Running /aegis commands as shell commands
vs seeds
superpowersAegis has no hooks, no autonomous activation, no TDD enforcement, and no git automation.
Primitive shape 8 total
Commands 8
00

Summary

Aegis — Summary

Aegis is an AI-powered project management framework delivered as a .context/ folder drop-in that gives any AI assistant a shared understanding of project state, tasks, decisions, and sessions through structured markdown files and a /aegis command vocabulary.

Problem it solves: AI coding assistants lose context between sessions, lack a consistent way to track task state transitions, and have no shared memory of architectural decisions; Aegis imposes a .context/ directory convention with YAML-driven operation patterns (plan.yaml, task.yaml, etc.) so the agent always knows the project's current position.

Distinctive trait: All /aegis commands are interpreted by the AI, not executed in a terminal — they are protocol-level instructions that tell the AI how to update structured markdown files, enforcing a strict checklist before claiming any command complete.

Target audience: Individual developers and small teams using any AI assistant who want a consistent project state machine (planned → active → hold → completed) with human-readable audit trails stored in .context/sessions/ and .context/decisions/.

Production-readiness: Alpha-quality (60 GitHub stars), maintained by BuildSomethingAI, MIT licensed, last pushed May 2025.

Differs from seeds: Closest to agent-os (Archetype 4 — markdown-scaffold, zero-binary primitives); Aegis adds YAML operation specs in .context/ai/operations/ that define each command's validation, execution, and output steps more rigorously than agent-os's standards/ folder, but both rely entirely on the AI reading files rather than a CLI enforcing behavior.

01

Overview

Aegis — Overview

Origin

BuildSomethingAI, a startup building AI-assisted development tooling, released Aegis as an open-source framework (MIT). It ships as a single GitHub repo whose payload is entirely a .context/ folder and a COMMANDS.md reference. There is no separate SDK, CLI package, or IDE extension.

Philosophy

Aegis is built on a collaborative philosophy where humans and AI work together with a shared understanding of principles and practices. The README states documentation is dual-purpose: it guides Aegis's behavior while serving as a human reference. The framework's core bet is that transparent operation — documenting not just what to do, but why — yields more reliable AI behavior than opaque instructions.

Manifesto-style quotes (verbatim from AI_INSTRUCTIONS.md)

"CRITICAL: These rules must be followed for all command operations"

"/aegis commands are NEVER executed directly in the terminal — These commands are interpreted by the AI assistant, not the shell"

"If a step fails, retry with more specific instructions. If a file update fails, check permissions and retry. If validation fails, fix the issues before proceeding. Never skip steps in the command implementation."

"Common Failure Points: Incomplete command implementation, Missing decision records, Missing self-improvement updates, Outdated timestamps, Inconsistent state files"

Design Choices

  • YAML operation files in .context/ai/operations/ describe each command's steps declaratively — the AI reads these as operational contracts
  • Tasks move between filesystem folders (planned/, active/, hold/, completed/) using mv, never cp — the file's location IS its state
  • Self-improvement records are first-class artifacts; the agent is expected to learn and write reflections into the memory system
  • All commands have mandatory checklists with [ ] boxes the agent must mentally complete before claiming success
02

Architecture

Aegis — Architecture

Distribution

Standalone GitHub repo. No npm package, no pip package, no CLI binary. Install = clone/copy the .context/ folder into your project.

Install

# Clone or download, then copy .context/ into your project root
cp -r aegis-framework/.context /path/to/your-project/

Directory Tree

.context/
├── AI_INSTRUCTIONS.md           # Master instruction set for the AI
├── README.md                    # Human quick reference
├── ai/
│   └── operations/              # YAML command specs
│       ├── context.yaml
│       ├── help.yaml
│       ├── plan.yaml
│       ├── reconcile.yaml
│       ├── save.yaml
│       ├── session_triggers.yaml
│       ├── start.yaml
│       ├── status.yaml
│       ├── task.yaml
│       ├── task_creation.yaml
│       ├── task_transition.yaml
│       └── validation.yaml
├── memory/
│   ├── core/                    # Framework-level memory
│   ├── project/                 # Project context & decisions
│   └── session/                 # Active session state
├── templates/
│   ├── tasks/TEMPLATE.md
│   ├── sessions/TEMPLATE.md
│   └── decisions/TEMPLATE.md
├── tasks/
│   ├── planned/
│   ├── active/
│   ├── hold/
│   └── completed/
├── sessions/                    # Session logs
├── decisions/                   # Architectural decisions
└── plan/
    └── planning_document.md

Also ships COMMANDS.md and COMMANDS.json at root as human quick-reference cards.

Required Runtime

None. Pure markdown + YAML files. Any AI assistant that can read files works.

Target AI Tools

Any (Claude Code, Cursor, ChatGPT, Copilot). Cross-tool portability is high by design — the entire framework is passive markdown.

Config Files

  • .context/AI_INSTRUCTIONS.md — primary agent instructions
  • .context/ai/operations/*.yaml — per-command operation specs
  • COMMANDS.md — human+AI reference
03

Components

Aegis — Components

Commands (AI-interpreted slash commands — 8 total)

All commands use the /aegis prefix and are interpreted by the AI, not executed in a shell.

Command Purpose
/aegis plan Create/update planning document and generate tasks in tasks/planned/
/aegis start Begin dev session — load context, set focus, validate task states
/aegis save Preserve session progress — write session log, decisions, update tasks
/aegis status Show current project state — active tasks, recent changes, focus
/aegis task Manage tasks — create, update, transition between states
/aegis context Quick context refresh — active tasks, recent decisions
/aegis reconcile Check and repair framework consistency — validate all references
/aegis help Show command help

Operation Specs (YAML files in .context/ai/operations/)

File Purpose
plan.yaml Defines validation, execution steps, output for /aegis plan
start.yaml Defines session initialization sequence
save.yaml Defines session save checklist and file update order
status.yaml Defines status report generation
task.yaml Defines task lifecycle management
task_creation.yaml Defines new task creation from template
task_transition.yaml Defines task state machine transitions
context.yaml Defines context report generation
reconcile.yaml Defines consistency check workflow
session_triggers.yaml Defines automatic session events
validation.yaml Shared validation rules referenced by all operations
help.yaml Help command handler

Templates

Template Purpose
templates/tasks/TEMPLATE.md Task file with front matter (status, priority, dependencies, test_status)
templates/sessions/TEMPLATE.md Session log with start/end timestamps, focus, decisions
templates/decisions/TEMPLATE.md Architectural decision record

Memory System

Layer Path Purpose
Core .context/memory/core/ Framework operational memory
Project .context/memory/project/ Project context, project.json
Session .context/memory/session/ Active session state

Scripts / Hooks

None — no programmatic hooks or scripts. All behavior is AI-interpreted.

05

Prompts

Aegis — Prompts

Excerpt 1: AI_INSTRUCTIONS.md — Command Execution Rules (verbatim)

File: .context/AI_INSTRUCTIONS.md Technique: Categorical prohibition + action list. Prevents the AI from confusing framework commands with shell execution.

## Command Execution Rules
> CRITICAL: These rules must be followed for all command operations

1. **Command Interpretation**
   - `/aegis` commands are NEVER executed directly in the terminal
   - These commands are interpreted by the AI assistant, not the shell
   - The AI should gather information from project files and present it to the user
   - NEVER use `run_terminal_cmd` to execute any `/aegis` command

2. **Information Gathering**
   - Use file reading and search tools to gather information
   - Process and organize information according to command requirements
   - Present information in a structured, readable format
   - Do not attempt to execute framework commands as shell commands

3. **Command vs. Shell Distinction**
   - `/aegis status` → AI interprets and presents project status
   - `/aegis plan` → AI helps with planning, doesn't run a shell command
   - `/aegis task` → AI manages task information, doesn't execute in terminal
   - All other `/aegis` commands follow the same pattern

Excerpt 2: AI_INSTRUCTIONS.md — /aegis save Checklist (verbatim)

File: .context/AI_INSTRUCTIONS.md Technique: Mandatory checkbox pattern. Forces sequential verification before completion claim.

#### `/aegis save` Checklist
- [ ] Update session document (status, progress, timestamps)
- [ ] Create decision record for significant decisions
- [ ] Update self-improvement record with new insights
- [ ] Update task progress and status
- [ ] Update project.json with current task lists
- [ ] Update timestamps to current time
- [ ] Verify all files have been properly updated
- [ ] Confirm completion only when ALL steps are checked

Excerpt 3: plan.yaml Operation Spec (verbatim)

File: .context/ai/operations/plan.yaml Technique: Declarative YAML step specification — the AI reads this as a contract for what each command must do, separating validation from execution from output.

steps:
  1_validate:
    - check_framework:
        rules:
          - framework_structure
          - required_files
          - permissions
    - verify_permissions:
        rules:
          - state_valid
          - can_plan
    
  2_prepare:
    - load_context:
        from: memory/project
        required: true
        validation:
          - exists
          - readable
          - valid_format
    - load_existing:
        file: planning_document.md
        required: false
        
  3_execute:
    - create_or_update:
        file: planning_document.md
        template: planning
        content:
          - project_overview
          - implementation_phases
          - task_breakdown
          - dependencies
        validation:
          - format_valid
          - content_complete
09

Uniqueness

Aegis — Uniqueness

Differs from Seeds

Aegis is most similar to agent-os (Archetype 4 — markdown scaffold). Both drop structured markdown files into a project and rely on the AI to read them; neither ships a CLI, MCP server, or hooks. Aegis's distinguishing delta is its YAML operation specification layer in .context/ai/operations/: each command is defined as a declarative YAML contract with validation steps, execution steps, and output requirements — something agent-os lacks entirely. Aegis also treats self-improvement records as first-class artifacts (the AI is instructed to write lessons-learned after every save), whereas agent-os is purely passive context. Compared to BMAD-METHOD, Aegis has no persona model and no skills system; it is simpler and more PM-focused than BMAD's persona-driven approach. Compared to superpowers, Aegis has no autonomous activation (no hooks), no TDD enforcement, and no git workflow automation — it is purely a project state tracking layer.

Positioning

Aegis positions itself as a "collaborative framework" — a shared protocol between human and AI for maintaining project state. It is not a coding agent harness, it is a PM/state-management overlay that any AI can use without being Claude Code-specific.

Distinctive Opinion

The YAML operation specs in .context/ai/operations/ are Aegis's most opinionated feature: by separating command definition (the YAML) from command instructions (AI_INSTRUCTIONS.md), Aegis creates a dual-layer specification that lets you modify command behavior without rewriting the master instruction file.

Observable Failure Modes

  1. Token overhead: Loading all .context/ files at session start can consume significant context; mitigated by /aegis context (quick refresh) vs /aegis start (full load)
  2. AI discipline dependency: The entire framework relies on the AI following the checklists faithfully; a non-compliant AI or truncated context window will silently skip steps
  3. No enforcement: Since there are no hooks or tests, there is no way to verify the AI actually completed all checklist items — it self-reports
  4. Timestamp staleness: The instructions warn repeatedly about "always use current year" — a symptom that AI models sometimes write stale timestamps
04

Workflow

Aegis — Workflow

Phases

Phase Trigger Key Actions Artifacts
Planning /aegis plan AI creates planning doc, generates tasks in planned/ plan/planning_document.md, tasks/planned/TASK-XXX.md
Session Start /aegis start Load context, create session doc, set focus sessions/SESSION-YYYY-MM-DD.md
Active Work /aegis task Transition task to active/, track progress tasks/active/TASK-XXX.md
Save /aegis save Write session log, record decisions, update project.json sessions/, decisions/ updated
Status Check /aegis status Read all state files, generate report Report (in conversation)
Reconcile /aegis reconcile Validate all references and consistency Updated .context/ files
Task Complete /aegis task (complete) Verify test_status, move task to completed/ tasks/completed/TASK-XXX.md

Task State Machine

planned/ → active/    (starting work)
active/  → completed/ (work done, test_status verified)
active/  → hold/      (blocked)
hold/    → active/    (unblocked)

State transitions use mv, never cp. The file's location IS its state.

Approval Gates

  1. Test Status Gate: Before completing a task, AI must verify test_status front matter; if not "Passing" or "Not Applicable", must prompt user for confirmation
  2. Command Checklist: Every command has a mandatory [ ] checklist in AI_INSTRUCTIONS.md — all boxes must be checked before the AI can report completion
  3. Reconcile Gate: /aegis reconcile validates all cross-references; failures block the operation

Artifact Map

Phase Artifact(s)
Planning plan/planning_document.md, tasks/planned/*.md
Session Start sessions/SESSION-*.md, memory/session/ updated
Active Work tasks/active/*.md with updated front matter
Save sessions/SESSION-*.md, decisions/DECISION-*.md
Completion tasks/completed/*.md, memory/project/project.json
06

Memory Context

Aegis — Memory & Context

State Storage

All state is file-based markdown/YAML within .context/. No external database, no MCP server.

Memory Layers

Layer Path Content Persistence
Project Memory .context/memory/project/ project.json — current task lists, overview; project context markdown Project-scoped (git tracked)
Session Memory .context/memory/session/ Active session state, current focus Session-scoped (wiped/updated per session)
Core Memory .context/memory/core/ Framework operational memory, self-improvement records Project-scoped
Decisions .context/decisions/ Architectural decision records (ADRs) Project-scoped, permanent
Sessions .context/sessions/ Session logs with timestamps, progress, insights Project-scoped, append-only

Context Loading

The AI is instructed (in /aegis start checklist) to:

  1. Load project context from memory/project/
  2. Load active session from memory/session/
  3. Load recent decisions from decisions/
  4. Check task states across tasks/active/ and tasks/planned/

Cross-Session Handoff

Yes — sessions are persisted to .context/sessions/ and decisions to .context/decisions/. The /aegis start command loads previous session context. /aegis reconcile can verify and repair cross-session state consistency.

Context Compaction

No explicit compaction mechanism. The framework relies on the AI to read only what it needs from the file tree; no automatic summarization.

Self-Improvement

A dedicated "self-improvement record" is a first-class artifact — the AI is instructed to update it with new insights during every /aegis save. This is distinct from most frameworks: Aegis explicitly asks the agent to reflect on what it learned and write it back.

07

Orchestration

Aegis — Orchestration

Multi-Agent

No. Aegis is a single-agent framework. There is no mechanism to spawn sub-agents or run agents in parallel.

Orchestration Pattern

Sequential. All operations are sequential single-agent interactions following command checklists.

Execution Mode

Interactive-loop. The human issues an /aegis command, the AI executes the operation checklist, and the conversation continues.

Isolation Mechanism

None. All edits are in-place in the project's .context/ directory.

Multi-Model

No. The framework is model-agnostic and has no routing between multiple models.

Consensus Mechanism

None.

Prompt Chaining

Minimal. The /aegis start command loads context which seeds subsequent command execution, but there is no formal chaining where one command's output becomes another's prompt.

Crash Recovery

Partial. The /aegis reconcile command checks and repairs consistency. If the AI partially completes a command, running reconcile can identify and fix broken state.

Context Compaction

Not handled. No PreCompact hooks or compaction-aware mechanisms.

08

Ui Cli Surface

Aegis — UI / CLI Surface

CLI Binary

None. Aegis ships no executable. There is no aegis binary, no npm package, no pip package.

Local UI

None. No dashboard, no TUI, no web interface.

IDE Integration

None dedicated. Aegis works with any AI assistant that can read files — Claude Code, Cursor, ChatGPT, GitHub Copilot, etc.

Commands Surface

8 AI-interpreted slash commands via the /aegis prefix. These are instructions to the AI, not shell commands.

Observability

  • Audit trail: .context/sessions/ provides a log of every session with progress and decisions
  • Decision log: .context/decisions/ stores all architectural decisions
  • Task history: Task markdown files include timestamps, state transitions, and progress notes
  • Self-improvement: The framework explicitly asks the AI to record insights and lessons

Documentation Reference

  • External docs at https://buildsomething.ai/aegis/ (not shipped in repo)
  • COMMANDS.md — quick reference card in repo root
  • COMMANDS.json — machine-readable command reference
  • .context/README.md — human-readable structure overview

Related frameworks

same archetype · same primary tool · same memory type

BMAD-METHOD ★ 48k

Provides a full agile delivery lifecycle with named expert-persona AI collaborators that elicit the human's best thinking rather…

Agent OS ★ 4.6k

Extracts implicit codebase conventions into token-efficient markdown standards files and injects them selectively into AI agent…

Claude Conductor ★ 367

Gives Claude Code a persistent, cross-linked, auto-analyzed documentation system so it retains codebase context across sessions.

Spec-Driver (Greenfield Spec-Driven Development) ★ 25

Prevents spec rot in AI-assisted development by making implementation changes flow back into evergreen, authoritative specs via…

Anthropic Knowledge Work Plugins ★ 16k

Role-specialized plugin bundles with live MCP connectors that turn Claude into a domain expert for enterprise knowledge workers.

Codex Integration for Claude Code (skill-codex) ★ 1.3k

Single Claude Code skill that handles Codex CLI invocation correctly (stdin blocking, thinking token suppression, session resume)…