Skip to content
/

VoltAgent awesome-codex-subagents

voltagent-codex-subagents · VoltAgent/awesome-codex-subagents · ★ 4.9k · last commit 2026-05-25

Primitive shape 166 total
Subagents 166
00

Summary

VoltAgent awesome-codex-subagents — Summary

VoltAgent awesome-codex-subagents is the OpenAI Codex CLI counterpart to awesome-claude-code-subagents, providing 166+ subagents in TOML format across 13 categories. The TOML format is Codex-native with fields like model, model_reasoning_effort, sandbox_mode, and developer_instructions — distinct from Claude Code's YAML frontmatter + markdown body format. Three additional categories beyond the 10 in the Claude version cover AI governance/safety (5 agents), platform engineering/IDP (5 agents), and LLMOps/evals/observability (5 agents). Smart model routing assigns GPT-5.4 to deep-reasoning tasks (architecture reviews, security audits, financial logic) and GPT-5.3-codex-spark to fast scanning/synthesis. Sandbox mode philosophy distinguishes read-only agents (reviewers, auditors) from workspace-write agents (developers, engineers) at the TOML level — a security-by-default pattern absent from the Claude Code version. Compared to voltagent-subagents, this repo is architecturally identical in catalog philosophy but uses TOML format instead of Markdown and targets a different runtime (Codex CLI instead of Claude Code). The relationship between these two VoltAgent repos is the most explicit example of dual-harness mirroring in this batch.

01

Overview

VoltAgent awesome-codex-subagents — Overview

Origin

Same maintainer as awesome-claude-code-subagents (VoltAgent). 4,894 stars, last push 2026-05-25. MIT license. "Written specifically for Codex and aligned with the official docs."

Philosophy

From README:

"This repository serves as the definitive collection of Codex Subagents, specialized AI assistants designed for specific development tasks. Written specifically for Codex and aligned with the official docs."

Key differences from Claude Code version:

  1. TOML format (Codex-native) vs Markdown
  2. sandbox_mode field — explicit security boundary per agent
  3. model_reasoning_effort — controls reasoning depth
  4. developer_instructions replaces markdown body
  5. GPT model routing instead of Claude model routing

TOML Format Philosophy

From README:

"Each subagent includes a model field that automatically routes it to the right model — balancing quality and cost."

Sandbox mode:

"Read-only agents (reviewers, auditors): sandbox_mode = 'read-only' — analyze without modifying. Workspace-write agents (developers, engineers): sandbox_mode = 'workspace-write' — create and modify files."

Installation Model

More manual than Claude Code version:

mkdir -p ~/.codex/agents
cp categories/01-core-development/backend-developer.toml ~/.codex/agents/

No interactive installer, no plugin marketplace integration. Codex does NOT auto-spawn custom subagents — requires explicit delegation in prompts.

Three Additional Categories

Categories 11-13 extend beyond the 10 in the Claude Code version:

  • 11-ai-governance-safety — AI safety and governance specialists
  • 12-platform-engineering-idp — Internal Developer Platform engineers
  • 13-llmops-evals-observability — LLMOps, evaluation, and observability specialists
02

Architecture

VoltAgent awesome-codex-subagents — Architecture

Distribution

Manual copy. No plugin marketplace, no installer script.

# Global agents
mkdir -p ~/.codex/agents
cp categories/01-core-development/backend-developer.toml ~/.codex/agents/

# Project-specific
mkdir -p .codex/agents
cp categories/04-quality-security/reviewer.toml .codex/agents/

Directory Structure

categories/
  01-core-development/         # 12 agents (.toml)
  02-language-specialists/     # 31 agents (.toml)
  03-infrastructure/           # 16 agents (.toml)
  04-quality-security/         # 19 agents (.toml)
  05-data-ai/                  # 13 agents (.toml)
  06-developer-experience/     # 15 agents (.toml)
  07-specialized-domains/      # 13 agents (.toml)
  08-business-product/         # 16 agents (.toml)
  09-meta-orchestration/       # 11 agents (.toml)
  10-research-analysis/        # 9 agents (.toml)
  11-ai-governance-safety/     # 4 agents (.toml)  [NEW vs Claude version]
  12-platform-engineering-idp/ # 4 agents (.toml)  [NEW]
  13-llmops-evals-observability/ # 4 agents (.toml) [NEW]

Agent Format (TOML)

name = "backend-developer"
description = "Use when a task needs scoped backend implementation..."
model = "gpt-5.4"
model_reasoning_effort = "high"
sandbox_mode = "workspace-write"
developer_instructions = """
Own backend changes as production behavior...
"""

Storage Locations

Type Path Availability Precedence
Project Subagents .codex/agents/ Current project Higher
Global Subagents ~/.codex/agents/ All projects Lower

Project-specific overrides global on naming conflict.

Target AI Tools

OpenAI Codex CLI only.

Required Runtime

Codex CLI (OpenAI).

03

Components

VoltAgent awesome-codex-subagents — Components

Totals

Type Count
Subagents (TOML) 166+
Commands 0
Skills 0
Hooks 0

Category Breakdown (166 agents across 13 categories)

Category Count
01 Core Development 12
02 Language Specialists 31
03 Infrastructure 16
04 Quality & Security 19
05 Data & AI 13
06 Developer Experience 15
07 Specialized Domains 13
08 Business & Product 16
09 Meta & Orchestration 11
10 Research & Analysis 9
11 AI Governance & Safety 4
12 Platform Engineering / IDP 4
13 LLMOps, Evals & Observability 4

Three New Categories (vs Claude Code version)

11 AI Governance & Safety

Agents covering AI policy compliance, model safety auditing, AI ethics, bias detection, and governance frameworks.

12 Platform Engineering / IDP

Internal Developer Platform specialists: platform tooling, self-service infrastructure, developer portals, IDP architecture.

13 LLMOps, Evals & Observability

LLM operations specialists: model evaluation pipelines, observability instrumentation, evals framework design, monitoring for AI systems.

Model Routing

Model When Used
gpt-5.4 Deep reasoning: architecture reviews, security audits, financial logic
gpt-5.3-codex-spark Fast scanning, synthesis, lighter research tasks

Sandbox Modes

Mode Agents
read-only Reviewers, auditors, analyzers
workspace-write Developers, engineers, implementers

Sample TOML (backend-developer)

name = "backend-developer"
description = "Use when a task needs scoped backend implementation..."
model = "gpt-5.4"
model_reasoning_effort = "high"
sandbox_mode = "workspace-write"
developer_instructions = """
Own backend changes as production behavior with explicit data, auth, and failure-path integrity.

Working mode:
1. Map entry point, domain logic boundary, and persistence side effects.
2. Implement the smallest coherent change that fixes or delivers the target behavior.
3. Validate behavior under normal and high-risk failure paths.

Focus on:
- request/event entry points and service boundary ownership
- input validation and contract-safe output behavior
...

Quality checks:
- validate one critical success path and one high-risk failure path
"""

Representative Agents

  • Category 01: api-designer, code-mapper (NEW), ui-fixer (NEW), backend-developer, frontend-developer
  • Category 02: typescript-pro, python-pro, golang-pro, rust-engineer
  • Category 03: kubernetes-specialist, terraform-engineer, cloud-architect
  • Category 04: security-auditor, code-reviewer, qa-expert
  • Category 09: agent-installer, workflow-orchestrator, multi-agent-coordinator
05

Prompts

VoltAgent awesome-codex-subagents — Prompt Excerpts

Excerpt 1: backend-developer.toml

Technique: Minimal, surgery-focused instructions (contrast with Claude version's checklist)

name = "backend-developer"
description = "Use when a task needs scoped backend implementation or backend bug fixes after the owning path is known."
model = "gpt-5.4"
model_reasoning_effort = "high"
sandbox_mode = "workspace-write"
developer_instructions = """
Own backend changes as production behavior with explicit data, auth, and failure-path integrity.

Working mode:
1. Map entry point, domain logic boundary, and persistence side effects.
2. Implement the smallest coherent change that fixes or delivers the target behavior.
3. Validate behavior under normal and high-risk failure paths.

Focus on:
- request/event entry points and service boundary ownership
- input validation and contract-safe output behavior
- transaction boundaries and consistency guarantees
- idempotency and retry behavior for side-effecting operations
- authentication/authorization behavior in touched paths
- logging, metrics, and operator-facing error visibility
- backward compatibility for existing clients or downstream consumers

Implementation checks:
- avoid hidden side effects in shared helpers
- keep domain logic centralized, not split across adapters/controllers
- preserve existing behavior outside changed scope
- make failure semantics explicit

Quality checks:
- validate one critical success path and one high-risk failure path
"""

Analysis: Significantly more terse and surgical than the Claude Code version of the same agent. "Implement the smallest coherent change" reflects a minimal-footprint philosophy. The description says "after the owning path is known" — presupposes a discovery agent ran first (like code-mapper). Model gpt-5.4 with model_reasoning_effort = "high" shows explicit compute budget control.

Comparison: Same Agent, Two Formats

Dimension Claude Code (MD) Codex (TOML)
Format YAML frontmatter + MD body Pure TOML
Instructions Checklists + requirement tables Terse surgical focus
Model model: sonnet model = "gpt-5.4"
Sandbox implicit sandbox_mode = "workspace-write"
Auto-spawn Yes No — explicit delegation only
09

Uniqueness

VoltAgent awesome-codex-subagents — Uniqueness & Positioning

Differs From Seeds

No seed targets Codex CLI as primary tool — this is the only Codex-native catalog in this batch. The TOML format with sandbox_mode and model_reasoning_effort fields has no equivalent in any seed framework. The dual-repo strategy (pairing with voltagent-subagents for Claude Code) represents an explicit "one content, two harnesses" philosophy implemented through separate repos rather than an adapter framework.

Relationship to Sister Repo

The most important thing to understand about this repo is its relationship to voltagent-subagents: same catalog structure (categories 1-10 are identical domains), same agent names, same organizing company — but different format, different runtime, and three additional categories (11-13). This is the most explicit same-company dual-harness strategy in this batch.

Sandbox Mode Innovation

The TOML sandbox_mode = "read-only" for reviewers/auditors and sandbox_mode = "workspace-write" for developers/engineers is a security-by-default pattern that Claude Code's format cannot express. This is the most distinctive technical feature of this repo vs any other in the batch.

Observable Failure Modes

  1. No auto-spawn: Users migrating from Claude Code expect auto-activation; Codex requires explicit delegation.
  2. Model name drift: GPT model versions will be deprecated; TOML files with hardcoded model names will break.
  3. Manual sync: No automated mechanism to keep this repo in sync with the Claude Code version — categories 11-13 only exist here; future Claude Code additions may not propagate.

Explicit Antipatterns

None stated. README focuses on "written specifically for Codex" positioning.

04

Workflow

VoltAgent awesome-codex-subagents — Workflow

User Workflow

  1. Copy desired TOML files to ~/.codex/agents/ or .codex/agents/
  2. In Codex: explicitly delegate to the agent in your prompt (Codex does NOT auto-spawn)
  3. Agent executes with specified model and sandbox_mode

Key Difference From Claude Code Version

"Codex does not auto-spawn custom subagents."

This is a critical behavioral difference: Claude Code auto-activates agents based on description trigger phrases; Codex requires explicit delegation in the user's prompt.

Explicit Delegation Pattern

# Codex delegation
> Have the security-auditor review the authentication module
> Use backend-developer to implement the payment API

No Formal Phases

Pure catalog. No workflow phases, no approval gates, no spec-driven process.

Configuration Override

# In .codex/config.toml
[agents]
# per-project agent configuration overrides
06

Memory Context

VoltAgent awesome-codex-subagents — Memory & Context

No persistent state. Each agent operates in Codex's native context window.

No cross-session handoff, no memory files, no external state stores.

The sandbox_mode field in TOML controls filesystem access (read-only vs workspace-write) — this is the closest thing to a context boundary mechanism, but it's security isolation not memory persistence.

07

Orchestration

VoltAgent awesome-codex-subagents — Orchestration

Multi-Agent Support

Yes — category 09 provides orchestration agents. But Codex requires explicit delegation; no auto-spawning.

Orchestration Pattern

Hierarchical (orchestrator agents coordinate specialists). Explicit delegation model means the user or an orchestrator agent must explicitly name which sub-agent to invoke.

Isolation Mechanism

Sandbox mode (TOML field) — read-only or workspace-write. This is a Codex-native filesystem access control mechanism, distinct from Claude Code's process isolation.

Multi-Model

Yes — two model tiers:

  • gpt-5.4: deep reasoning tasks
  • gpt-5.3-codex-spark: fast/lighter tasks

Plus model_reasoning_effort field: high, medium, low — compute budget control.

Execution Mode

Interactive-loop with explicit delegation.

Cross-Tool Portability

Single-tool (Codex CLI only). Paired with voltagent-subagents (Claude Code) as dual-repo strategy.

08

Ui Cli Surface

VoltAgent awesome-codex-subagents — UI / CLI Surface

Dedicated CLI

None. Manual TOML file copy.

Local Dashboard

None.

IDE Integration

Codex CLI only. TOML files placed in .codex/agents/ (project) or ~/.codex/agents/ (global).

Discovery

No automated discovery. User explicitly delegates in Codex prompts.

Related frameworks

same archetype · same primary tool · same memory type

Daytona ★ 72k

Provide secure, elastic, sub-90ms sandbox compute infrastructure for running AI-generated code, accessible via multi-language…

CUA ★ 17k

Unified SDK for building, benchmarking, and deploying agents that interact with full OS GUIs via isolated VMs.

E2B ★ 12k

Run AI-generated code safely in cloud-hosted isolated sandboxes via a 3-line SDK integration.

OpenSandbox ★ 11k

Protocol-first general-purpose sandbox platform for AI applications with multi-language SDKs and pluggable isolation backends.

Microsandbox ★ 6.3k

Spawn hardware-isolated microVMs as child processes directly from application code, with no server setup, in under 100ms.

CubeSandbox ★ 5.9k

Sub-60ms KVM microVM sandboxes for AI agents with E2B drop-in compatibility and <5MB memory overhead.