VoltAgent awesome-codex-subagents

voltagent-codex-subagents · VoltAgent/awesome-codex-subagents · ★ 4.9k · last commit 2026-05-25

Primitive shape 166 total

166

Subagents 166

Summary

VoltAgent awesome-codex-subagents — Summary

VoltAgent awesome-codex-subagents is the OpenAI Codex CLI counterpart to awesome-claude-code-subagents, providing 166+ subagents in TOML format across 13 categories. The TOML format is Codex-native with fields like model, model_reasoning_effort, sandbox_mode, and developer_instructions — distinct from Claude Code's YAML frontmatter + markdown body format. Three additional categories beyond the 10 in the Claude version cover AI governance/safety (5 agents), platform engineering/IDP (5 agents), and LLMOps/evals/observability (5 agents). Smart model routing assigns GPT-5.4 to deep-reasoning tasks (architecture reviews, security audits, financial logic) and GPT-5.3-codex-spark to fast scanning/synthesis. Sandbox mode philosophy distinguishes read-only agents (reviewers, auditors) from workspace-write agents (developers, engineers) at the TOML level — a security-by-default pattern absent from the Claude Code version. Compared to voltagent-subagents, this repo is architecturally identical in catalog philosophy but uses TOML format instead of Markdown and targets a different runtime (Codex CLI instead of Claude Code). The relationship between these two VoltAgent repos is the most explicit example of dual-harness mirroring in this batch.

Overview

VoltAgent awesome-codex-subagents — Overview

Origin

Same maintainer as awesome-claude-code-subagents (VoltAgent). 4,894 stars, last push 2026-05-25. MIT license. "Written specifically for Codex and aligned with the official docs."

Philosophy

From README:

"This repository serves as the definitive collection of Codex Subagents, specialized AI assistants designed for specific development tasks. Written specifically for Codex and aligned with the official docs."

Key differences from Claude Code version:

TOML format (Codex-native) vs Markdown
sandbox_mode field — explicit security boundary per agent
model_reasoning_effort — controls reasoning depth
developer_instructions replaces markdown body
GPT model routing instead of Claude model routing

TOML Format Philosophy

From README:

"Each subagent includes a model field that automatically routes it to the right model — balancing quality and cost."

Sandbox mode:

"Read-only agents (reviewers, auditors): sandbox_mode = 'read-only' — analyze without modifying. Workspace-write agents (developers, engineers): sandbox_mode = 'workspace-write' — create and modify files."

Installation Model

More manual than Claude Code version:

mkdir -p ~/.codex/agents
cp categories/01-core-development/backend-developer.toml ~/.codex/agents/

No interactive installer, no plugin marketplace integration. Codex does NOT auto-spawn custom subagents — requires explicit delegation in prompts.

Three Additional Categories

Categories 11-13 extend beyond the 10 in the Claude Code version:

11-ai-governance-safety — AI safety and governance specialists
12-platform-engineering-idp — Internal Developer Platform engineers
13-llmops-evals-observability — LLMOps, evaluation, and observability specialists

Architecture

VoltAgent awesome-codex-subagents — Architecture

Distribution

Manual copy. No plugin marketplace, no installer script.

# Global agents
mkdir -p ~/.codex/agents
cp categories/01-core-development/backend-developer.toml ~/.codex/agents/

# Project-specific
mkdir -p .codex/agents
cp categories/04-quality-security/reviewer.toml .codex/agents/

Directory Structure

categories/
  01-core-development/         # 12 agents (.toml)
  02-language-specialists/     # 31 agents (.toml)
  03-infrastructure/           # 16 agents (.toml)
  04-quality-security/         # 19 agents (.toml)
  05-data-ai/                  # 13 agents (.toml)
  06-developer-experience/     # 15 agents (.toml)
  07-specialized-domains/      # 13 agents (.toml)
  08-business-product/         # 16 agents (.toml)
  09-meta-orchestration/       # 11 agents (.toml)
  10-research-analysis/        # 9 agents (.toml)
  11-ai-governance-safety/     # 4 agents (.toml)  [NEW vs Claude version]
  12-platform-engineering-idp/ # 4 agents (.toml)  [NEW]
  13-llmops-evals-observability/ # 4 agents (.toml) [NEW]

Agent Format (TOML)

name = "backend-developer"
description = "Use when a task needs scoped backend implementation..."
model = "gpt-5.4"
model_reasoning_effort = "high"
sandbox_mode = "workspace-write"
developer_instructions = """
Own backend changes as production behavior...
"""

Storage Locations

Type	Path	Availability	Precedence
Project Subagents	`.codex/agents/`	Current project	Higher
Global Subagents	`~/.codex/agents/`	All projects	Lower

Project-specific overrides global on naming conflict.

Target AI Tools

OpenAI Codex CLI only.

Required Runtime

Codex CLI (OpenAI).

Components

VoltAgent awesome-codex-subagents — Components

Totals

Type	Count
Subagents (TOML)	166+
Commands	0
Skills	0
Hooks	0

Category Breakdown (166 agents across 13 categories)

Category	Count
01 Core Development	12
02 Language Specialists	31
03 Infrastructure	16
04 Quality & Security	19
05 Data & AI	13
06 Developer Experience	15
07 Specialized Domains	13
08 Business & Product	16
09 Meta & Orchestration	11
10 Research & Analysis	9
11 AI Governance & Safety	4
12 Platform Engineering / IDP	4
13 LLMOps, Evals & Observability	4

Three New Categories (vs Claude Code version)

11 AI Governance & Safety

Agents covering AI policy compliance, model safety auditing, AI ethics, bias detection, and governance frameworks.

12 Platform Engineering / IDP

Internal Developer Platform specialists: platform tooling, self-service infrastructure, developer portals, IDP architecture.

13 LLMOps, Evals & Observability

LLM operations specialists: model evaluation pipelines, observability instrumentation, evals framework design, monitoring for AI systems.

Model Routing

Model	When Used
`gpt-5.4`	Deep reasoning: architecture reviews, security audits, financial logic
`gpt-5.3-codex-spark`	Fast scanning, synthesis, lighter research tasks

Sandbox Modes

Mode	Agents
`read-only`	Reviewers, auditors, analyzers
`workspace-write`	Developers, engineers, implementers

Sample TOML (backend-developer)

name = "backend-developer"
description = "Use when a task needs scoped backend implementation..."
model = "gpt-5.4"
model_reasoning_effort = "high"
sandbox_mode = "workspace-write"
developer_instructions = """
Own backend changes as production behavior with explicit data, auth, and failure-path integrity.

Working mode:
1. Map entry point, domain logic boundary, and persistence side effects.
2. Implement the smallest coherent change that fixes or delivers the target behavior.
3. Validate behavior under normal and high-risk failure paths.

Focus on:
- request/event entry points and service boundary ownership
- input validation and contract-safe output behavior
...

Quality checks:
- validate one critical success path and one high-risk failure path
"""

Representative Agents

Category 01: api-designer, code-mapper (NEW), ui-fixer (NEW), backend-developer, frontend-developer
Category 02: typescript-pro, python-pro, golang-pro, rust-engineer
Category 03: kubernetes-specialist, terraform-engineer, cloud-architect
Category 04: security-auditor, code-reviewer, qa-expert
Category 09: agent-installer, workflow-orchestrator, multi-agent-coordinator

Prompts

VoltAgent awesome-codex-subagents — Prompt Excerpts

Excerpt 1: backend-developer.toml

Technique: Minimal, surgery-focused instructions (contrast with Claude version's checklist)

name = "backend-developer"
description = "Use when a task needs scoped backend implementation or backend bug fixes after the owning path is known."
model = "gpt-5.4"
model_reasoning_effort = "high"
sandbox_mode = "workspace-write"
developer_instructions = """
Own backend changes as production behavior with explicit data, auth, and failure-path integrity.

Working mode:
1. Map entry point, domain logic boundary, and persistence side effects.
2. Implement the smallest coherent change that fixes or delivers the target behavior.
3. Validate behavior under normal and high-risk failure paths.

Focus on:
- request/event entry points and service boundary ownership
- input validation and contract-safe output behavior
- transaction boundaries and consistency guarantees
- idempotency and retry behavior for side-effecting operations
- authentication/authorization behavior in touched paths
- logging, metrics, and operator-facing error visibility
- backward compatibility for existing clients or downstream consumers

Implementation checks:
- avoid hidden side effects in shared helpers
- keep domain logic centralized, not split across adapters/controllers
- preserve existing behavior outside changed scope
- make failure semantics explicit

Quality checks:
- validate one critical success path and one high-risk failure path
"""

Analysis: Significantly more terse and surgical than the Claude Code version of the same agent. "Implement the smallest coherent change" reflects a minimal-footprint philosophy. The description says "after the owning path is known" — presupposes a discovery agent ran first (like code-mapper). Model gpt-5.4 with model_reasoning_effort = "high" shows explicit compute budget control.

Comparison: Same Agent, Two Formats

Dimension	Claude Code (MD)	Codex (TOML)
Format	YAML frontmatter + MD body	Pure TOML
Instructions	Checklists + requirement tables	Terse surgical focus
Model	`model: sonnet`	`model = "gpt-5.4"`
Sandbox	implicit	`sandbox_mode = "workspace-write"`
Auto-spawn	Yes	No — explicit delegation only

Uniqueness

VoltAgent awesome-codex-subagents — Uniqueness & Positioning

Differs From Seeds

No seed targets Codex CLI as primary tool — this is the only Codex-native catalog in this batch. The TOML format with sandbox_mode and model_reasoning_effort fields has no equivalent in any seed framework. The dual-repo strategy (pairing with voltagent-subagents for Claude Code) represents an explicit "one content, two harnesses" philosophy implemented through separate repos rather than an adapter framework.

Relationship to Sister Repo

The most important thing to understand about this repo is its relationship to voltagent-subagents: same catalog structure (categories 1-10 are identical domains), same agent names, same organizing company — but different format, different runtime, and three additional categories (11-13). This is the most explicit same-company dual-harness strategy in this batch.

Sandbox Mode Innovation

The TOML sandbox_mode = "read-only" for reviewers/auditors and sandbox_mode = "workspace-write" for developers/engineers is a security-by-default pattern that Claude Code's format cannot express. This is the most distinctive technical feature of this repo vs any other in the batch.

Observable Failure Modes

No auto-spawn: Users migrating from Claude Code expect auto-activation; Codex requires explicit delegation.
Model name drift: GPT model versions will be deprecated; TOML files with hardcoded model names will break.
Manual sync: No automated mechanism to keep this repo in sync with the Claude Code version — categories 11-13 only exist here; future Claude Code additions may not propagate.

Explicit Antipatterns

None stated. README focuses on "written specifically for Codex" positioning.

Workflow

VoltAgent awesome-codex-subagents — Workflow

User Workflow

Copy desired TOML files to ~/.codex/agents/ or .codex/agents/
In Codex: explicitly delegate to the agent in your prompt (Codex does NOT auto-spawn)
Agent executes with specified model and sandbox_mode

Key Difference From Claude Code Version

"Codex does not auto-spawn custom subagents."

This is a critical behavioral difference: Claude Code auto-activates agents based on description trigger phrases; Codex requires explicit delegation in the user's prompt.

Explicit Delegation Pattern

# Codex delegation
> Have the security-auditor review the authentication module
> Use backend-developer to implement the payment API

No Formal Phases

Pure catalog. No workflow phases, no approval gates, no spec-driven process.

Configuration Override

# In .codex/config.toml
[agents]
# per-project agent configuration overrides

Memory Context

VoltAgent awesome-codex-subagents — Memory & Context

No persistent state. Each agent operates in Codex's native context window.

No cross-session handoff, no memory files, no external state stores.

The sandbox_mode field in TOML controls filesystem access (read-only vs workspace-write) — this is the closest thing to a context boundary mechanism, but it's security isolation not memory persistence.

Orchestration

VoltAgent awesome-codex-subagents — Orchestration

Multi-Agent Support

Yes — category 09 provides orchestration agents. But Codex requires explicit delegation; no auto-spawning.

Orchestration Pattern

Hierarchical (orchestrator agents coordinate specialists). Explicit delegation model means the user or an orchestrator agent must explicitly name which sub-agent to invoke.

Isolation Mechanism

Sandbox mode (TOML field) — read-only or workspace-write. This is a Codex-native filesystem access control mechanism, distinct from Claude Code's process isolation.

Multi-Model

Yes — two model tiers:

gpt-5.4: deep reasoning tasks
gpt-5.3-codex-spark: fast/lighter tasks

Plus model_reasoning_effort field: high, medium, low — compute budget control.

Execution Mode

Interactive-loop with explicit delegation.

Cross-Tool Portability

Single-tool (Codex CLI only). Paired with voltagent-subagents (Claude Code) as dual-repo strategy.

Ui Cli Surface

VoltAgent awesome-codex-subagents — UI / CLI Surface

Dedicated CLI

None. Manual TOML file copy.

Local Dashboard

None.

IDE Integration

Codex CLI only. TOML files placed in .codex/agents/ (project) or ~/.codex/agents/ (global).

Discovery

No automated discovery. User explicitly delegates in Codex prompts.

Related frameworks

same archetype · same primary tool · same memory type

Daytona ★ 72k

A9 Sandbox substrate

Provide secure, elastic, sub-90ms sandbox compute infrastructure for running AI-generated code, accessible via multi-language…

CUA ★ 17k

A9 Sandbox substrate

Unified SDK for building, benchmarking, and deploying agents that interact with full OS GUIs via isolated VMs.

E2B ★ 12k

A9 Sandbox substrate

Run AI-generated code safely in cloud-hosted isolated sandboxes via a 3-line SDK integration.

OpenSandbox ★ 11k

A9 Sandbox substrate

Protocol-first general-purpose sandbox platform for AI applications with multi-language SDKs and pluggable isolation backends.

Microsandbox ★ 6.3k

A9 Sandbox substrate

Spawn hardware-isolated microVMs as child processes directly from application code, with no server setup, in under 100ms.

CubeSandbox ★ 5.9k

A9 Sandbox substrate

Sub-60ms KVM microVM sandboxes for AI agents with E2B drop-in compatibility and <5MB memory overhead.

Distribution

Type: standalone-repo
License: MIT
Install: clone-and-configure
Version: unknown (no semver; last push 2026-05-25)

Surfaces

CLI binary: No
CLI subcmds: 0
Local UI: No

Components

Commands: 0
Skills: 0
Subagents: 166
Hooks: 0
MCP servers: 0
MCP tools: 0
Scripts: 0
Templates: 0

Workflow

Phases: 3
Approval gates: 0
Spec format: none
Spec storage: none
Delta or full: none

Orchestration

Multi-agent: Yes
Pattern: hierarchical
Isolation: sandbox-api
Consensus: none
Prompt chaining: No

Multi-model

Multi-model: Yes
BYOK: No
Modal: text

Execution

Mode: interactive-loop
Crash recovery: No
Compaction: No
Session handoff: No
Streaming: Yes

Memory

Type: none
Persistence: none
Search: none

Quality

TDD: No
TDD mechanism: none
Self-review: none

Git / Observability

Auto commit: No
Auto PR: No
Auto merge: No
Worktree/feat: No
Audit log: No
Audit format: none
Replay: No

Tools

Primary: codex-cli
Targets: 1
Portability: single-tool

Signals

Stars: 4.9k
Last commit: 2026-05-25
Maintainer: active
Quality score: 2.6/10